Open Access. Powered by Scholars. Published by Universities.®

Science and Technology Studies Commons

Open Access. Powered by Scholars. Published by Universities.®

Articles 1 - 30 of 30

Full-Text Articles in Science and Technology Studies

Research In Semantic Web And Information Retrieval: Trust, Sensors, And Search, Krishnaprasad Thirunarayan Dec 2009

Research In Semantic Web And Information Retrieval: Trust, Sensors, And Search, Krishnaprasad Thirunarayan

Kno.e.sis Publications

No abstract provided.


A Contrast Pattern Based Clustering Quality Index For Categorical Data, Qingbao Liu, Guozhu Dong Dec 2009

A Contrast Pattern Based Clustering Quality Index For Categorical Data, Qingbao Liu, Guozhu Dong

Kno.e.sis Publications

Since clustering is unsupervised and highly explorative, clustering validation (i.e. assessing the quality of clustering solutions) has been an important and long standing research problem. Existing validity measures have significant shortcomings. This paper proposes a novel contrast pattern based clustering quality index (CPCQ) for categorical data, by utilizing the quality and diversity of the contrast patterns (CPs) which contrast the clusters in clusterings. High quality CPs can characterize clusters and discriminate them against each other. Experiments show that the CPCQ index (1) can recognize that expert-determined classes are the best clusters for many datasets from the UCI repository; (2) does …


Sparql Query Re-Writing For Spatial Datasets Using Partonomy Based Transformation Rules, Prateek Jain, Cory Andrew Henson, Amit P. Sheth, Peter Z. Yeh, Kunal Verma Dec 2009

Sparql Query Re-Writing For Spatial Datasets Using Partonomy Based Transformation Rules, Prateek Jain, Cory Andrew Henson, Amit P. Sheth, Peter Z. Yeh, Kunal Verma

Kno.e.sis Publications

Often the information present in a spatial knowledge base is represented at a different level of granularity and abstraction than the query constraints. For querying ontology’s containing spatial information, the precise relationships between spatial entities has to be specified in the basic graph pattern of SPARQL query which can result in long and complex queries. We present a novel approach to help users intuitively write SPARQL queries to query spatial data, rather than relying on knowledge of the ontology structure. Our framework re-writes queries, using transformation rules to exploit part-whole relations between geographical entities to address the mismatches between query …


A Local Qualitative Approach To Referral And Functional Trust, Krishnaprasad Thirunarayan, Dharan Althuru, Cory Andrew Henson, Amit P. Sheth Dec 2009

A Local Qualitative Approach To Referral And Functional Trust, Krishnaprasad Thirunarayan, Dharan Althuru, Cory Andrew Henson, Amit P. Sheth

Kno.e.sis Publications

Trust and confidence are becoming key issues in diverse applications such as ecommerce, social networks, semantic sensor web, semantic web information retrieval systems, etc. Both humans and machines use some form of trust to make informed and reliable decisions before acting. In this work, we briefly review existing work on trust networks, pointing out some of its drawbacks. We then propose a local framework to explore two different kinds of trust among agents called referral trust and functional trust, that are modelled using local partial orders, to enable qualitative trust personalization. The proposed approach formalizes reasoning with trust, distinguishing between …


Ontology-Driven Provenance Management In Escience: An Application In Parasite Research, Satya S. Sahoo, D. Brent Weatherly, Raghava Mutharaju, Pramod Anantharam, Amit P. Sheth, Rick L. Tarleton Nov 2009

Ontology-Driven Provenance Management In Escience: An Application In Parasite Research, Satya S. Sahoo, D. Brent Weatherly, Raghava Mutharaju, Pramod Anantharam, Amit P. Sheth, Rick L. Tarleton

Kno.e.sis Publications

Provenance, from the French word “provenir”, describes the lineage or history of a data entity. Provenance is critical information in scientific applications to verify experiment process, validate data quality and associate trust values with scientific results. Current industrial scale eScience projects require an end-to-end provenance management infrastructure. This infrastructure needs to be underpinned by formal semantics to enable analysis of large scale provenance information by software applications. Further, effective analysis of provenance information requires well-defined query mechanisms to support complex queries over large datasets. This paper introduces an ontology-driven provenance management infrastructure for biology experiment data, as part …


A Survey Of The Semantic Specification Of Sensors, Michael Compton, Cory Andrew Henson, Laurent Lefort, Holger Neuhaus, Amit P. Sheth Oct 2009

A Survey Of The Semantic Specification Of Sensors, Michael Compton, Cory Andrew Henson, Laurent Lefort, Holger Neuhaus, Amit P. Sheth

Kno.e.sis Publications

Semantic sensor networks use declarative descriptions of sensors promote reuse and integration, and to help solve the difficulties of installing, querying and maintaining complex, heterogeneous sensor networks. This paper reviews the state of the art for the semantic specification of sensors, one of the fundamental technologies in the semantic sensor network vision. Twelve sensor ontologies are reviewed and analysed for the range and expressive power of their concepts. The reasoning and search technology developed in conjunction with these ontologies is also reviewed, as is technology for annotating OGC standards with links to ontologies. Sensor concepts that cannot be expressed accurately …


Provenir Ontology: Towards A Framework For Escience Provenance Management, Satya S. Sahoo, Amit P. Sheth Oct 2009

Provenir Ontology: Towards A Framework For Escience Provenance Management, Satya S. Sahoo, Amit P. Sheth

Kno.e.sis Publications

Provenance metadata describes the 'lineage' or history of an entity and necessary information to verify the quality of data, validate experiment protocols, and associate trust value with scientific results. eScience projects generate data and the associated provenance metadata in a distributed environment (such as myGrid) and on a very large scale that often precludes manual analysis. Given this scenario, provenance information should be, (a) interoperable across projects, research groups, and application domains, and (b) support analysis over large datasets using reasoning to discover implicit information. In this paper, we introduce an ontology-driven framework for eScience provenance management underpinned by an …


Ibm Altocumulus: A Cross-Cloud Middleware And Platform, E. Michael Maximilien, Ajith Harshana Ranabahu, Roy Engehausen, Laura Anderson Oct 2009

Ibm Altocumulus: A Cross-Cloud Middleware And Platform, E. Michael Maximilien, Ajith Harshana Ranabahu, Roy Engehausen, Laura Anderson

Kno.e.sis Publications

Cloud computing has become the new face of computing and promises to offer virtually unlimited, cheap, readily available, "utility type" computing resources. Many vendors have entered this market with different offerings ranging from infrastructure-as-a-service such as Amazon, to fully functional platform services such as Google App Engine. However, as a result of this heterogeneity, deploying applications to a cloud and managing them needs to be done using vendor specific methods. This "lock in" is seen as a major hurdle in adopting cloud technologies to the enterprise. IBM Altocumulus, the cloud middleware platform from IBM Almaden Services Research, aims to solve …


A Best Practice Model For Cloud Middleware Systems, Ajith Harshana Ranabahu, E. Michael Maximilien Oct 2009

A Best Practice Model For Cloud Middleware Systems, Ajith Harshana Ranabahu, E. Michael Maximilien

Kno.e.sis Publications

Cloud computing is the latest trend in computing where the intention is to facilitate cheap, utility type computing resources in a service-oriented manner. However, the cloud landscape is still maturing and there are heterogeneities between the clouds, ranging from the application development paradigms to their service interfaces,and scaling approaches. These differences hinder the adoption of cloud by major enterprises. We believe that a cloud middleware can solve most of these issues to allow cross-cloud inter-operation. Our proposed system is Altocumulus, a cloud middleware that homogenizes the clouds. In order to provide the best use of the cloud resources and make …


Context And Domain Knowledge Enhanced Entity Spotting In Informal Text, Daniel Gruhl, Meena Nagarajan, Jan Pieper, Christine Robson, Amit P. Sheth Oct 2009

Context And Domain Knowledge Enhanced Entity Spotting In Informal Text, Daniel Gruhl, Meena Nagarajan, Jan Pieper, Christine Robson, Amit P. Sheth

Kno.e.sis Publications

This paper explores the application of restricted relationship graphs (RDF) and statistical NLP techniques to improve named entity annotation in challenging Informal English domains. We validate our approach using on-line forums discussing popular music. Named entity annotation is particularly difficult in this domain because it is characterized by a large number of ambiguous entities, such as the Madonna album “Music” or Lilly Allen’s pop hit “Smile”.

We evaluate improvements in annotation accuracy that can be obtained by restricting the set of possible entities using real-world constraints. We find that constrained domain entity extraction raises the annotation accuracy significantly, making an …


Context Is Highly Contextual!, Amit P. Sheth Sep 2009

Context Is Highly Contextual!, Amit P. Sheth

Kno.e.sis Publications

No abstract provided.


Tableau Algorithm For Concept Satisfiability In Description Logic Alch, Satya S. Sahoo, Krishnaprasad Thirunarayan Jul 2009

Tableau Algorithm For Concept Satisfiability In Description Logic Alch, Satya S. Sahoo, Krishnaprasad Thirunarayan

Kno.e.sis Publications

The provenir ontology is an upper-level ontology to facilitate interoperability of provenance information in scientific applications. The description logic (DL) expressivity of provenir ontology is ALCH, that is, it models role hierarchies (H) (without transitive roles and inverse roles). Even though the complexity results for concept satisfiability for numerous variants of DL such as ALC with transitively closed roles (ALCR+ also called S), inverse roles SI, and role hierarchy SHI have been well-established, similar results for ALCH has been surprisingly missing from the literature. Here, we show that the complexity of the concept satisfiability problem for the ALCH variant …


“Best K”: Critical Clustering Structures In Categorical Datasets, Keke Chen, Ling Liu Jul 2009

“Best K”: Critical Clustering Structures In Categorical Datasets, Keke Chen, Ling Liu

Kno.e.sis Publications

The demand on cluster analysis for categorical data continues to grow over the last decade. A well-known problem in categorical clustering is to determine the best K number of clusters. Although several categorical clustering algorithms have been developed, surprisingly, none has satisfactorily addressed the problem of best K for categorical clustering. Since categorical data does not have an inherent distance function as the similarity measure, traditional cluster validation techniques based on geometric shapes and density distributions are not appropriate for categorical data. In this paper, we study the entropy property between the clustering results of categorical data with different K …


Ontology Supported Knowledge Discovery In The Field Of Human Performance And Cognition, Christopher Thomas, Pablo N. Mendes, Delroy H. Cameron, Amit P. Sheth, Krishnaprasad Thirunarayan, Cartic Ramakrishnan Jun 2009

Ontology Supported Knowledge Discovery In The Field Of Human Performance And Cognition, Christopher Thomas, Pablo N. Mendes, Delroy H. Cameron, Amit P. Sheth, Krishnaprasad Thirunarayan, Cartic Ramakrishnan

Kno.e.sis Publications

No abstract provided.


Extending Sparql To Support Spatially And Temporally Related Information, Prateek Jain, Amit P. Sheth, Peter Z. Yeh, Kunal Verma Jun 2009

Extending Sparql To Support Spatially And Temporally Related Information, Prateek Jain, Amit P. Sheth, Peter Z. Yeh, Kunal Verma

Kno.e.sis Publications

No abstract provided.


Analysis And Monetization Of Social Data, Amit P. Sheth Jun 2009

Analysis And Monetization Of Social Data, Amit P. Sheth

Kno.e.sis Publications

No abstract provided.


An Ontological Representation Of Time Series Observations On The Semantic Sensor Web, Cory Andrew Henson, Holger Neuhaus, Amit P. Sheth, Krishnaprasad Thirunarayan, Rajkumar Buyya Jun 2009

An Ontological Representation Of Time Series Observations On The Semantic Sensor Web, Cory Andrew Henson, Holger Neuhaus, Amit P. Sheth, Krishnaprasad Thirunarayan, Rajkumar Buyya

Kno.e.sis Publications

Time series observations are a common method of collecting sensor data. The Open Geospatial Consortium (OGC) Sensor Web Enablement (SWE) provides a standard representation for time series observations within the Observations and Measurements language, and therefore is in heavy use on the Sensor Web. By providing a common model, Observations and Measurements (O&M) facilitates syntax-level integration, but lacks the ability to facilitate semantic-level integration. This inability can cause problems with interoperability between disparate sensor networks that may have subtle variations in their sensing methods. An ontological representation of time series observations could provide a more expressive model and resolve problems …


Trykipedia: Collaborative Bio-Ontology Development Using Wiki Environment, Pramod Anantharam, Satya S. Sahoo, Brent Weatherly, Flora Logan, Raghava Mutharaju, Amit P. Sheth, Rick L. Tarleton Jun 2009

Trykipedia: Collaborative Bio-Ontology Development Using Wiki Environment, Pramod Anantharam, Satya S. Sahoo, Brent Weatherly, Flora Logan, Raghava Mutharaju, Amit P. Sheth, Rick L. Tarleton

Kno.e.sis Publications

Biomedical ontology development is an intensely collaborative process between biology experts and computer scientists. With the proliferation of ontology based approach to solve informatics problems in biological domain, there is a need for collaborative environment that is intuitive and widely accepted for modeling the ontology.


Situation Awareness Via Abductive Reasoning For Semantic Sensor Data: A Preliminary Report, Krishnaprasad Thirunarayan, Cory Andrew Henson, Amit P. Sheth May 2009

Situation Awareness Via Abductive Reasoning For Semantic Sensor Data: A Preliminary Report, Krishnaprasad Thirunarayan, Cory Andrew Henson, Amit P. Sheth

Kno.e.sis Publications

Semantic sensor Web enhances raw sensor data with spatial, temporal, and thematic annotations to enable high-level reasoning. In this paper, we explore how abductive reasoning framework can benefit formalization and interpretation of sensor data to garner situation awareness. Specifically, we show how abductive logic programming techniques, in conjunction with symbolic knowledge rules, can be used to detect inconsistent sensor data and to generate human accessible description of the state of the world from consistent subset of the sensor data. We also show how trust/belief information can be incorporated into the interpreter to enhance reliability. For concreteness, we formalize weather domain …


Semsos: Semantic Sensor Observation Service, Cory Andrew Henson, Josh Pschorr, Amit P. Sheth, Krishnaprasad Thirunarayan May 2009

Semsos: Semantic Sensor Observation Service, Cory Andrew Henson, Josh Pschorr, Amit P. Sheth, Krishnaprasad Thirunarayan

Kno.e.sis Publications

Sensor observation service (SOS) is a Web service specification defined by the Open Geospatial Consortium (OGC) Sensor Web Enablement (SWE) group in order to standardize the way sensors and sensor data are discovered and accessed on the Web. This standard goes a long way in providing interoperability between repositories of heterogeneous sensor data and applications that use this data. Many of these applications, however, are ill equipped at handling raw sensor data as provided by SOS and require actionable knowledge of the environment in order to be practically useful. There are two approaches to deal with this obstacle, make the …


Semantic Integration Of Citizen Sensor Data And Multilevel Sensing: A Comprehensive Path Towards Event Monitoring And Situational Awareness, Amit P. Sheth Feb 2009

Semantic Integration Of Citizen Sensor Data And Multilevel Sensing: A Comprehensive Path Towards Event Monitoring And Situational Awareness, Amit P. Sheth

Kno.e.sis Publications

No abstract provided.


Computing For Human Experience: Sensors, Perception, Semantics, Web N.0, And Beyond, Amit P. Sheth Feb 2009

Computing For Human Experience: Sensors, Perception, Semantics, Web N.0, And Beyond, Amit P. Sheth

Kno.e.sis Publications

Traditionally there has been a strong separation between computing and human activities in the real world. The approach has largely been that of mapping the complexity and richness of the real world to constrained computer models and languages for more efficient computation, and then transferring the results for use in the real world. I think the time is ripe to reverse the situation, for computing and communication to transparently enrich and enhance human experience. Today, devices enable something more than a 'human instructs machine' paradigm. We are seeing computing and communication engage transparently in human activities by enriching them in …


Why Gujarat Needs Much Better Higher Education & Research To Succeed In Knowledge Economy & What We Can Do About It?, Amit P. Sheth, Kamlesh Lulla, Sanjay Chaudhary Jan 2009

Why Gujarat Needs Much Better Higher Education & Research To Succeed In Knowledge Economy & What We Can Do About It?, Amit P. Sheth, Kamlesh Lulla, Sanjay Chaudhary

Kno.e.sis Publications

This white paper distills the deliberations on the role of higher education and research as a key enabler of a Knowledge based Society. In particular it discusses (a) the importance of higher quality PhDs for building a knowledge society, (b) the initiatives and progress in competing economies in higher education and research, (c) where Gujarat stands in comparison, and (d) some recommendations on what Gujarat can do to enable timely progress towards building a knowledge based society and economy. These deliberations were conducted in conjunction with the International Conference on 'Reconnecting Gujarati Diaspora with its Homeland: Contribution to its Development …


Semantics-Empowered Social Computing, Amit P. Sheth, Meenakshi Nagarajan Jan 2009

Semantics-Empowered Social Computing, Amit P. Sheth, Meenakshi Nagarajan

Kno.e.sis Publications

In this article, we discuss some of the challenges in marking-up or annotating UGC, a first step toward the realization of the social semantic Web. Using examples from real- world UGC, we show how domain knowledge can effectively complement statistical natural language processing techniques for metadata creation.


Prom: A Semantic Web Framework For Provenance Management In Science, Satya S. Sahoo, Roger Barga, Amit P. Sheth, Krishnaprasad Thirunarayan, Pascal Hitzler Jan 2009

Prom: A Semantic Web Framework For Provenance Management In Science, Satya S. Sahoo, Roger Barga, Amit P. Sheth, Krishnaprasad Thirunarayan, Pascal Hitzler

Kno.e.sis Publications

The eScience paradigm is enabling researchers to collaborate over the Web in virtual laboratories and conduct experiments on an industrial scale. But, the inherent variability in the quality and trust associated with eScience resources necessitates the use of provenance information describing the origin of an entity. Existing systems often model provenance using ambiguous terminology, have poor domain semantics and include modeling inconsistencies that hinders interoperability. Further, mere collection of provenance information is of little value without a well-defined and scalable query mechanism.

In this paper, we present 'PrOM', a framework that addresses both the modeling and querying issues in eScience …


Citizen Sensing, Social Signals, And Enriching Human Experience, Amit P. Sheth Jan 2009

Citizen Sensing, Social Signals, And Enriching Human Experience, Amit P. Sheth

Kno.e.sis Publications

In this article, I introduce the exciting paradigm of citizen sensing enabled by mobile sensors and human computing - that is, humans as citizens on the ubiquitous Web, acting as sensors and sharing their observations and views using mobile devices and Web 2.0 services.


Service Level Agreement In Cloud Computing, Pankesh Patel, Ajith H. Ranabahu, Amit P. Sheth Jan 2009

Service Level Agreement In Cloud Computing, Pankesh Patel, Ajith H. Ranabahu, Amit P. Sheth

Kno.e.sis Publications

Cloud computing that provides cheap and pay-as-you-go computing resources is rapidly gaining momentum as an alternative to traditional IT Infrastructure. As more and more consumers delegate their tasks to cloud providers, Service Level Agreements(SLA) between consumers and providers emerge as a key aspect. Due to the dynamic nature of the cloud, continuous monitoring on Quality of Service (QoS) attributes is necessary to enforce SLAs. Also numerous other factors such as trust (on the cloud provider) come into consideration, particularly for enterprise customers that may outsource its critical data. This complex nature of the cloud landscape warrants a sophisticated means of …


Characterization Of 1h Nmr Spectroscopic Data And The Generation Of Synthetic Validation Sets, Paul E. Anderson, Michael L. Raymer, Benjamin J. Kelly, Nicholas V. Reo, Nicholas J. Delraso, Travis E. Doom Jan 2009

Characterization Of 1h Nmr Spectroscopic Data And The Generation Of Synthetic Validation Sets, Paul E. Anderson, Michael L. Raymer, Benjamin J. Kelly, Nicholas V. Reo, Nicholas J. Delraso, Travis E. Doom

Kno.e.sis Publications

Motivation: Common contemporary practice within the nuclear magnetic resonance (NMR) metabolomics community is to evaluate and validate novel algorithms on empirical data or simplified simulated data. Empirical data captures the complex characteristics of experimental data, but the optimal or most correct analysis is unknown a priori; therefore, researchers are forced to rely on indirect performance metrics, which are of limited value. In order to achieve fair and complete analysis of competing techniques more exacting metrics are required. Thus, metabolomics researchers often evaluate their algorithms on simplified simulated data with a known answer. Unfortunately, the conclusions obtained on simulated data are …


User-Generated Content On Social Media Challenges, Opportunities, Meenakshi Nagarajan Jan 2009

User-Generated Content On Social Media Challenges, Opportunities, Meenakshi Nagarajan

Kno.e.sis Publications

Understanding and exploiting user generated (textual) content (UGC) on social media is at the forefront of information management challenges today. The variety of UGC in detailed blog commentaries, collaborative wiki-content, online conversations, short messages in micro-blogs etc., are powering several personalization, monetization, crowd/business intelligence applications, and also providing an electronic microscope on social phenomena at an extraordinary scale. Certain characteristics of UGC however, necessitate key computational linguistic interventions before systems can tap into this data. A large portion of language found on social media is in the Informal English domain a blend of abbreviations, slang and context dependent terms delivered …


Determining The Best K For Clustering Transactional Datasets: A Coverage Density-Based Approach, Hua Yan, Keke Chen, Ling Liu Jan 2009

Determining The Best K For Clustering Transactional Datasets: A Coverage Density-Based Approach, Hua Yan, Keke Chen, Ling Liu

Kno.e.sis Publications

The problem of determining the optimal number of clusters is important but mysterious in cluster analysis. In this paper, we propose a novel method to find a set of candidate optimal number Ks of clusters in transactional datasets. Concretely, we propose Transactional-cluster-modes Dissimilarity based on the concept of coverage density as an intuitive transactional inter-cluster dissimilarity measure. Based on the above measure, an agglomerative hierachical clustering algorithm is developed and the Merge Dissimilarity Indexes, which are generated in hierachical cluster merging processes, are used to find the candidate optimal number Ks of clusters of transactional data. Our experimental results on …