Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

Bioinformatics

2008

Semantic Provenance

Articles 1 - 2 of 2

Full-Text Articles in Entire DC Network

Semantic Provenance For Escience: Managing The Deluge Of Scientific Data, Satya S. Sahoo, Amit P. Sheth, Cory Andrew Henson Jan 2008

Kno.e.sis Publications

Provenance information in eScience is metadata that's critical to effectively manage the exponentially increasing volumes of scientific data from industrial-scale experiment protocols. Semantic provenance, based on domain-specific provenance ontologies, lets software applications unambiguously interpret data in the correct context. The semantic provenance framework for eScience data comprises expressive provenance information and domain-specific provenance ontologies and applies this information to data management. The authors' "two degrees of separation" approach advocates the creation of high-quality provenance information using specialized services. In contrast to workflow engines generating provenance information as a core functionality, the specialized provenance services are integrated into a scientific workflow …


Ontology Driven Semantic Provenance For Heterogeneous Bionomics Experimental Data, Satya S. Sahoo, Michael L. Raymer, Cory Andrew Henson, Amit P. Sheth, William S. York Jan 2008

Kno.e.sis Publications

Scientific experimental data generated by all the bionomic technologies is characterized by heterogeneity in its representation formats, constituents, and generation processes and, therefore, also in its usage. Using the proteomics domain, we demonstrate the important role of provenance information to manage, interpret, and analyze experimental data. We present a novel approach that employs an ontology as a knowledge model to automatically create semantic provenance information for high-throughput mass spectrometry (MS) data in the glycoproteomics domain. The Semantic Provenance Annotation of Data in protEomics (SPADE) implementation is based on the ProPreO ontology, a large-process ontology (~500 classes, 40 named relationships …