Skip to main content
Article
Semantic Provenance for eScience: Managing the Deluge of Scientific Data
IEEE Internet Computing
  • Satya S. Sahoo, Wright State University - Main Campus
  • Amit P. Sheth, Wright State University - Main Campus
  • Cory Andrew Henson, Wright State University - Main Campus
Document Type
Article
Publication Date
1-1-2008
Abstract

Provenance information in eScience is metadata that's critical to effectively manage the exponentially increasing volumes of scientific data from industrial-scale experiment protocols. Semantic provenance, based on domain-specific provenance ontologies, lets software applications unambiguously interpret data in the correct context. The semantic provenance framework for eScience data comprises expressive provenance information and domain-specific provenance ontologies and applies this information to data management. The authors' "two degrees of separation" approach advocates the creation of high-quality provenance information using specialized services. In contrast to workflow engines generating provenance information as a core functionality, the specialized provenance services are integrated into a scientific workflow on demand. This article describes an implementation of the semantic provenance framework for glycoproteomics.

Comments

Posted with permission from IEEE.

DOI
10.1109/MIC.2008.86
Citation Information
Satya S. Sahoo, Amit P. Sheth and Cory Andrew Henson. "Semantic Provenance for eScience: Managing the Deluge of Scientific Data" IEEE Internet Computing Vol. 12 Iss. 4 (2008) p. 46 - 54 ISSN: 10897801
Available at: http://works.bepress.com/amit_sheth/275/