YeastHub: a semantic web use case for integrating data in the life sciences domain

  • Authors:
  • Kei-Hoi Cheung;Kevin Y. Yip;Andrew Smith;Remko Deknikker;Andy Masiar;Mark Gerstein

  • Affiliations:
  • Center for Medical Informatics, Yale University New Haven, CT 06520, USA;Department of Computer Science, Yale University New Haven, CT 06520, USA;Department of Computer Science, Yale University New Haven, CT 06520, USA;Center for Medical Informatics, Yale University New Haven, CT 06520, USA;Center for Medical Informatics, Yale University New Haven, CT 06520, USA;Department of Computer Science, Yale University New Haven, CT 06520, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2005

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: As the semantic web technology is maturing and the need for life sciences data integration over the web is growing, it is important to explore how data integration needs can be addressed by the semantic web. The main problem that we face in data integration is a lack of widely-accepted standards for expressing the syntax and semantics of the data. We address this problem by exploring the use of semantic web technologies---including resource description framework (RDF), RDF site summary (RSS), relational-database-to-RDF mapping (D2RQ) and native RDF data repository---to represent, store and query both metadata and data across life sciences datasets. Results: As many biological datasets are presently available in tabular format, we introduce an RDF structure into which they can be converted. Also, we develop a prototype web-based application called YeastHub that demonstrates how a life sciences data warehouse can be built using a native RDF data store (Sesame). This data warehouse allows integration of different types of yeast genome data provided by different resources in different formats including the tabular and RDF formats. Once the data are loaded into the data warehouse, RDF-based queries can be formulated to retrieve and query the data in an integrated fashion. Availability: The YeastHub website is accessible via the following URL: http://yeasthub.gersteinlab.org Contact: kei.cheung@yale.edu