On-the-Fly Integration and Ad Hoc Querying of Life Sciences Databases Using LifeDB

  • Authors:
  • Anupam Bhattacharjee;Aminul Islam;Mohammad Shafkat Amin;Shahriyar Hossain;Shazzad Hosain;Hasan Jamil;Leonard Lipovich

  • Affiliations:
  • Department of Computer Science, Wayne State University, USA;Department of Computer Science, Wayne State University, USA;Department of Computer Science, Wayne State University, USA;Department of Computer Science, Wayne State University, USA;Department of Computer Science, Wayne State University, USA;Department of Computer Science, Wayne State University, USA;Center for Molecular Medicine and Genetics, Wayne State University, USA

  • Venue:
  • DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data intensive applications in Life Sciences extensively use the hidden web as a platform for information sharing. Access to these heterogeneous hidden web resources is limited through the use of predefined web forms and interactive interfaces that users navigate manually, and assume responsibility for reconciling schema heterogeneity, extracting information and piping, transforming formats and so on in order to implement desired query sequences or scientific work flows. In this paper, we present a new data management system, called LifeDB , in which we offer support for currency without view materialization, and autonomous reconciliation of schema heterogeneity in one single platform through a declarative query language called BioFlow . In our approach, schema heterogeneity is resolved at run time by treating the hidden web resources as a virtual warehouses, and by supporting a set of primitives for data integration on-the-fly, extracting information and piping to other resources, and manipulating data in a way similar to traditional database systems to respond to application demands.