A data warehouse approach to semantic integration of pseudomonas data

  • Authors:
  • Kamar Marrakchi;Abdelaali Briache;Amine Kerzazi;Ismael Navas-Delgado;José Francisco Aldana-Montes;Mohamed Ettayebi;Khalid Lairini;Badr Din Rossi Hassani

  • Affiliations:
  • Department of Biology, Faculty of Sciences and Techniques, University Abdelmalek Essaâdi, Tangier, Morocco;Department of Biology, Faculty of Sciences and Techniques, University Abdelmalek Essaâdi, Tangier, Morocco;Department of Computer Languages and Computing Science, Higher Technical School of Computer Science Engineering University of Málaga, Malaga, Spain;Department of Computer Languages and Computing Science, Higher Technical School of Computer Science Engineering University of Málaga, Malaga, Spain;Department of Computer Languages and Computing Science, Higher Technical School of Computer Science Engineering University of Málaga, Malaga, Spain;Department of Biology, Faculty of Sciences Dhar-Mahraz, Sidi Med Ben Abdellah University, Atlas, Fez, morocco;Department of Biology, Faculty of Sciences and Techniques, University Abdelmalek Essaâdi, Tangier, Morocco;Department of Biology, Faculty of Sciences and Techniques, University Abdelmalek Essaâdi, Tangier, Morocco

  • Venue:
  • DILS'10 Proceedings of the 7th international conference on Data integration in the life sciences
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Biological research and development are routinely producing terabytes of data that need to be organized, queried and reduced to useful scientific knowledge. Even though data integration can provide solutions to such biological problems, it is often problematic due to the sources' heterogeneity and their semantic and structural diversity. Moreover, necessary updates of both structure and content of databases provide further challenges for an integration process. We present a new biological data warehouse for Pseudomonas species "PseudomonasDW" to integrate annotation and pathway data from highly different resources. The combination of knowledge from multiple disciplines and sources should advance the understanding of cellular processes and lead to the prediction of cellular behavior in its entirety. The key aspect of our approach is the combination of a materialized and a virtual data integration to exploit their advantages in a new hybrid approach. The data are extracted from the original data sources using SB-KOM (System Biology Khaos Ontology-based Mediator) and then stored locally in the data warehouse to ensure a fast performance and data consistency.