An efficient storage model for the SBML documents using object databases

  • Authors:
  • Seung-Hyun Jung;Tae-Sung Jung;Tae-Kyung Kim;Kyoung-Ran Kim;Jae-Soo Yoo;Wan-Sup Cho

  • Affiliations:
  • Dept. of Information Industrial Engineering, Chungbuk National University, Cheongju, Chungbuk, Korea;Dept. of Information Industrial Engineering, Chungbuk National University, Cheongju, Chungbuk, Korea;Dept. of Information Industrial Engineering, Chungbuk National University, Cheongju, Chungbuk, Korea;Dept. of Information Industrial Engineering, Chungbuk National University, Cheongju, Chungbuk, Korea;Dept. of Computer and Communication Engineering, Chungbuk National University, Cheongju, Chungbuk, Korea;Dept. of MIS, Chungbuk National University, Cheongju, Chungbuk, Korea

  • Venue:
  • VDMB'06 Proceedings of the First international conference on Data Mining and Bioinformatics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

As SBML is regarded as a de-facto standard to express the biological network data in systems biology, the amount of the SBML documents is exponentially increasing. We propose an SBML data management system (SMS) on top of an object database. Since the object database supports abundant data types like multi-valued attributes and object references, mapping from the SBML documents into the object database is straightforward. We adopt the event-based SAX parser instead of the DOM parser for dealing with huge SBML documents. Note that DOM parser suffers from excessive memory overhead for the document parsing. For high quality data, SMS supports data cleansing function by using gene ontology. Finally, SMS generates user query results in an SBML format (for data exchange) or in a visual graphs (for intuitive understanding). Real experiments show that our approach is superior to the one using conventional relational databases in the aspects of the modeling capability, storage requirements, and data quality.