Structural similarity mining in semi-structured microarray data for efficient storage construction

  • Authors:
  • Jongil Jeong;Dongil Shin;Chulho Cho;Dongkyoo Shin

  • Affiliations:
  • Department of Computer Science and Engineering, Sejong University, Seoul, Korea;Department of Computer Science and Engineering, Sejong University, Seoul, Korea;College of Business Administration, Kyung Hee University, Seoul, Korea;Department of Computer Science and Engineering, Sejong University, Seoul, Korea

  • Venue:
  • OTM'06 Proceedings of the 2006 international conference on On the Move to Meaningful Internet Systems: AWeSOMe, CAMS, COMINF, IS, KSinBIT, MIOS-CIAO, MONET - Volume Part I
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many researches related to storing XML data have been performed and some of them proposed methods to improve the performance of databases by reducing the joins between tables Those methods are very efficient in deriving and optimizing tables from a DTD or XML schema in which elements and attributes are defined Nevertheless, those methods are not effective in an XML schema for biological information such as microarray data because even though microarray data have complex hierarchies just a few core values of microarray data repeatedly appear in the hierarchies In this paper, we propose a new algorithm to extract core features which is repeatedly occurs in an XML schema for biological information, and elucidate how to improve classification speed and efficiency by using a decision tree rather than pattern matching in classifying structural similarities We designed a database for storing biological information using features extracted by our algorithm By experimentation, we showed that the proposed classification algorithm also reduced the number of joins between tables.