Relational subgroup discovery for descriptive analysis of microarray data

  • Authors:
  • Igor Trajkovski;Filip Železný;Jakub Tolar;Nada Lavrač

  • Affiliations:
  • Department of Knowledge Technologies, Jozef Stefan Institute, Ljubljana, Slovenia;Department of Cybernetics, Czech Technical University in Prague, Praha 6, Czech Republic;Department of Pediatrics, University of Minnesota Medical School, Minneapolis;Department of Knowledge Technologies, Jozef Stefan Institute, Ljubljana, Slovenia

  • Venue:
  • CompLife'06 Proceedings of the Second international conference on Computational Life Sciences
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a method that uses gene ontologies, together with the paradigm of relational subgroup discovery, to help find description of groups of genes differentially expressed in specific cancers. The descriptions are represented by means of relational features, extracted from gene ontology information, and are straightforwardly interpretable by the medical experts. We applied the proposed method to two known data sets: acute lymphoblastic leukemia (ALL) vs. acute myeloid leukemia and classification of fourteen types of cancer. Significant number of discovered groups of genes had a description, confirmed by the medical expert, which highlighted the underlying biological process that is responsible for distinguishing one class from the other classes. We view our methodology not just as a prototypical example of applying sophisticated machine learning algorithms to microarray data, but also as a motivation for developing more sophisticated functional annotations and ontologies, that can be processed by such learning algorithms.