Graph-based data mining for biological applications

Authors:
Leander Schietgat
Affiliations:
Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan 200A, 3001 Leuven, Belgium. E-mail: leander.schietgat@cs.kuleuven.be
Venue:
AI Communications
Year:
2011

Citing 3
Cited 0

Decision trees for hierarchical multi-label classification

Machine Learning
An Efficiently Computable Graph-Based Metric for the Classification of Small Molecules

DS '08 Proceedings of the 11th International Conference on Discovery Science
Effective feature construction by maximum common subgraph sampling

Machine Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

In many real-world problems, one deals with input or output data that are structured. This thesis investigates the use of graphs as a representation for structured data and introduces relational learning techniques that can efficiently process them. We apply the techniques to two biological problems. On the one hand, we use decision trees to predict the functions of genes, of which the hierarchical relationships can be structured as a graph. On the other hand, we predict chemical activity of molecules by representing them as graphs. We show that, by exploiting graph properties, efficient learning techniques can be developed. It turns out that in both cases, the relational models are not only learned more efficiently, but their predictive performance significantly improves as well.