Application of Graph-based Data Mining to Metabolic Pathways

  • Authors:
  • Chang Hun You;Lawrence B. Holder;Diane J. Cook

  • Affiliations:
  • Washington State University;Washington State University;Washington State University

  • Venue:
  • ICDMW '06 Proceedings of the Sixth IEEE International Conference on Data Mining - Workshops
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a method for finding biologically meaningful patterns on metabolic pathways using the SUBDUE graph-based relational learning system. A huge amount of biological data that has been generated by long-term research encourages us to move our focus to a systems-level understanding of bio-systems. A biological network, containing various biomolecules and their relationships, is a fundamental way to describe bio-systems. Multi-relational data mining finds the relational patterns in both the entity attributes and relations in the data. A graph consisting of vertices and edges between these vertices is a natural data structure to represent biological networks. This paper presents a graph representation of metabolic pathways to contain all features, and describes the application of graph-based relational learning algorithms in both supervised and unsupervised scenarios. Supervised learning finds the unique substructures in a specific type of pathway, which help us understand better how pathways differ. Unsupervised learning shows hierarchical clusters that describe the common substructures in a specific type of pathway, which allow us to better understand the common features in pathways.