An efficient algorithm for detecting frequent subgraphs in biological networks

  • Authors:
  • Mehmet Koyutürk;Ananth Grama;Wojciech Szpankowski

  • Affiliations:
  • Department of Computer Sciences, Purdue University, West Lafayette, IN 47907, USA;Department of Computer Sciences, Purdue University, West Lafayette, IN 47907, USA;Department of Computer Sciences, Purdue University, West Lafayette, IN 47907, USA

  • Venue:
  • Bioinformatics
  • Year:
  • 2004

Quantified Score

Hi-index 3.84

Visualization

Abstract

Motivation: With rapidly increasing amount of network and interaction data in molecular biology, the problem of effectively analyzing this data is an important one. Graph theoretic formalisms, commonly used for these analysis tasks, often lead to computationally hard problems due to their relation with subgraph isomorphism. Results: This paper presents an innovative new algorithm for detecting frequently occurring patterns and modules in biological networks. Using an innovative graph simplification technique, which is ideally suited to biological networks, our algorithm renders these problems computationally tractable. Indeed, we show experimentally that our algorithm can extract frequently occurring patterns in metabolic pathways extracted from the KEGG database within seconds. The proposed model and algorithm are applicable to a variety of biological networks either directly or with minor modifications. Availability: Implementation of the proposed algorithms in the C programming language is available as open source at http://www.cs.purdue.edu/homes/koyuturk/pathway/