Efficient Detection of Network Motifs
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
A multi-layered approach to protein data integration for diabetes research
Artificial Intelligence in Medicine
IEEE Transactions on Knowledge and Data Engineering
GOSAP: Gene Ontology-Based Semantic Alignment of Biological Pathways
International Journal of Bioinformatics Research and Applications
RAM: Randomized Approximate Graph Mining
SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
GADDI: distance index based subgraph matching in biological networks
Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Frequent subgraph pattern mining on uncertain graph data
Proceedings of the 18th ACM conference on Information and knowledge management
Discovery of frequent graph patterns that consist of the vertices with the complex structures
MCD'07 Proceedings of the 3rd ECML/PKDD international conference on Mining complex data
Protein function prediction based on patterns in biological networks
RECOMB'08 Proceedings of the 12th annual international conference on Research in computational molecular biology
MARGIN: Maximal frequent subgraph mining
ACM Transactions on Knowledge Discovery from Data (TKDD)
Full duplicate candidate pruning for frequent connected subgraph mining
Integrated Computer-Aided Engineering
DESSIN: mining dense subgraph patterns in a single graph
SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
Frequent approximate subgraphs as features for graph-based image classification
Knowledge-Based Systems
SubMAP: aligning metabolic pathways with subnetwork mappings
RECOMB'10 Proceedings of the 14th Annual international conference on Research in Computational Molecular Biology
Pairwise local alignment of protein interaction networks guided by models of evolution
RECOMB'05 Proceedings of the 9th Annual international conference on Research in Computational Molecular Biology
The parameterized complexity of enumerating frequent itemsets
IWPEC'06 Proceedings of the Second international conference on Parameterized and Exact Computation
Biological networks: comparison, conservation, and evolutionary trees
RECOMB'06 Proceedings of the 10th annual international conference on Research in Computational Molecular Biology
Assessing significance of connectivity and conservation in protein interaction networks
RECOMB'06 Proceedings of the 10th annual international conference on Research in Computational Molecular Biology
Analyzing incomplete biological pathways using network motifs
Proceedings of the 27th Annual ACM Symposium on Applied Computing
On commutativity based edge lean search
ICALP'07 Proceedings of the 34th international conference on Automata, Languages and Programming
MFMS: maximal frequent module set mining from multiple human gene expression data sets
Proceedings of the 12th International Workshop on Data Mining in Bioinformatics
Mining spatiotemporal patterns in dynamic plane graphs
Intelligent Data Analysis - Dynamic Networks and Knowledge Discovery
Hi-index | 3.84 |
Motivation: With rapidly increasing amount of network and interaction data in molecular biology, the problem of effectively analyzing this data is an important one. Graph theoretic formalisms, commonly used for these analysis tasks, often lead to computationally hard problems due to their relation with subgraph isomorphism. Results: This paper presents an innovative new algorithm for detecting frequently occurring patterns and modules in biological networks. Using an innovative graph simplification technique, which is ideally suited to biological networks, our algorithm renders these problems computationally tractable. Indeed, we show experimentally that our algorithm can extract frequently occurring patterns in metabolic pathways extracted from the KEGG database within seconds. The proposed model and algorithm are applicable to a variety of biological networks either directly or with minor modifications. Availability: Implementation of the proposed algorithms in the C programming language is available as open source at http://www.cs.purdue.edu/homes/koyuturk/pathway/