Mining coherent dense subgraphs across massive biological networks for functional discovery

Authors:
Haiyan Hu;Xifeng Yan;Yu Huang;Jiawei Han;Xianghong Jasmine Zhou
Affiliations:
Program in Molecular and Computational Biology, University of Southern California Los Angeles, CA 90089, USA;Department of Computer Science, University of Illinois at Urbana-Champaign Urbana, IL 61801, USA;Program in Molecular and Computational Biology, University of Southern California Los Angeles, CA 90089, USA;Department of Computer Science, University of Illinois at Urbana-Champaign Urbana, IL 61801, USA;Program in Molecular and Computational Biology, University of Southern California Los Angeles, CA 90089, USA
Venue:
Bioinformatics
Year:
2005

Citing 0
Cited 38

Coherent closed quasi-clique discovery from large dense graph databases

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Out-of-core coherent closed quasi-clique mining from large dense graph databases

ACM Transactions on Database Systems (TODS)
Dynamical Systems for Discovering Protein Complexes and Functional Modules from Biological Networks

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Recovering temporally rewiring networks: a model-based approach

Proceedings of the 24th international conference on Machine learning
Maximal Biclique Subgraphs and Closed Pattern Pairs of the Adjacency Matrix: A One-to-One Correspondence and Mining Algorithms

IEEE Transactions on Knowledge and Data Engineering
An algorithm for modularization of MAPK and calcium signaling pathways: Comparative analysis among different species

Journal of Biomedical Informatics
A Extraction Method of Overlapping Cluster Based on Network Structure Analysis

WI-IATW '07 Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops
Efficient mining of frequent XML query patterns with repeating-siblings

Information and Software Technology
CSV: visualizing and mining cohesive subgraphs

Proceedings of the 2008 ACM SIGMOD international conference on Management of data
SkyGraph: an algorithm for important subgraph discovery in relational graphs

Data Mining and Knowledge Discovery
Effective Pruning Techniques for Mining Quasi-Cliques

ECML PKDD '08 Proceedings of the European conference on Machine Learning and Knowledge Discovery in Databases - Part II
FOGGER: an algorithm for graph generator discovery

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Multi-way set enumeration in real-valued tensors

Proceedings of the 2nd Workshop on Data Mining using Matrices and Tensors
Mining the Largest Dense Vertexlet in a Weighted Scale-free Graph

Fundamenta Informaticae
Protein Structure Classification Based on Conserved Hydrophobic Residues

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Comparing stars: on approximating graph edit distance

Proceedings of the VLDB Endowment
Deterministic graph-theoretic algorithm for detecting modules in biological interaction networks

International Journal of Bioinformatics Research and Applications
Network legos: building blocks of cellular wiring diagrams

RECOMB'07 Proceedings of the 11th annual international conference on Research in computational molecular biology
Comparison of protein-protein interaction confidence assignment schemes

RECOMB'05 Proceedings of the 2005 joint annual satellite conference on Systems biology and regulatory genomics
An efficient algorithm for enumerating pseudo cliques

ISAAC'07 Proceedings of the 18th international conference on Algorithms and computation
Protein function prediction based on patterns in biological networks

RECOMB'08 Proceedings of the 12th annual international conference on Research in computational molecular biology
An integrative network approach to map the transcriptome to the phenome

RECOMB'08 Proceedings of the 12th annual international conference on Research in computational molecular biology
Developing query patterns

ECDL'09 Proceedings of the 13th European conference on Research and advanced technology for digital libraries
Predicting prognostic markers for glioma using gene co-expression network analysis

Proceedings of the First ACM International Conference on Bioinformatics and Computational Biology
DESSIN: mining dense subgraph patterns in a single graph

SSDBM'10 Proceedings of the 22nd international conference on Scientific and statistical database management
On triangulation-based dense neighborhood graph discovery

Proceedings of the VLDB Endowment
Assessing significance of connectivity and conservation in protein interaction networks

RECOMB'06 Proceedings of the 10th annual international conference on Research in Computational Molecular Biology
Dense Neighborhoods on Affinity Graph

International Journal of Computer Vision
Survey: Graph clustering

Computer Science Review
Discovery of top-k dense subgraphs in dynamic graph collections

SSDBM'12 Proceedings of the 24th international conference on Scientific and Statistical Database Management
mDBN: motif based learning of gene regulatory networks using dynamic bayesian networks

Proceedings of the 15th annual conference on Genetic and evolutionary computation
A Polynomial Time Algorithm for Rayleigh Ratio on Discrete Variables: Replacing Spectral Techniques for Expander Ratio, Normalized Cut, and Cheeger Constant

Operations Research
A multiobjective evolutionary programming framework for graph-based data mining

Information Sciences: an International Journal
A supervised approach to detect protein complex by combining biological and topological properties

International Journal of Data Mining and Bioinformatics
MFMS: maximal frequent module set mining from multiple human gene expression data sets

Proceedings of the 12th International Workshop on Data Mining in Bioinformatics
Three-objective subgraph mining using multiobjective evolutionary programming

Journal of Computer and System Sciences
Campaign extraction from social media

ACM Transactions on Intelligent Systems and Technology (TIST) - Special Section on Intelligent Mobile Knowledge Discovery and Management Systems and Special Issue on Social Web Mining
Modelling and exploring historical records to facilitate service composition

International Journal of Web and Grid Services

Quantified Score

Hi-index	3.84

Visualization

Abstract

Motivation: The rapid accumulation of biological network data translates into an urgent need for computational methods for graph pattern mining. One important problem is to identify recurrent patterns across multiple networks to discover biological modules. However, existing algorithms for frequent pattern mining become very costly in time and space as the pattern sizes and network numbers increase. Currently, no efficient algorithm is available for mining recurrent patterns across large collections of genome-wide networks. Results: We developed a novel algorithm, CODENSE, to efficiently mine frequent coherent dense subgraphs across large numbers of massive graphs. Compared with previous methods, our approach is scalable in the number and size of the input graphs and adjustable in terms of exact or approximate pattern mining. Applying CODENSE to 39 co-expression networks derived from microarray datasets, we discovered a large number of functionally homogeneous clusters and made functional predictions for 169 uncharacterized yeast genes. Availability: http://zhoulab.usc.edu/CODENSE/ Contact: xjzhou@usc.edu