Efficient Algorithms for Mining Significant Substructures in Graphs with Quality Guarantees

Authors:
Huahai He;Ambuj K. Singh
Affiliations:
-;-
Venue:
ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Year:
2007

Citing 0
Cited 5

Mining graph patterns efficiently via randomized summaries

Proceedings of the VLDB Endowment
Towards proximity pattern mining in large graphs

Proceedings of the 2010 ACM SIGMOD International Conference on Management of data
Frequent subgraph mining in outerplanar graphs

Data Mining and Knowledge Discovery
On dense pattern mining in graph streams

Proceedings of the VLDB Endowment
All normalized anti-monotonic overlap graph measures are bounded

Data Mining and Knowledge Discovery

Quantified Score

Hi-index	0.00

Visualization

Abstract

Graphs have become popular for modeling scientific data in recent years. As a result, techniques for mining graphs are extremely important for understanding inherent data and domain characteristics. One such exploratory mining paradigm is the k-MST (minimum spanning tree over k vertices) problem that can be used to discover significant local substructures. In this paper, we present an efficient approximation algorithm for the k-MST problem in large graphs. The algorithm has an O (k) approximation ratio and O (n log n + m log m log k + nk2 log k) running time, where n and m are the number of vertices and edges respectively. Experimental results on synthetic graphs and protein interaction networks show that the algorithm is scalable to large graphs and useful for discovering biological pathways. The highlight of the algorithm is that it offers both analytical guarantees and empirical evidence of good running time and quality.