ORIGAMI: A Novel and Effective Approach for Mining Representative Orthogonal Graph Patterns

Authors:
Vineet Chaoji;Mohammad Al Hasan;Saeed Salem;Jeremy Besson;Mohammed J. Zaki
Affiliations:
Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA;Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA;Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA;INSA-Lyon, LIRIS UMR5205, F-69621 Villeurbanne, France;Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
Venue:
Statistical Analysis and Data Mining
Year:
2008

Citing 0
Cited 7

Output space sampling for graph patterns

Proceedings of the VLDB Endowment
Direct local pattern sampling by efficient two-step random procedures

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Linear space direct pattern sampling using coupling from the past

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Sampling minimal frequent boolean (DNF) patterns

Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Approximate graph mining with label costs

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Randomly sampling maximal itemsets

Proceedings of the ACM SIGKDD Workshop on Interactive Data Exploration and Analytics
A polynomial-time maximum common subgraph algorithm for outerplanar graphs and its application to chemoinformatics

Annals of Mathematics and Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we introduce the concept of α-orthogonal patterns to mine a representative set of graph patterns. Intuitively, two graph patterns are α-orthogonal if their similarity is bounded above by α. Each α-orthogonal pattern is also a representative for those patterns that are at least β similar to it. Given user defined α, β ∈ [0, 1], the goal is to mine an α-orthogonal, β-representative set that minimizes the set of unrepresented patterns. We present ORIGAMI, an effective algorithm for mining the set of representative orthogonal patterns. ORIGAMI first uses a randomized algorithm to randomly traverse the pattern space, seeking previously unexplored regions, to return a set of maximal patterns. ORIGAMI then extracts an α-orthogonal, β-representative set from the mined maximal patterns. We show the effectiveness of our algorithm on a number of real and synthetic datasets. In particular, we show that our method is able to extract high-quality patterns even in cases where existing enumerative graph mining methods fail to do so. Copyright © 2008 Wiley Periodicals, Inc., A Wiley Company Statistical Analy Data Mining 1: 000-000, 2008 The first two authors contributed equally for this research.