ORIGAMI: A Novel and Effective Approach for Mining Representative Orthogonal Graph Patterns

  • Authors:
  • Vineet Chaoji;Mohammad Al Hasan;Saeed Salem;Jeremy Besson;Mohammed J. Zaki

  • Affiliations:
  • Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA;Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA;Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA;INSA-Lyon, LIRIS UMR5205, F-69621 Villeurbanne, France;Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA

  • Venue:
  • Statistical Analysis and Data Mining
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we introduce the concept of α-orthogonal patterns to mine a representative set of graph patterns. Intuitively, two graph patterns are α-orthogonal if their similarity is bounded above by α. Each α-orthogonal pattern is also a representative for those patterns that are at least β similar to it. Given user defined α, β ∈ [0, 1], the goal is to mine an α-orthogonal, β-representative set that minimizes the set of unrepresented patterns. We present ORIGAMI, an effective algorithm for mining the set of representative orthogonal patterns. ORIGAMI first uses a randomized algorithm to randomly traverse the pattern space, seeking previously unexplored regions, to return a set of maximal patterns. ORIGAMI then extracts an α-orthogonal, β-representative set from the mined maximal patterns. We show the effectiveness of our algorithm on a number of real and synthetic datasets. In particular, we show that our method is able to extract high-quality patterns even in cases where existing enumerative graph mining methods fail to do so. Copyright © 2008 Wiley Periodicals, Inc., A Wiley Company Statistical Analy Data Mining 1: 000-000, 2008 The first two authors contributed equally for this research.