RAM: Randomized Approximate Graph Mining

  • Authors:
  • Shijie Zhang;Jiong Yang

  • Affiliations:
  • EECS Department, Case Western Reserve Univ., Cleveland, USA OH 44106;EECS Department, Case Western Reserve Univ., Cleveland, USA OH 44106

  • Venue:
  • SSDBM '08 Proceedings of the 20th international conference on Scientific and Statistical Database Management
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a definition for frequent approximate patterns in order to model important subgraphs in a graph database with incomplete or inaccurate information. By our definition, frequent approximate patterns possess three main properties: possible absence of exact match, maximal representation, and the Apriori Property. Since approximation increases the number of frequent patterns, we present a novel randomized algorithm (called RAM) using feature retrieval. A large number of real and synthetic data sets are used to demonstrate the effectiveness and efficiency of the frequent approximate graph pattern model and the RAM algorithm.