Fast Graph Pattern Matching

  • Authors:
  • Jiefeng Cheng;Jeffrey Xu Yu;Bolin Ding;Philip S. Yu;Haixun Wang

  • Affiliations:
  • The Chinese University of Hong Kong, China. jfcheng@se.cuhk.edu.hk;The Chinese University of Hong Kong, China. yu@se.cuhk.edu.hk;The Chinese University of Hong Kong, China. blding@se.cuhk.edu.hk;University of Illinois at Chicago, USA. psyu@cs.uic.edu;T. J. Watson Research Center, IBM, USA. haixun@us.ibm.com

  • Venue:
  • ICDE '08 Proceedings of the 2008 IEEE 24th International Conference on Data Engineering
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Due to rapid growth of the Internet technology and new scientific/technological advances, the number of applications that model data as graphs increases, because graphs have high expressive power to model complicated structures. The dominance of graphs in real-world applications asks for new graph data management so that users can access graph data effectively and efficiently. In this paper, we study a graph pattern matching problem over a large data graph. The problem is to find all patterns in a large data graph that match a user-given graph pattern. We propose a new two-step R-join (reachability join) algorithm with filter step and fetch step based on a cluster-based join-index with graph codes. We consider the filter step as an R-semijoin, and propose a new optimization approach by interleaving R-joins with R-semijoins. We conducted extensive performance studies, and confirm the efficiency of our proposed new approaches.