Top-K Correlation Sub-graph Search in Graph Databases

  • Authors:
  • Lei Zou;Lei Chen;Yansheng Lu

  • Affiliations:
  • Huazhong University of Science and Technology, Wuhan, China;Hong Kong of Science and Technology, Hong Kong, China;Huazhong University of Science and Technology, Wuhan, China

  • Venue:
  • DASFAA '09 Proceedings of the 14th International Conference on Database Systems for Advanced Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recently, due to its wide applications, (similar) subgraph search has attracted a lot of attentions from database and data mining community, such as [13,18,19,5]. In [8], Ke et al. first proposed correlation sub-graph search problem (CGSearch for short) to capture the underlying dependency between sub-graphs in a graph database, that is CGS algorithm. However, CGS algorithm requires the specification of a minimum correlation threshold *** to perform computation. In practice, it may not be trivial for users to provide an appropriate threshold *** , since different graph databases typically have different characteristics. Therefore, we propose an alternative mining task: top -K c orrelation sub- g raph search (TOP-CGSearh for short). The new problem itself does not require setting a correlation threshold, which leads the previous proposed CGS algorithm inefficient if we apply it directly to TOP-CGSearch problem. To conduct TOP-CGSearch efficiently, we develop a p attern- g rowth algorithm (that is PG-search algorithm) and utilize graph indexing methods to speed up the mining task. Extensive experiment results evaluate the efficiency of our methods.