DB-FSG: An SQL-Based Approach for Frequent Subgraph Mining

  • Authors:
  • Sharma Chakravarthy;Subhesh Pradhan

  • Affiliations:
  • IT Laboratory & Department of Computer Science and Engineering, The University of Texas at Arlington, Arlington, TX 76019;IT Laboratory & Department of Computer Science and Engineering, The University of Texas at Arlington, Arlington, TX 76019

  • Venue:
  • DEXA '08 Proceedings of the 19th international conference on Database and Expert Systems Applications
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Mining frequent subgraphs (FSG) is one form of graph mining for which only main memory algorithms exist currently. There are many applications in social networks, biology, computer networks, chemistry and the World Wide Web that require mining of frequent subgraphs. The focus of this paper is to apply relational database techniques to support frequent subgraph mining. Some of the computations, such as duplicate elimination, canonical labeling, and isomorphism checking are not straightforward using SQL. The contribution of this paper is to efficiently map complex computations to relational operators. Unlike the main memory counter parts of FSG, our approach addresses the most general graph representation including multiple edges between any two vertices, bi-directional edges, and cycles. Experimental evaluation of the proposed approach is also presented in the paper.