A new algorithm for mining frequent connected subgraphs based on adjacency matrices

  • Authors:
  • Andrés Gago-Alonso;Abel Puentes-Luberta;Jesús A. Carrasco-Ochoa;José E. Medina-Pagola;José Fco. Martínez-Trinidad

  • Affiliations:
  • (Correspd. E-mail: agago@cenatav.co.cu) Advanced Technologies Application Center, Havana, Cuba and Computer Science Department, National Institute of Astrophysics, Optics and Electronics, Puebla, ...;Advanced Technologies Application Center, Havana, Cuba and Faculty of Mathematics and Computer Sciences, University of Havana, Havana, Cuba;Computer Science Department, National Institute of Astrophysics, Optics and Electronics, Puebla, México;Advanced Technologies Application Center, Havana, Cuba;Computer Science Department, National Institute of Astrophysics, Optics and Electronics, Puebla, México

  • Venue:
  • Intelligent Data Analysis
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Most of the Frequent Connected Subgraph Mining (FCSM) algorithms have been focused on detecting duplicate candidates using canonical form (CF) tests. CF tests have high computational complexity, which affects the efficiency of graph miners. In this paper, we introduce novel properties of the canonical adjacency matrices for reducing the number of CF tests in FCSM. Based on these properties, a new algorithm for frequent connected subgraph mining called grCAM is proposed. The experiments on real world datasets show the impact of the proposed properties in FCSM. Besides, the performance of our algorithm is compared against some other reported algorithms.