Full duplicate candidate pruning for frequent connected subgraph mining

  • Authors:
  • André/s Gago-Alonso;Jesú/s A. Carrasco-Ochoa;José/ E. Medina-Pagola;José/ Fco. Martí/nez-Trinidad

  • Affiliations:
  • Advanced Technologies Application Center, Havana, Cuba;(Correspd. Tel.: +52 (222) 266 31 00, ext 8311/ Fax: +52 (222) 266 34 52/ E-mail: ariel@inaoep.mx) Computer Science Department, National Institute of Astrophysics, Optics and Electronics, Puebla, ...;Advanced Technologies Application Center, Havana, Cuba;Computer Science Department, National Institute of Astrophysics, Optics and Electronics, Puebla, Mé/xico

  • Venue:
  • Integrated Computer-Aided Engineering
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Support calculation and duplicate detection are the most challenging and unavoidable subtasks in frequent connected subgraph (FCS) mining. The most successful FCS mining algorithms have focused on optimizing these subtasks since the existing solutions for both subtasks have high computational complexity. In this paper, we propose two novel properties that allow removing all duplicate candidates before support calculation. Besides, we introduce a fast support calculation strategy based on embedding structures. Both properties and the new embedding structure are used for designing two new algorithms: gdFil for mining all FCSs; and gdClosed for mining all closed FCSs. The experimental results show that our proposed algorithms get the best performance in comparison with other well known algorithms.