20 years of pattern mining: a bibliometric survey

  • Authors:
  • Arnaud Giacometti; Dominique H. Li; Patrick Marcel; Arnaud Soulet

  • Affiliations:
  • Université François-Rabelais de Tours, Blois, France; Université François-Rabelais de Tours, Blois, France; Université François-Rabelais de Tours, Blois, France; Université François-Rabelais de Tours, Blois, France

  • Venue:
  • ACM SIGKDD Explorations Newsletter
  • Year:
  • 2014

Abstract

In 1993, Rakesh Agrawal, Tomasz Imielinski and Arun N. Swami published one of the founding papers of Pattern Mining: "Mining Association Rules between Sets of Items in Large Databases". Beyond introducing a new problem, the paper introduced a new methodology for solving and evaluating it. For two decades, Pattern Mining has been one of the most active fields in Knowledge Discovery in Databases. This paper provides a bibliometric survey of the literature based on 1,087 publications from five major international conferences: KDD, PKDD, PAKDD, ICDM and SDM. We first measure a slowdown in research dedicated to Pattern Mining while the KDD field as a whole continues to grow. We then quantify the main contributions with respect to languages, constraints and condensed representations to outline current directions. We observe that languages have grown more sophisticated over the last 20 years, although association rules and itemsets remain the most studied so far. As expected, the minimal support constraint dominates pattern extraction, appearing in approximately 50% of the publications. Finally, condensed representations, used in 10% of the papers, had relative success, particularly between 2005 and 2008.