Generating a Condensed Representation for Association Rules

  • Authors:
  • Nicolas Pasquier;Rafik Taouil;Yves Bastide;Gerd Stumme;Lotfi Lakhal

  • Affiliations:
  • I3S (CNRS UMR 6070)--Université de Nice-Sophia Antipolis, France 06903;LI--Université François Rabelais de Tours, Blois, France 41000;IRISA--INRIA Rennes, Campus Universitaire de Beaulieu, Rennes, France 35042;Fachbereich Mathematik/Informatik, Universität Kassel, Kassel, Germany 34121;LIM (CNRS FRE 2246)--Université de la Méditerranée, Marseille, France 13288

  • Venue:
  • Journal of Intelligent Information Systems
  • Year:
  • 2005

Abstract

Association rule extraction from operational datasets often produces tens of thousands, or even millions, of association rules. Moreover, many of these rules are redundant and thus useless. Using a semantics based on the closure of the Galois connection, we define a condensed representation for association rules. This representation is characterized by frequent closed itemsets and their generators. It contains the non-redundant association rules having minimal antecedent and maximal consequent, called min-max association rules. We think that these rules are the most relevant since they are the most general non-redundant association rules. Furthermore, this representation is a basis, i.e., a generating set for all association rules, their supports and their confidences, and all of them can be retrieved without accessing the data. We introduce algorithms for extracting this basis and for reconstructing all association rules. Results of experiments carried out on real datasets show the usefulness of this approach. So that this basis can also be generated when an algorithm for extracting frequent itemsets (such as Apriori) is used, we also present an algorithm for deriving frequent closed itemsets and their generators from frequent itemsets without using the dataset.
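
To make the notions of closure, generator and min-max rule concrete, the following is a minimal Python sketch under stated assumptions. It is a brute-force enumeration over a toy dataset, not the algorithms the paper introduces, and it covers only rules with confidence 1 (generator -> closure minus generator). The transaction data and all function names (`closure`, `is_generator`, `min_max_exact_rules`, `support_from_closed`) are hypothetical and chosen only for illustration.

```python
from itertools import combinations

# Toy transaction database (illustrative data, an assumption of this sketch).
TRANSACTIONS = [
    {"a", "c", "d"},
    {"b", "c", "e"},
    {"a", "b", "c", "e"},
    {"b", "e"},
    {"a", "b", "c", "e"},
]


def support_count(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def closure(itemset, transactions):
    """Galois closure: the items shared by all transactions containing `itemset`."""
    covering = [t for t in transactions if itemset <= t]
    if not covering:
        # No transaction contains the itemset: its closure is the whole item universe.
        return frozenset().union(*transactions)
    return frozenset(set.intersection(*covering))


def frequent_itemsets(transactions, min_count):
    """Brute-force map from each frequent itemset to its support count."""
    items = sorted(set().union(*transactions))
    frequent = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            candidate = frozenset(combo)
            count = support_count(candidate, transactions)
            if count >= min_count:
                frequent[candidate] = count
    return frequent


def is_generator(itemset, frequent, n_transactions):
    """True if no proper subset has the same support as `itemset`.

    Checking the subsets obtained by removing one item is enough, because
    support can only grow when items are removed.
    """
    for item in itemset:
        subset = itemset - {item}
        # The empty itemset is contained in every transaction.
        subset_count = frequent[subset] if subset else n_transactions
        if subset_count == frequent[itemset]:
            return False
    return True


def min_max_exact_rules(transactions, min_count):
    """Rules (generator, closure minus generator, support count), confidence 1."""
    frequent = frequent_itemsets(transactions, min_count)
    rules = []
    for itemset, count in frequent.items():
        if not is_generator(itemset, frequent, len(transactions)):
            continue
        closed = closure(itemset, transactions)
        if closed != itemset:
            rules.append((itemset, closed - itemset, count))
    return rules


def support_from_closed(itemset, closed_supports):
    """Support of `itemset` read off the closed itemsets alone (0 if infrequent)."""
    return max((n for c, n in closed_supports.items() if itemset <= c), default=0)


if __name__ == "__main__":
    for lhs, rhs, count in min_max_exact_rules(TRANSACTIONS, min_count=2):
        print(sorted(lhs), "->", sorted(rhs), "support count:", count)
```

With a minimum support count of 2, this toy dataset has five frequent closed itemsets and eight generators, and the sketch yields seven exact rules such as a -> c. The helper `support_from_closed` illustrates the "without accessing the data" property in a simplified form: the support of any frequent itemset equals the largest support among the closed itemsets that contain it.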