MINI: Mining Informative Non-redundant Itemsets

  • Authors:
  • Arianna Gallo;Tijl Bie;Nello Cristianini

  • Affiliations:
  • University of Bristol, Department of Engineering Mathematics, UK;University of Bristol, Department of Engineering Mathematics, UK;University of Bristol, Department of Engineering Mathematics, UK and University of Bristol, Department of Computer Science, UK

  • Venue:
  • PKDD 2007 Proceedings of the 11th European conference on Principles and Practice of Knowledge Discovery in Databases
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Frequent itemset mining assists the data mining practitioner in searching for strongly associated items (and transactions) in large transaction databases. Since the number of frequent itemsets is usually extremely large and unmanageable for a human user, recent works have sought to define condensed representations of them, e.g. closedor maximalfrequent itemsets. We argue that not only these methods often still fall short in sufficiently reducing of the output size, but they also output many redundant itemsets. In this paper we propose a philosophically new approach that resolves both these issues in a computationally tractable way. We present and empirically validate a statistically founded approach called MINI, to compress the set of frequent itemsets down to a list of informative and non-redundant itemsets.