Looking for a structural characterization of the sparseness measure of (frequent closed) itemset contexts

  • Authors:
  • T. Hamrouni;S. Ben Yahia;E. Mephu Nguifo

  • Affiliations:
  • LIPAH, Computer Science Department, Faculty of Sciences of Tunis, Tunis El Manar University, University Campus, Tunis, Tunisia;LIPAH, Computer Science Department, Faculty of Sciences of Tunis, Tunis El Manar University, University Campus, Tunis, Tunisia;Clermont Université, Université Blaise Pascal, LIMOS, BP 10448, F-63000 Clermont-Ferrand, France and CNRS, UMR 6158, LIMOS, F-63173 Aubière, France

  • Venue:
  • Information Sciences: an International Journal
  • Year:
  • 2013

Quantified Score

Hi-index 0.07

Visualization

Abstract

It is widely recognized that the performances of frequent-pattern mining algorithms are closely dependent on data being handled, i.e., sparse or dense. The same situation applies to the efficiency of concise representations of frequently occurring patterns with respect to the extraction task and the obtained compactness rates, as well as for other data mining techniques such as clustering, and for the mining algorithms of different pattern classes such as hypergraphs. In this paper, we raise a fundamental question: how can we formally define the sparseness of an arbitrary context and assess its value? As an answer, based on the framework of the succinct system of minimal generators, we present an innovative characterization of context sparseness, as well as a new sparseness measure which results from the aggregation of two complementary measures, namely the succinctness and compactness measures of each equivalence class, induced by the Galois closure operator. Experiments carried out mainly attain a finer classification of benchmark contexts and, then, confirm our viewpoint that the ''dense'' and ''sparse'' labels are not absolute.