On the estimation of frequent itemsets for data streams: theory and experiments

  • Authors:
  • Pierre-Alain Laur;Richard Nock;Jean-Emile Symphor;Pascal Poncelet

  • Affiliations:
  • U. Antilles-Guyane, Martinique, France;U. Antilles-Guyane, Martinique, France;U. Antilles-Guyane, Martinique, France;Lg2ip-Mines d'Alès, France

  • Venue:
  • Proceedings of the 14th ACM international conference on Information and knowledge management
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we devise a method for the estimation of the true support of itemsets on data streams, with the objective to maximize one chosen criterion among {precision, recall} while ensuring a degradation as reduced as possible for the other criterion. We discuss the strengths, weaknesses and range of applicability of this method that relies on conventional uniform convergence results, yet guarantees statistical optimality from different standpoints.