Statistical supports for frequent itemsets on data streams

  • Authors:
  • Pierre-Alain Laur;Jean-Emile Symphor;Richard Nock;Pascal Poncelet

  • Affiliations:
  • GRIMAAG-Dépt Scientifique Interfacultaire, Université Antilles-Guyane, Schoelcher Cedex, Martinique, France;GRIMAAG-Dépt Scientifique Interfacultaire, Université Antilles-Guyane, Schoelcher Cedex, Martinique, France;GRIMAAG-Dépt Scientifique Interfacultaire, Université Antilles-Guyane, Schoelcher Cedex, Martinique, France;LG2IP-Ecole des Mines d'Alès, Site EERIE, Nîmes Cedex, France

  • Venue:
  • MLDM'05 Proceedings of the 4th international conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

When we mine information for knowledge on a whole data streams it's necessary to cope with uncertainty as only a part of the stream is available. We introduce a stastistical technique, independant from the used algorithm, for estimating the frequent itemset on a stream. This statistical support allows to maximize either the precision or the recall as choosen by the user, while it doesn't damage the other. Experiments with various association rules databases demonstrate the potential of such technique.