Realistic Synthetic Data for Testing Association Rule Mining Algorithms for Market Basket Databases

  • Authors:
  • Colin Cooper; Michele Zito

  • Affiliations:
  • Department of Computer Science, King's College, London WC2R 2LS, UK; Department of Computer Science, University of Liverpool, Liverpool, L69 3BX, UK

  • Venue:
  • PKDD 2007: Proceedings of the 11th European Conference on Principles and Practice of Knowledge Discovery in Databases
  • Year:
  • 2007

Abstract

We investigate the statistical properties of the databases generated by the IBM QUEST program. Motivated by the claim (also supported by empirical evidence) that item occurrences in real-life market basket databases follow a rather different pattern, we propose an alternative model for generating artificial data.