Efficient Approximate Mining of Frequent Patterns over Transactional Data Streams

  • Authors:
  • Willie Ng;Manoranjan Dash

  • Affiliations:
  • Centre for Advanced Information Systems, Nanyang Technological University, Singapore 639798;Centre for Advanced Information Systems, Nanyang Technological University, Singapore 639798

  • Venue:
  • DaWaK '08 Proceedings of the 10th international conference on Data Warehousing and Knowledge Discovery
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

We investigate the problem of finding frequent patterns in a continuous stream of transactions. It is recognized that the approximate solutions are usually sufficient and many existing literature explicitly trade off accuracy for speed where the quality of the final approximate counts are governed by an error parameter, 茂戮驴. However, the quantification of 茂戮驴is never simple. By setting a small 茂戮驴, we achieve good accuracy but suffer in terms of efficiency. A bigger 茂戮驴improves the efficiency but seriously degrades the mining accuracy. To alleviate this problem, we offer an alternative which allows user to customize a set of error bounds based on his requirement. Our experimental studies show that the proposed algorithm has high precision, requires less memory and consumes less CPU time.