Automatic Item Weight Generation for Pattern Mining and its Application

  • Authors:
  • Russel Pears;Yun Sing Koh;Gillian Dobbie

  • Affiliations:
  • Auckland University of Technology, New Zealand;The University of Auckland, New Zealand;The University of Auckland, New Zealand

  • Venue:
  • International Journal of Data Warehousing and Mining
  • Year:
  • 2011

Abstract

Association rule mining discovers relationships among items in a transactional database. Most approaches assume that all items within a dataset have a uniform distribution with respect to support. However, this is not always the case, and weighted association rule mining (WARM) was introduced to assign importance to individual items. Previous approaches to the weighted association rule mining problem require users to assign weights to items, which can be difficult to do for every item in a dataset. In this paper, the authors propose a method based on a novel Valency model that automatically infers item weights from the interactions between items. The authors' experiments show that the weighting scheme yields rules that better capture the natural variation in a dataset than a miner that does not employ a weighting scheme. The authors also applied the model in a real-world application, mining text from a given collection of documents. Item weighting enabled them to attach more importance to distinctive terms. The results demonstrate that keyword discrimination via item weighting leads to informative rules.
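
To make the idea of item weighting concrete, the following is a minimal sketch of how item weights can change the ranking of itemsets. It is not the Valency model, whose weight inference is not described in this abstract. The weights are hand-assigned, the item names are made up, and the weighted support formula (mean item weight times ordinary support) is only one simple formulation, assumed here for illustration.

```python
from typing import Dict, FrozenSet, Iterable, List


def support(itemset: Iterable[str], transactions: List[FrozenSet[str]]) -> float:
    """Fraction of transactions that contain every item in the itemset."""
    items = frozenset(itemset)
    hits = sum(1 for t in transactions if items <= t)
    return hits / len(transactions)


def weighted_support(
    itemset: Iterable[str],
    transactions: List[FrozenSet[str]],
    weights: Dict[str, float],
) -> float:
    """Assumed formulation: mean item weight multiplied by ordinary support."""
    items = frozenset(itemset)
    mean_weight = sum(weights[i] for i in items) / len(items)
    return mean_weight * support(items, transactions)


# Hypothetical toy data: item names and weights are illustrative only.
transactions = [
    frozenset({"bread", "milk"}),
    frozenset({"bread", "caviar"}),
    frozenset({"bread", "milk", "caviar"}),
    frozenset({"milk"}),
]
weights = {"bread": 0.2, "milk": 0.2, "caviar": 0.9}

common_pair = weighted_support({"bread", "milk"}, transactions, weights)
distinctive_pair = weighted_support({"bread", "caviar"}, transactions, weights)
```

Both pairs have the same ordinary support in this toy data, but the pair containing the heavily weighted item scores higher under the weighted measure. An automatic scheme such as the one the authors propose would replace the hand-assigned `weights` dictionary with weights inferred from the data.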