A data analysis approach for evaluating the behavior of interestingness measures

  • Authors: Xuan-Hiep Huynh; Fabrice Guillet; Henri Briand
  • Affiliation (all authors): LINA CNRS 2729, Polytechnic School of Nantes University, Nantes, France
  • Venue: DS'05 Proceedings of the 8th international conference on Discovery Science
  • Year: 2005

Abstract

In recent years, the problem of finding the different aspects existing in a dataset has attracted many authors in the domain of knowledge quality in KDD. The discovery of knowledge in the form of association rules has become an important research topic. One of the most difficult issues is that an enormous number of association rules are discovered, so it is not easy to choose the best association rules or knowledge for a given dataset. Some methods have been proposed for choosing the best rules with an interestingness measure, or for matching the properties of interestingness measures within a given set of measures. In this paper, we propose a new approach to discover the clusters of interestingness measures existing in a dataset. Our approach is based on the evaluation of the distance computed between interestingness measures. We use two techniques, agglomerative hierarchical clustering (AHC) and partitioning around medoids (PAM), to help the user graphically evaluate the behavior of interestingness measures.
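
The idea of clustering interestingness measures by their pairwise distance can be illustrated with a minimal Python sketch. The abstract does not say which measures, which distance, or which linkage the authors use, so everything here is an assumption made for illustration: the rule counts in `RULES` are invented, the five measures are common textbook ones, the distance is taken as 1 - |Pearson correlation| between the values two measures assign to the same rules, AHC uses average linkage, and `pam` is a simplified greedy-swap variant of partitioning around medoids.

```python
from itertools import combinations
from math import sqrt

# Hypothetical counts (n, n_a, n_b, n_ab) for rules a -> b; illustrative only.
RULES = [
    (1000, 200, 300, 150),
    (1000, 500, 600, 320),
    (1000, 100, 400, 90),
    (1000, 300, 200, 80),
    (1000, 400, 500, 210),
    (1000, 250, 700, 240),
]

# A few common interestingness measures (an assumed selection, not the paper's).
MEASURES = {
    "support": lambda n, na, nb, nab: nab / n,
    "confidence": lambda n, na, nb, nab: nab / na,
    "lift": lambda n, na, nb, nab: n * nab / (na * nb),
    "leverage": lambda n, na, nb, nab: nab / n - (na / n) * (nb / n),
    "jaccard": lambda n, na, nb, nab: nab / (na + nb - nab),
}


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def distance_matrix(rules=RULES, measures=MEASURES):
    """Assumed distance between two measures: 1 - |correlation| over the rules."""
    values = {name: [f(*r) for r in rules] for name, f in measures.items()}
    names = list(values)
    dist = {a: {b: 0.0 for b in names} for a in names}
    for a, b in combinations(names, 2):
        d = max(0.0, 1.0 - abs(pearson(values[a], values[b])))
        dist[a][b] = dist[b][a] = d
    return names, dist


def ahc(names, dist, k):
    """Average-linkage agglomerative clustering, stopped at k clusters."""
    clusters = [[n] for n in names]

    def link(c1, c2):
        return sum(dist[a][b] for a in c1 for b in c2) / (len(c1) * len(c2))

    while len(clusters) > k:
        # Merge the two clusters with the smallest average pairwise distance.
        i, j = min(combinations(range(len(clusters)), 2),
                   key=lambda p: link(clusters[p[0]], clusters[p[1]]))
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]  # safe: j > i
    return clusters


def pam(names, dist, k):
    """Simplified PAM: swap medoids greedily while the total cost decreases."""
    def cost(medoids):
        return sum(min(dist[n][m] for m in medoids) for n in names)

    medoids = list(names[:k])
    improved = True
    while improved:
        improved = False
        for i in range(k):
            for cand in names:
                if cand in medoids:
                    continue
                trial = medoids[:i] + [cand] + medoids[i + 1:]
                if cost(trial) < cost(medoids) - 1e-12:
                    medoids, improved = trial, True
    # Assign every measure to its nearest medoid.
    clusters = {m: [] for m in medoids}
    for n in names:
        clusters[min(medoids, key=lambda m: dist[n][m])].append(n)
    return clusters


if __name__ == "__main__":
    names, dist = distance_matrix()
    print("AHC clusters:", ahc(names, dist, 2))
    print("PAM clusters:", pam(names, dist, 2))
```

Measures that rank the rules in a similar way end up close together and fall into the same cluster. A dendrogram (for AHC) or a cluster plot (for PAM) drawn from the same distance matrix would give the kind of graphical view the abstract refers to; the plotting step is left out of this sketch.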