An Instantiation of Hierarchical Distance-Based Conceptual Clustering for Propositional Learning

  • Authors:
  • Ana Funes;Cesar Ferri;Jose Hernández-Orallo;Maria José Ramírez-Quintana

  • Affiliations:
  • DSIC, Universidad Politécnica de Valencia, Valencia, 46022 and Universidad Nacional de San Luis, San Luis, Argentina 5700;DSIC, Universidad Politécnica de Valencia, Valencia, 46022;DSIC, Universidad Politécnica de Valencia, Valencia, 46022;DSIC, Universidad Politécnica de Valencia, Valencia, 46022

  • Venue:
  • PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this work we analyse the relationship between distance and generalisation operators for real numbers, nominal data and tuples in the context of hierarchical distance-based conceptual clustering (HDCC). HDCC is a general approach to conceptual clustering that extends the traditional algorithm for hierarchical clustering by producing conceptual generalisations of the discovered clusters. This makes it possible to combine the flexibility of changing distances for several clustering problems and the advantage of having concepts which are crucial for tasks as summarisation and descriptive data mining in general. In this work we propose a set of generalisation operators and distances for the data types mentioned before and we analyse the properties by them satisfied on the basis of three different levels of agreement between the clustering hierarchy obtained from the linkage distance and the hierarchy obtained by using generalisation operators.