Hierarchical Distance-Based Conceptual Clustering

  • Authors:
  • Ana Maria Funes;Cesar Ferri;Jose Hernández-Orallo;Maria Jose Ramírez-Quintana

  • Affiliations:
  • Universidad Nacional de San Luis, San Luis, Argentina 5700 and DSIC, Universidad Politécnica de Valencia, Valencia 46022;DSIC, Universidad Politécnica de Valencia, Valencia 46022;DSIC, Universidad Politécnica de Valencia, Valencia 46022;DSIC, Universidad Politécnica de Valencia, Valencia 46022

  • Venue:
  • ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this work we analyse the relation between hierarchical distance-based clustering and the concepts that can be obtained from the hierarchy by generalisation. Many inconsistencies may arise, because the distance and the conceptual generalisation operator are usually incompatible. To overcome this, we propose an algorithm which integrates distance-based and conceptual clustering. The new dendrograms can show when an element has been integrated to the cluster because it is near in the metric space or because it is covered by the concept. In this way, the new clustering can differ from the original one but the metric traceability is clear. We introduce three different levels of agreement between the clustering hierarchy obtained from the linkage distance and the new hierarchy, and we define properties these generalisation operators should satisfy in order to produce distance-consistent dendrograms.