Regular Article: A Structured Family of Clustering and Tree Construction Methods

  • Authors:
  • David Bryant;Vincent Berry

  • Affiliations:
  • LIRMM, Montpellier, France and Departments of Mathematics and Computer Science, McGill University, Canadaf1bryant@math.mcgill.caf1;LIRMM, Montpellier, France

  • Venue:
  • Advances in Applied Mathematics
  • Year:
  • 2001

Quantified Score

Hi-index 0.00

Visualization

Abstract

A cluster A is an Apresjan cluster if every pair of objects within A is more similar than either is to any object outside A. The criterion is intuitive, compelling, but often too restrictive for applications in classification. We therefore explore extensions of Apresjan clustering to a family of related hierarchical clustering methods. The extensions are shown to be closely connected with the well-known single and average linkage tree constructions. A dual family of methods for classification by splits is also presented. Splits are partitions of the set of objects into two disjoint blocks and are widely used in domains such as phylogenetics. Both the cluster and split methods give rise to progressively refined tree representations. We exploit dualities and connections between the various methods, giving polynomial time construction algorithms for most of the constructions and NP-hardness results for the rest.