Interpretable clustering using unsupervised binary trees

  • Authors:
  • Ricardo Fraiman;Badih Ghattas;Marcela Svarc

  • Affiliations:
  • Universidad de San Andrés and Universidad de la República, Buenos Aires, Argentina;Département de Mathématiques, Université de la Méditerrannée Faculté des Sciences de Luminy, Marseille Cedex 09, France 13288;Universidad de San Andrés and Conicet, Buenos Aires, Argentina

  • Venue:
  • Advances in Data Analysis and Classification
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We herein introduce a new method of interpretable clustering that uses unsupervised binary trees. It is a three-stage procedure, the first stage of which entails a series of recursive binary splits to reduce the heterogeneity of the data within the new subsamples. During the second stage (pruning), consideration is given to whether adjacent nodes can be aggregated. Finally, during the third stage (joining), similar clusters are joined together, even if they do not share the same parent originally. Consistency results are obtained, and the procedure is used on simulated and real data sets.