A clustering algorithm for huge trees

  • Authors:
  • D. Auber;M. Delest

  • Affiliations:
  • LaBRI-Université Bordeaux 1, 351, Cours de la Libération, 33405 Talence, France;LaBRI-Université Bordeaux 1, 351, Cours de la Libération, 33405 Talence, France

  • Venue:
  • Advances in Applied Mathematics
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a new tree clustering algorithm based on combinatorial statistics on trees. Using well-known measures on trees giving the number of leaves of a subtree or the number of siblings of a node, we design a parameter that can be used to detect irregularities in a tree. We obtain a clustering approach for trees by implementing classical statistical tests on this parameter, thus providing well balanced drawings of trees offering better aspect ratios which can be useful when dealing with large, irregular hierarchical data. Our algorithm is linear in time and can thus be applied to large data structures. Moreover the larger the structure is the better the precision of the statistical tools.