Adaptive Hausdorff distances and dynamic clustering of symbolic interval data

  • Authors:
  • Francisco de A. T. de Carvalho;Renata M. C. R. de Souza;Marie Chavent;Yves Lechevallier

  • Affiliations:
  • Centro de Informática, Universidade Federal de Pernambuco, Caixa Postal 7851, CEP 50732-970 Recife (PE), Brazil;Centro de Informática, Universidade Federal de Pernambuco, Caixa Postal 7851, CEP 50732-970 Recife (PE), Brazil;Mathématiques Appliquées de Bordeaux, UMR 5466 CNRS, Université Bordeaux 1-351, Cours de la liberation, 33405 Talence Cedex, France;INRIA-Institut National de Recherche en Informatique et en Automatique, Domaine de Voluceau-Rocquencourt, B.P. 105, 78153 Le Chesnay Cedex, France

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2006

Quantified Score

Hi-index 0.10

Visualization

Abstract

This paper presents a partitional dynamic clustering method for interval data based on adaptive Hausdorff distances. Dynamic clustering algorithms are iterative two-step relocation algorithms involving the construction of the clusters at each iteration and the identification of a suitable representation or prototype (means, axes, probability laws, groups of elements, etc.) for each cluster by locally optimizing an adequacy criterion that measures the fitting between the clusters and their corresponding representatives. In this paper, each pattern is represented by a vector of intervals. Adaptive Hausdorff distances are the measures used to compare two interval vectors. Adaptive distances at each iteration change for each cluster according to its intra-class structure. The advantage of these adaptive distances is that the clustering algorithm is able to recognize clusters of different shapes and sizes. To evaluate this method, experiments with real and synthetic interval data sets were performed. The evaluation is based on an external cluster validity index (corrected Rand index) in a framework of a Monte Carlo experiment with 100 replications. These experiments showed the usefulness of the proposed method.