Approximating a similarity matrix by a latent class model: A reappraisal of additive fuzzy clustering

  • Authors:
  • Cajo J. F. ter Braak;Yiannis Kourmpetis;Henk A. L. Kiers;Marco C. A. M. Bink

  • Affiliations:
  • Biometris, Wageningen University and Research Centre, The Netherlands;Biometris, Wageningen University and Research Centre, The Netherlands;Heymans Institute of Psychology, University of Groningen, The Netherlands;Biometris, Wageningen University and Research Centre, The Netherlands

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2009

Quantified Score

Hi-index 0.03

Visualization

Abstract

Let Q be a given nxn square symmetric matrix of nonnegative elements between 0 and 1, e.g. similarities. Fuzzy clustering results in fuzzy assignment of individuals to K clusters. In additive fuzzy clustering, the nxK fuzzy memberships matrix P is found by least-squares approximation of the off-diagonal elements of Q by inner products of rows of P. By contrast, kernelized fuzzy c-means is not least-squares and requires an additional fuzziness parameter. The aim is to popularize additive fuzzy clustering by interpreting it as a latent class model, whereby the elements of Q are modeled as the probability that two individuals share the same class on the basis of the assignment probability matrix P. Two new algorithms are provided, a brute force genetic algorithm (differential evolution) and an iterative row-wise quadratic programming algorithm of which the latter is the more effective. Simulations showed that (1) the method usually has a unique solution, except in special cases, (2) both algorithms reached this solution from random restarts and (3) the number of clusters can be well estimated by AIC. Additive fuzzy clustering is computationally efficient and combines attractive features of both the vector model and the cluster model.