A NON-PARAMETRIC APPROACH TO SIMPLICITY CLUSTERING

  • Authors:
  • Peter Hines;Emmanuel M. Pothos;Nick Chater

  • Affiliations:
  • Department of Computer Science, University of York, York, U.K.;Department of Psychology, University of Wales, Swansea, U.K.;Department of Psychology, University College London, London, U.K.

  • Venue:
  • Applied Artificial Intelligence
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

The simplicity principle-an updating of Ockham's razor to take into account modern information theory-states that the preferred theory for a set of data is the one that allows for the most efficient encoding of the data. We consider this in the context of classification, or clustering, as a data reduction technique that helps describe a set of objects by dividing the objects into groups. The simplicity model we present favors clusters such that the similarity of the items in the clusters is maximal, while the similarity of items between clusters is minimal. Several novel features of our clustering criterion make it especially appropriate for clustering of data derived from, psychological procedures (e.g., similarity ratings): It is non-parametric, and may be applied in situations where the metric axioms are violated without requiring (information-forgetting) transformation procedures. We illustrate the use of the criterion with a selection of data sets. A distinctive aspect of this research is that it motivates a clustering algorithm from psychological principles.