Kernel MDL to Determine the Number of Clusters

  • Authors:
  • Ivan O Kyrgyzov;Olexiy O Kyrgyzov;Henri Maître;Marine Campedel

  • Affiliations:
  • Competence Centre for Information Extraction, and Image Understanding for Earth Observation, GET/Télécom Paris - LTCI, UMR 5141, CNRS, 46, rue Barrault, 75013, Paris, France;Department of Computer Science and Electrical Engineering, OGI School of Science and Engineering, Oregon Health and Science University, 20000 NW Walker Road, Beaverton, OR, 97006, USA;Competence Centre for Information Extraction, and Image Understanding for Earth Observation, GET/Télécom Paris - LTCI, UMR 5141, CNRS, 46, rue Barrault, 75013, Paris, France;Competence Centre for Information Extraction, and Image Understanding for Earth Observation, GET/Télécom Paris - LTCI, UMR 5141, CNRS, 46, rue Barrault, 75013, Paris, France

  • Venue:
  • MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper we propose a new criterion, based on Minimum Description Length (MDL), to estimate an optimal number of clusters. This criterion, called Kernel MDL (KMDL), is particularly adapted to the use of kernel K-means clustering algorithm. Its formulation is based on the definition of MDL derived for Gaussian Mixture Model (GMM). We demonstrate the efficiency of our approach on both synthetic data and real data such as SPOT5 satellite images.