Kernel MDL to Determine the Number of Clusters

Authors:
Ivan O Kyrgyzov;Olexiy O Kyrgyzov;Henri Maître;Marine Campedel
Affiliations:
Competence Centre for Information Extraction and Image Understanding for Earth Observation, GET/Télécom Paris - LTCI, UMR 5141, CNRS, 46 rue Barrault, 75013 Paris, France;Department of Computer Science and Electrical Engineering, OGI School of Science and Engineering, Oregon Health and Science University, 20000 NW Walker Road, Beaverton, OR 97006, USA;Competence Centre for Information Extraction and Image Understanding for Earth Observation, GET/Télécom Paris - LTCI, UMR 5141, CNRS, 46 rue Barrault, 75013 Paris, France;Competence Centre for Information Extraction and Image Understanding for Earth Observation, GET/Télécom Paris - LTCI, UMR 5141, CNRS, 46 rue Barrault, 75013 Paris, France
Venue:
MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
Year:
2007

Abstract

In this paper we propose a new criterion, based on the Minimum Description Length (MDL) principle, for estimating the optimal number of clusters. This criterion, called Kernel MDL (KMDL), is particularly well suited to the kernel K-means clustering algorithm. Its formulation builds on the definition of MDL derived for the Gaussian Mixture Model (GMM). We demonstrate the efficiency of our approach on both synthetic data and real data such as SPOT5 satellite images.
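
For orientation, the classical two-part MDL criterion for a Gaussian mixture, on which the abstract says KMDL is based, can be sketched as follows. This is the standard textbook form, not the KMDL formula itself, which the abstract does not state. The notation (n samples x_i in d dimensions, mixing weights \pi_k, means \mu_k, full covariances \Sigma_k, parameter count N_K) is assumed here for illustration.

```latex
% Classical two-part MDL for a K-component GMM (not the paper's KMDL).
% Assumed notation: n samples x_i in R^d, weights \pi_k, means \mu_k,
% full covariance matrices \Sigma_k, N_K free parameters.
\[
\mathrm{MDL}(K) = -\sum_{i=1}^{n} \log \sum_{k=1}^{K} \pi_k\,
  \mathcal{N}(x_i \mid \mu_k, \Sigma_k) + \frac{N_K}{2}\log n,
\qquad
N_K = (K-1) + Kd + K\,\frac{d(d+1)}{2},
\]
\[
\hat{K} = \arg\min_{K}\, \mathrm{MDL}(K).
\]
```

As a worked instance under these assumptions, d = 2 and K = 3 give N_K = 2 + 6 + 9 = 17, so with n = 1000 samples the penalty term is (17/2) ln 1000 ≈ 58.7 nats. The estimated number of clusters is the K at which the drop in negative log-likelihood no longer outweighs this growing penalty.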