Fuzzy clustering of incomplete data based on cluster dispersion

  • Authors:
  • Ludmila Himmelspach; Stefan Conrad

  • Affiliations:
  • Institute of Computer Science, Heinrich-Heine-Universität Düsseldorf, Düsseldorf, Germany (both authors)

  • Venue:
  • IPMU'10: Proceedings of the Computational Intelligence for Knowledge-Based Systems Design, and 13th International Conference on Information Processing and Management of Uncertainty
  • Year:
  • 2010

Abstract

Clustering algorithms identify groups of similar data objects within large data sets. Because traditional clustering methods were developed for complete data sets, they cannot be applied directly to many practical problems in which the data are incomplete. The approaches proposed so far for adapting clustering algorithms to missing values work well on uniformly distributed data sets, but in real-world applications clusters generally differ in size. In this paper we present an extension of existing fuzzy c-means clustering algorithms for incomplete data that uses information about the dispersion of the clusters. In experiments on artificial and real data sets we show that our approach outperforms other clustering methods for incomplete data.
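
For orientation, the following is a minimal sketch of one common way fuzzy c-means is adapted to missing values: distances are computed only over the observed features and rescaled (often called the partial distance strategy). It is an illustration under stated assumptions, not the extension proposed in the paper: the abstract does not give the dispersion-based formula, so none is implemented here. The function names `partial_sq_distances` and `fcm_memberships`, the toy data, and the encoding of missing values as NaN are all hypothetical choices made for this example.

```python
import numpy as np

def partial_sq_distances(X, V):
    """Squared partial distances between data rows X (NaN = missing)
    and cluster prototypes V. Assumes every row has at least one
    observed feature; otherwise the rescaling divides by zero."""
    n, d = X.shape
    present = ~np.isnan(X)                    # (n, d) mask of observed values
    X0 = np.where(present, X, 0.0)            # placeholder for missing entries
    diff = X0[:, None, :] - V[None, :, :]     # (n, c, d) pairwise differences
    diff = diff * present[:, None, :]         # ignore missing features
    sq = (diff ** 2).sum(axis=2)              # (n, c) sums over observed features
    counts = present.sum(axis=1)              # observed features per row
    return sq * (d / counts)[:, None]         # rescale to the full dimension

def fcm_memberships(D2, m=2.0, eps=1e-12):
    """Standard fuzzy c-means membership update from squared distances,
    with fuzzifier m > 1. Each row of the result sums to 1."""
    D2 = np.maximum(D2, eps)                  # avoid division by zero
    inv = D2 ** (-1.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)

# Toy data (hypothetical): the second object is missing its first feature.
X = np.array([[0.0, 0.0],
              [np.nan, 0.0],
              [10.0, 10.0]])
V = np.array([[0.0, 0.0],
              [10.0, 10.0]])

D2 = partial_sq_distances(X, V)
U = fcm_memberships(D2)
```

In this sketch the incomplete object is assigned using its single observed feature only. An approach that uses cluster dispersion, as the abstract describes, would additionally take into account how spread out each cluster is when handling such objects.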