The new k-windows algorithm for improving the k-means clustering algorithm

  • Authors:
  • M. N. Vrahatis;B. Boutsinas;P. Alevizos;G. Pavlides

  • Affiliations:
  • Department of Mathematics, University of Patras (UOP), University of Patras Artificial Intelligence Research Center (UPAIRC), GR-26500 Patras, Greece;Department of Business Administration, UOP, UPAIRC, GR-26500 Patras, Greece;Department of Mathematics, UOP, UPAIRC, GR-26500 Patras, Greece;Department of Computer Engineering and Inf., UOP, UPAIRC, GR-26500 Patras, Greece

  • Venue:
  • Journal of Complexity
  • Year:
  • 2002

Quantified Score

Hi-index 0.01

Visualization

Abstract

The process of partitioning a large set of patterns into disjoint and homogeneous clusters is fundamental in knowledge acquisition. It is called Clustering in the literature and it is applied in various fields including data mining, statistical data analysis, compression and vector quantization. The k-means is a very popular algorithm and one of the best for implementing the clustering process. The k-means has a time complexity that is dominated by the product of the number of patterns, the number of clusters, and the number of iterations. Also, it often converges to a local minimum. In this paper, we present an improvement of the k-means clustering algorithm, aiming at a better time complexity and partitioning accuracy. Our approach reduces the number of patterns that need to be examined for similarity, in each iteration, using a windowing technique. The latter is based on well known spatial data structures, namely the range tree, that allows fast range searches.