Two-level k-means clustering algorithm for k-τ relationship establishment and linear-time classification

  • Authors:
  • Radha Chitta;M. Narasimha Murty

  • Affiliations:
  • Department of Electrical Engineering, Indian Institute of Science, Bangalore 560012, India;Department of Computer Science and Automation, Indian Institute of Science, Bangalore 560012, India

  • Venue:
  • Pattern Recognition
  • Year:
  • 2010

Quantified Score

Hi-index 0.02

Visualization

Abstract

Partitional clustering algorithms, which partition the dataset into a pre-defined number of clusters, can be broadly classified into two types: algorithms which explicitly take the number of clusters as input and algorithms that take the expected size of a cluster as input. In this paper, we propose a variant of the k-means algorithm and prove that it is more efficient than standard k-means algorithms. An important contribution of this paper is the establishment of a relation between the number of clusters and the size of the clusters in a dataset through the analysis of our algorithm. We also demonstrate that the integration of this algorithm as a pre-processing step in classification algorithms reduces their running-time complexity.