Fast agglomerative clustering using information of k-nearest neighbors

  • Authors:
  • Chih-Tang Chang;Jim Z. C. Lai;M. D. Jeng

  • Affiliations:
  • Department of Electrical Engineering, National Taiwan Ocean University, Keelung, Taiwan 202, ROC;Department of Computer Science and Engineering, National Taiwan Ocean University, Keelung, Taiwan 202, ROC;Department of Electrical Engineering, National Taiwan Ocean University, Keelung, Taiwan 202, ROC

  • Venue:
  • Pattern Recognition
  • Year:
  • 2010

Quantified Score

Hi-index 0.01

Visualization

Abstract

In this paper, we develop a method to lower the computational complexity of pairwise nearest neighbor (PNN) algorithm. Our approach determines a set of candidate clusters being updated after each cluster merge. If the updating process is required for some of these clusters, k-nearest neighbors are found for them. The number of distance calculations for our method is O(N^2), where N is the number of data points. To further reduce the computational complexity of the proposed algorithm, some available fast search approaches are used. Compared to available approaches, our proposed algorithm can reduce the computing time and number of distance calculations significantly. Compared to FPNN, our method can reduce the computing time by a factor of about 26.8 for the data set from a real image. Compared with PMLFPNN, our approach can reduce the computing time by a factor of about 3.8 for the same data set.