Lightweight clustering technique for distributed data mining applications

  • Authors:
  • Lamine M. Aouad;Nhien-An Le-Khac;Tahar M. Kechadi

  • Affiliations:
  • School of Computer Science and Informatics, University College Dublin, Ireland;School of Computer Science and Informatics, University College Dublin, Ireland;School of Computer Science and Informatics, University College Dublin, Ireland

  • Venue:
  • ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Many parallel and distributed clustering algorithms have already been proposed. Most of them are based on the aggregation of local models according to some collected local statistics. In this paper, we propose a lightweight distributed clustering algorithm based on minimum variance increases criterion which requires a very limited communication overhead. We also introduce the notion of distributed perturbation to improve the globally generated clustering. We show that this algorithm improves the quality of the overall clustering and manage to find the real structure and number of clusters of the global dataset.