Distributed clustering based on sampling local density estimates

  • Authors:
  • Matthias Klusch;Stefano Lodi;Gianluca Moro

  • Affiliations:
  • Deduction and Multiagent Systems, German Research Centre for Artificial Intelligence, Saarbruecken, Germany;Department of Electronics, Computer Science and Systems, University of Bologna, Bologna, BO, Italy;Department of Electronics, Computer Science and Systems, University of Bologna, Cesena, FC, Italy

  • Venue:
  • IJCAI'03 Proceedings of the 18th international joint conference on Artificial intelligence
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

Huge amounts of data are stored in autonomous, geographically distributed sources. The discovery of previously unknown, implicit and valuable knowledge is a key aspect of the exploitation of such sources. In recent years several approaches to knowledge discovery and data mining, and in particular to clustering, have been developed, but only a few of them are designed for distributed data sources. We propose a novel distributed clustering algorithm based on non-parametric kernel density estimation, which takes into account the issues of privacy and communication costs that arise in a distributed environment.