Progressive Clustering for Database Distribution on a Grid

  • Authors:
  • Valerie FIOLET;Bernard TOURSEL

  • Affiliations:
  • University of Mons-Hainault, Belgium;(UMR CNRS 8022) University of Lille 1, Villeneuve CEDEX - FRANCE

  • Venue:
  • ISPDC '05 Proceedings of the The 4th International Symposium on Parallel and Distributed Computing
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The increasing availability of clusters and grids of workstations provides cheap and powerful ressources for distributed datamining. To exploit these ressources we need new algorithms adapted to this kind of environment, in particular with respect to the way to fragment data and to use this fragmentation. An "intelligent" distribution of data is required and can be obtained from clustering. Most existing parallel methods of clustering are developped for supercomputers with shared memory and hence can not be used on a Grid. This paper presents a new clustering algorithm, called Progressive Clustering, which executes a clustering in an efficient and incremental distributed way. The data clusters resulting from this algorithm can subsequently be used in distributed data mining tasks.