Parallel fuzzy c-means cluster analysis

  • Authors:
  • Marta V. Modenesi;Myrian C. A. Costa;Alexandre G. Evsukoff;Nelson F. F. Ebecken

  • Affiliations:
  • COPPE/Federal University of Rio de Janeiro, Rio de Janeiro, RJ, Brazil;COPPE/Federal University of Rio de Janeiro, Rio de Janeiro, RJ, Brazil;COPPE/Federal University of Rio de Janeiro, Rio de Janeiro, RJ, Brazil;COPPE/Federal University of Rio de Janeiro, Rio de Janeiro, RJ, Brazil

  • Venue:
  • VECPAR'06 Proceedings of the 7th international conference on High performance computing for computational science
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This work presents an implementation of a parallel Fuzzy c-means cluster analysis tool, which implements both aspects of cluster investigation: the calculation of clusters' centers with the degrees of membership of records to clusters, and the determination of the optimal number of clusters for the data, by using the PBM validity index to evaluate the quality of the partition. The work's main contributions are the implementation of the entire cluster's analysis process, which is a new approach in literature, integrating to clusters calculation the finding of the best natural pattern present in data, and also, the parallel processing implementation of this tool, which enables this approach to be used with vary large volumes of data, a increasing need for data analysis in nowadays industries and business databases, making the cluster analysis a feasible tool to support specialist's decision in all fields of knowledge. The results presented in the paper show that this approach is scalable and brings processing time reduction as an benefit that parallel processing can bring to the matter of cluster analysis.