Parallel Fuzzy c-Means Clustering for Large Data Sets

Authors:
Terence Kwok;Kate A. Smith;Sebastián Lozano;David Taniar
Affiliations:
-;-;-;-
Venue:
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Year:
2002

Citing 9
Cited 15

Using MPI (2nd ed.): portable parallel programming with the message-passing interface

Using MPI (2nd ed.): portable parallel programming with the message-passing interface
Data clustering: a review

ACM Computing Surveys (CSUR)
Pattern Recognition with Fuzzy Objective Function Algorithms

Pattern Recognition with Fuzzy Objective Function Algorithms
Mining Very Large Databases

Computer
Parallel k/h-Means Clustering for Large Data Sets

Euro-Par '99 Proceedings of the 5th International Euro-Par Conference on Parallel Processing
A Data-Clustering Algorithm on Distributed Memory Multiprocessors

Revised Papers from Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD
Large-Scale Parallel Data Clustering

ICPR '96 Proceedings of the International Conference on Pattern Recognition (ICPR '96) Volume IV-Volume 7472 - Volume 7472
A Scalable Parallel Subspace Clustering Algorithm for Massive Data Sets

ICPP '00 Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters

IEEE Transactions on Computers

Some studies on fuzzy clustering of psychosis data

International Journal of Business Intelligence and Data Mining
A family of Extended Fuzzy Description Logics

International Journal of Business Intelligence and Data Mining
Skin Pores Detection for Image-Based Skin Analysis

IDEAL '08 Proceedings of the 9th International Conference on Intelligent Data Engineering and Automated Learning
Examining the potential parallel scalability of a fuzzy semi-supervised classification algorithm

SpringSim '09 Proceedings of the 2009 Spring Simulation Multiconference
RFID-based human behavior modeling and anomaly detection for elderly care

Mobile Information Systems
RFID-based human behavior modeling and anomaly detection for elderly care

Mobile Information Systems
An SMP soft classification algorithm for remote sensing

Proceedings of the 19th High Performance Computing Symposia
Skin surface reconstruction from stereo images

Proceedings of the 4th International Conference on Uniquitous Information Management and Communication
ASCCN: Arbitrary Shaped Clustering Method with Compatible Nucleoids

International Journal of Data Warehousing and Mining
Finding Associations in Composite Data Sets: The CFARM Algorithm

International Journal of Data Warehousing and Mining
Weak Ratio Rules: A Generalized Boolean Association Rules

International Journal of Data Warehousing and Mining
Data Field for Hierarchical Clustering

International Journal of Data Warehousing and Mining
FAR-miner: a fast and efficient algorithm for fuzzy association rule mining

International Journal of Business Intelligence and Data Mining
Weighted Fuzzy-Possibilistic C-Means Over Large Data Sets

International Journal of Data Warehousing and Mining
Document clustering based on web search hit counts

International Journal of Business Intelligence and Data Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

The parallel fuzzy c-means (PFCM) algorithm for clustering large data sets is proposed in this paper. The proposed algorithm is designed to run on parallel computers of the Single Program Multiple Data (SPMD) model type with the Message Passing Interface (MPI). A comparison is made between PFCM and an existing parallel k-means (PKM) algorithm in terms of their parallelisation capability and scalability. In an implementation of PFCM to cluster a large data set from an insurance company, the proposed algorithm is demonstrated to have almost ideal speedups as well as an excellent scaleup with respect to the size of the data sets.