Editorial: New fuzzy c-means clustering model based on the data weighted approach

Authors:
Chenglong Tang;Shigang Wang;Wei Xu
Affiliations:
School of Mechanical and Dynamical Engineering of Shanghai Jiao Tong University, No.800 Dong Chuan Road, Minhang District, Shanghai 200240, PR China;School of Mechanical and Dynamical Engineering of Shanghai Jiao Tong University, No.800 Dong Chuan Road, Minhang District, Shanghai 200240, PR China;School of Mechanical and Dynamical Engineering of Shanghai Jiao Tong University, No.800 Dong Chuan Road, Minhang District, Shanghai 200240, PR China
Venue:
Data & Knowledge Engineering
Year:
2010

Abstract

This paper proposes a data weighted fuzzy c-means clustering approach. Unlike most existing fuzzy clustering approaches, it takes the internal connectivity of all data points into account. The new model introduces a vector of exponent impact factors and an influence exponent, which together steer the clustering process. Data weighted clustering produces three categories of parameters simultaneously: fuzzy membership degrees, exponent impact factors, and cluster prototypes. A new fuzzy algorithm, DWG-K, is developed by combining the data weighted approach with the Gustafson-Kessel (G-K) algorithm. Two groups of numerical experiments were carried out. Group 1 compares the clustering performance of DWG-K against G-K; the results show that DWG-K obtains better clustering quality while keeping the same level of computational efficiency as G-K. Group 2 examines the ability of DWG-K to mine outliers, with the well-known Local Outlier Factor (LOF) method as the counterpart; the results show that DWG-K has a considerable advantage over LOF in computational efficiency, and that the outliers it mines are global. The paper points out that the data weighted clustering approach has unique advantages when mining outliers in large-scale data sets, when clustering a data set for better results, and especially when both tasks are performed simultaneously.
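
The abstract does not state the DWG-K objective function, so the following is only a minimal sketch of the general idea of weighting data points inside fuzzy c-means, not the authors' algorithm. It assumes fixed, user-supplied per-point weights and plain Euclidean distance, whereas the paper learns its exponent impact factors jointly with the memberships and prototypes and builds on G-K. The names `weighted_fcm`, `weights`, and `init_centers` are hypothetical and chosen for illustration.

```python
import math


def euclid(p, q):
    """Euclidean distance between two equal-length coordinate sequences."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def weighted_fcm(points, weights, init_centers, m=2.0, iters=50):
    """Fuzzy c-means with fixed per-point weights (illustrative assumption).

    Minimizes sum_k w_k * sum_i u_ik^m * ||x_k - v_i||^2 by alternating
    the membership update and the prototype update. Weights must be positive.
    """
    centers = [list(v) for v in init_centers]
    c = len(centers)
    dim = len(points[0])
    power = 2.0 / (m - 1.0)
    u = []
    for _ in range(iters):
        # Membership update: standard FCM rule. A fixed weight w_k scales
        # every term of point k equally, so it cancels out here.
        u = []
        for p in points:
            d = [max(euclid(p, v), 1e-12) for v in centers]
            u.append([1.0 / sum((d[i] / d[j]) ** power for j in range(c))
                      for i in range(c)])
        # Prototype update: mean of the points weighted by w_k * u_ik^m,
        # so low-weight points pull the prototype less.
        for i in range(c):
            coef = [w * row[i] ** m for w, row in zip(weights, u)]
            total = sum(coef)
            centers[i] = [sum(cf * p[a] for cf, p in zip(coef, points)) / total
                          for a in range(dim)]
    return centers, u


# Two tight groups plus one distant point that is given a small weight.
points = [(0.0, 0.0), (0.2, 0.1), (0.1, 0.3),
          (5.0, 5.0), (5.2, 4.9), (4.9, 5.1),
          (20.0, -20.0)]
weights = [1.0] * 6 + [0.01]
centers, memberships = weighted_fcm(points, weights, [points[0], points[3]])
print(centers)
```

In this sketch the weight only enters the prototype update, which is why down-weighting the distant point keeps the prototypes close to the two groups. A model that also learns the weights, as the abstract describes, would need an additional update step and a constraint on the weights that are not given in this record.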