Data Set Homeomorphism Transformation Based Meta-clustering

Authors:
Xianchao Zhang;Yu Zong;He Jiang;Xinyue Liu
Affiliations:
School of Software, Dalian University of Technology, Dalian 116621, China;School of Software, Dalian University of Technology, Dalian 116621, China;School of Software, Dalian University of Technology, Dalian 116621, China;School of Software, Dalian University of Technology, Dalian 116621, China
Venue:
ICCS '07 Proceedings of the 7th international conference on Computational Science, Part III: ICCS 2007
Year:
2007

Citing 9
Cited 0

Automatic subspace clustering of high dimensional data for data mining applications

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
OPTICS: ordering points to identify the clustering structure

SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
FREM: fast and robust EM clustering for large data sets

Proceedings of the eleventh international conference on Information and knowledge management
BIRCH: A New Data Clustering Algorithm and Its Applications

Data Mining and Knowledge Discovery
Principal Direction Divisive Partitioning

Data Mining and Knowledge Discovery
Chameleon: Hierarchical Clustering Using Dynamic Modeling

Computer
WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases

VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Efficient and Effective Clustering Methods for Spatial Data Mining

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
STING: A Statistical Information Grid Approach to Spatial Data Mining

VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases

Quantified Score

Hi-index	0.00

Visualization

Abstract

Clustering analysis is an important data mining technique with a variety of applications. In this paper, the data set is treated in a dynamic way and a Data Set Homeomorphism Transformation Based Meta-Clustering algorithm (DSHTBMC) is proposed. DSHTBMC decomposes the task of clustering into multiple stages. It firstly constructs a series of homeomorphous data sets ranging from high regularity to low, and then iteratively clusters each homeomorphism data set based on the clustering result of the preceding homeomorphism data set. Since data sets of high regularities are easier to be clustered, and the clustering result of each homeomorphism data set can be used to induce high quality clusters in the following-up homeomorphism data set, in this way, the hardness of the problem is decreased. Two strategies (i.e., Displacement and Noising) for data set homeomorphism transformation are proposed, with classical hierarchical divisive method---Bisecting k-means as DSHTBMC's subordinate clustering algorithm, two new clustering algorithms---HD-DSHTBMC-D and HD-DSHTBMC-N are obtained. Experimental results indicate that the new clustering algorithms are remarkably better than Bisecting k-means algorithm in terms of clustering quality.