Automatic subspace clustering of high dimensional data for data mining applications
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Constrained K-means Clustering with Background Knowledge
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
STING: A Statistical Information Grid Approach to Spatial Data Mining
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Automatic Subspace Clustering of High Dimensional Data
Data Mining and Knowledge Discovery
GCHL: A grid-clustering algorithm for high-dimensional very large spatial data bases
Pattern Recognition Letters
Design and implementation of a data mining grid-aware architecture
Future Generation Computer Systems - Special section: Data mining in grid computing environments
Hi-index | 0.00 |
Clustering analysis is an important issue in data mining fields. Clustering in high dimensional space is especially difficult for a series of problems, such as the sparseness of spatial distribution of data, too much noise data points. Based on the analysis of current clustering algorithms can not get satisfying clustering results of high dimensional data. The theory of rough set and the idea of semi-supervised are introduced. And a semi-supervised grid clustering algorithm RSGrid based on the reduction of rough set theory is proposed. The theoretical analysis and experimental results indicate the algorithm can solve the problem of clustering in high dimensional space efficiently.