Automatic subspace clustering of high dimensional data for data mining applications
SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
WaveCluster: A Multi-Resolution Clustering Approach for Very Large Spatial Databases
VLDB '98 Proceedings of the 24rd International Conference on Very Large Data Bases
Optimal Grid-Clustering: Towards Breaking the Curse of Dimensionality in High-Dimensional Clustering
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
STING: A Statistical Information Grid Approach to Spatial Data Mining
VLDB '97 Proceedings of the 23rd International Conference on Very Large Data Bases
Clustering in Dynamic Spatial Databases
Journal of Intelligent Information Systems
Mountain Clustering on Non-Uniform Grids Using P-Trees
Fuzzy Optimization and Decision Making
GCHL: A grid-clustering algorithm for high-dimensional very large spatial data bases
Pattern Recognition Letters
A Shrinking-Based Clustering Approach for Multidimensional Data
IEEE Transactions on Knowledge and Data Engineering
Survey of clustering algorithms
IEEE Transactions on Neural Networks
Hi-index | 0.00 |
In order to solve the problem that traditional grid-based clustering techniques lack of the capability of dealing with data of high dimensionality, we propose an intersecting grid partition method and a density estimation method. The partition method can greatly reduce the number of grid cells generated in high dimensional data space and make the neighbor-searching easily. On basis of the two methods, we propose grid-based clustering algorithm (GCOD), which merges two intersecting grids according to density estimation. The algorithm requires only one parameter and the time complexity is linear to the size of the input data set or data dimension. The experimental results show that GCOD can discover arbitrary shapes of clusters and scale well.