A new hybrid method based on partitioning-based DBSCAN and ant clustering

Authors:
Hua Jiang;Jing Li;Shenghe Yi;Xiangyang Wang;Xin Hu
Affiliations:
College of Computer Science, Northeast Normal University, Changchun, Jilin 130117, China;College of Computer Science, Northeast Normal University, Changchun, Jilin 130117, China;College of Computer Science, Northeast Normal University, Changchun, Jilin 130117, China;College of Computer Science, Northeast Normal University, Changchun, Jilin 130117, China;College of Computer Science, Northeast Normal University, Changchun, Jilin 130117, China
Venue:
Expert Systems with Applications: An International Journal
Year:
2011

Citing 13
Cited 0

The dynamics of collective sorting robot-like ants and ant-like robots

Proceedings of the first international conference on simulation of adaptive behavior on From animals to animats
Diversity and adaptation in populations of clustering ants

SAB94 Proceedings of the third international conference on Simulation of adaptive behavior : from animals to animats 3: from animals to animats 3
Data clustering: a review

ACM Computing Surveys (CSUR)
Approaches for scaling DBSCAN algorithm to large spatial databases

Journal of Computer Science and Technology
Vector quantization based on genetic simulated annealing

Signal Processing
Alternatives to the k-means algorithm that find better clusterings

Proceedings of the eleventh international conference on Information and knowledge management
On the performance of ant-based clustering

Design and application of hybrid intelligent systems
ST-DBSCAN: An algorithm for clustering spatial-temporal data

Data & Knowledge Engineering
Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability)

Data Clustering: Theory, Algorithms, and Applications (ASA-SIAM Series on Statistics and Applied Probability)
A hybridized approach to data clustering

Expert Systems with Applications: An International Journal
Adaptation of the F-measure to cluster based lexicon quality evaluation

Evalinitiatives '03 Proceedings of the EACL 2003 Workshop on Evaluation Initiatives in Natural Language Processing: are evaluation methods, metrics and resources reusable?
KNN-kernel density-based clustering for high-dimensional multivariate data

Computational Statistics & Data Analysis
Survey of clustering algorithms

IEEE Transactions on Neural Networks

Quantified Score

Hi-index	12.05

Visualization

Abstract

Clustering problem is an unsupervised learning problem. It is a procedure that partition data objects into matching clusters. The data objects in the same cluster are quite similar to each other and dissimilar in the other clusters. Density-based clustering algorithms find clusters based on density of data points in a region. DBSCAN algorithm is one of the density-based clustering algorithms. It can discover clusters with arbitrary shapes and only requires two input parameters. DBSCAN has been proved to be very effective for analyzing large and complex spatial databases. However, DBSCAN needs large volume of memory support and often has difficulties with high-dimensional data and clusters of very different densities. So, partitioning-based DBSCAN algorithm (PDBSCAN) was proposed to solve these problems. But PDBSCAN will get poor result when the density of data is non-uniform. Meanwhile, to some extent, DBSCAN and PDBSCAN are both sensitive to the initial parameters. In this paper, we propose a new hybrid algorithm based on PDBSCAN. We use modified ant clustering algorithm (ACA) and design a new partitioning algorithm based on 'point density' (PD) in data preprocessing phase. We name the new hybrid algorithm PACA-DBSCAN. The performance of PACA-DBSCAN is compared with DBSCAN and PDBSCAN on five data sets. Experimental results indicate the superiority of PACA-DBSCAN algorithm.