Proceedings of the ACM first Ph.D. workshop in CIKM
Rough-DBSCAN: A fast hybrid density based clustering method for large data sets
Pattern Recognition Letters
Weighted k-nearest leader classifier for large data sets
PReMI'07 Proceedings of the 2nd international conference on Pattern recognition and machine intelligence
APSCAN: A parameter free algorithm for clustering
Pattern Recognition Letters
Behavioural Proximity Discovery: an adaptive approach for root cause analysis
International Journal of Business Intelligence and Data Mining
Hi-index | 0.00 |
Density based clustering techniques like DBSCAN can find arbitrary shaped clusters along with noisy outliers. A severe drawback of the method is its huge time requirement which makes it a unsuitable one for large data sets. One solution is to apply DBSCAN using only a few selected prototypes. But because of this the clustering result can deviate from that which uses the full data set. A novel method proposed in the paper is to use two types of prototypes, one at a coarser level meant to reduce the time requirement, and the other at a finer level meant to reduce the deviation of the result. Prototypes are derived using leaders clustering method. The proposed hybrid clustering method called l-DBSCAN is analyzed and experimentally compared with DBSCAN which shows that it could be a suitable one for large data sets.