G-TREACLE: a new grid-based and tree-alike pattern clustering technique for large databases

Authors:
Cheng-Fa Tsai;Chia-Chen Yen
Affiliations:
Department of Management Information Systems, National Pingtung University of Science and Technology, Pingtung, Taiwan;Department of Management Information Systems, National Pingtung University of Science and Technology, Pingtung, Taiwan
Venue:
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Year:
2008

Citing 3
Cited 6

Automatic subspace clustering of high dimensional data for data mining applications

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
ACODF: a novel data clustering approach for data mining in large databases

Journal of Systems and Software - Special issue: Performance modeling and analysis of computer systems and networks
ANGEL: a new effective and efficient hybrid clustering technique for large databases

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining

SDCC: A New Stable Double-Centroid Clustering Technique Based on K-Means for Non-spherical Patterns

ISNN 2009 Proceedings of the 6th International Symposium on Neural Networks: Advances in Neural Networks - Part II
DDCT: detecting density differences using a novel clustering technique

MUSP'09 Proceedings of the 9th WSEAS international conference on Multimedia systems & signal processing
FARM: a new efficient and effective data clustering algorithm

MUSP'09 Proceedings of the 9th WSEAS international conference on Multimedia systems & signal processing
GOD-CS: A New Grid-Oriented Dissection Clustering Scheme for Large Databases

ADMA '09 Proceedings of the 5th International Conference on Advanced Data Mining and Applications
The investigation of discovering potential musical instruments teachers by effective data clustering scheme

WSEAS Transactions on Computers
EIDBSCAN: An Extended Improving DBSCAN algorithm with sampling techniques

International Journal of Business Intelligence and Data Mining

Quantified Score

Hi-index	0.00

Visualization

Abstract

As data mining having attracted a significant amount of research attention, many clustering methods have been proposed in past decades. However, most of those techniques have annoying obstacles in precise pattern recognition. This paper presents a new clustering algorithm termed G-TREACLE, which can fulfill numerous clustering requirements in data mining applications. As a hybrid approach that adopts grid-based concept, the proposed algorithm recognizes the solid framework of clusters and, then, identifies the arbitrary edge of clusters by utilization of a new density-based expansion process, which named "tree-alike pattern". Experimental results illustrate that the new algorithm precisely recognizes the whole cluster, and efficiently reduces the problem of high computational time. It also indicates that the proposed new clustering algorithm performs better than several existing well-known approaches such as the K-means, DBSCAN, CLIQUE and GDH algorithms, while produces much smaller errors than the K-means, DBSCAN, CLIQUE and GDH approaches in most the cases examined herein