Self-Tuning Clustering: An Adaptive Clustering Method for Transaction Data
DaWaK 2000 Proceedings of the 4th International Conference on Data Warehousing and Knowledge Discovery
A robust and efficient clustering algorithm based on cohesion self-merging
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
An Efficient Mining and Clustering Algorithm for Interactive Walk-Through Traversal Patterns
WI '04 Proceedings of the 2004 IEEE/WIC/ACM International Conference on Web Intelligence
IEEE Transactions on Knowledge and Data Engineering
TCSOM: Clustering Transactions Using Self-Organizing Map
Neural Processing Letters
k-ANMI: A mutual information based clustering algorithm for categorical data
Information Fusion
Bregman bubble clustering: A robust framework for mining dense clusters
ACM Transactions on Knowledge Discovery from Data (TKDD)
Expert Systems with Applications: An International Journal
A new clustering algorithm for transaction data via caucus
PAKDD'03 Proceedings of the 7th Pacific-Asia conference on Advances in knowledge discovery and data mining
Transaction clustering using a seeds based approach
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
An improvement algorithm for accessing patterns through clustering in interactive VRML environments
PCM'04 Proceedings of the 5th Pacific Rim conference on Advances in Multimedia Information Processing - Volume Part III
Rare association rule mining via transaction clustering
AusDM '08 Proceedings of the 7th Australasian Data Mining Conference - Volume 87
Hi-index | 0.03 |
In this paper, we devise an efficient algorithm for clustering market-basket data items. In view of the nature of clustering market basket data, we devise in this paper a novel measurement, called the small-large (abbreviated as SL) ratio, and utilize this ratio to perform the clustering. With this SL ratio measurement, we develop an efficient clustering algorithm for data items to minimize the SL ratio in each group. The proposed algorithm not only incurs an execution time that is significantly smaller than that by prior work but also leads to the clustering results of very good quality.