Clustering transactions using large items
Proceedings of the eighth international conference on Information and knowledge management
ROCK: a robust clustering algorithm for categorical attributes
Information Systems
COOLCAT: an entropy-based algorithm for categorical clustering
Proceedings of the eleventh international conference on Information and knowledge management
Interactive Clustering for Transaction Data
DaWaK '01 Proceedings of the Third International Conference on Data Warehousing and Knowledge Discovery
CLOPE: a fast and effective clustering algorithm for transactional data
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Characterizing Web User Accesses: A Transactional Approach to Web Log Clustering
ITCC '02 Proceedings of the International Conference on Information Technology: Coding and Computing
Towards parameter-free data mining
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Subspace clustering for high dimensional categorical data
ACM SIGKDD Explorations Newsletter
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
GHIC: A Hierarchical Pattern-Based Clustering Algorithm for Grouping Web Transactions
IEEE Transactions on Knowledge and Data Engineering
A general model for clustering binary data
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Introduction to Data Mining, (First Edition)
Introduction to Data Mining, (First Edition)
Efficiently clustering transactional data with weighted coverage density
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Clicks: An effective algorithm for mining subspace clusters in categorical datasets
Data & Knowledge Engineering
Top-Down Parameter-Free Clustering of High-Dimensional Categorical Data
IEEE Transactions on Knowledge and Data Engineering
Mining Projected Clusters in High-Dimensional Spaces
IEEE Transactions on Knowledge and Data Engineering
Adapting the right measures for K-means clustering
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Adherence clustering: an efficient method for mining market-basket clusters
Information Systems
Hi-index | 0.00 |
In this paper we consider the problem of clustering transaction data. Most of existing transactional clustering algorithms encounter difficulties in the presence of overlapping clusters with a large number outlier items that do not contribute to formation of clusters. Furthermore, the vast majority of existing approaches are dependent on multiple parameters which may be difficult to tune, especially in real-life applications. To these problems, we propose a parameter-free transactional clustering algorithm. Our algorithm first scans the data set in a sequential manner such that the destination of the next transaction is guided by a novel objective function. Once the first scan of the data set is completed, the algorithm performs a few other passes over the data set in order to refine the clustering. The proposed algorithm is able to automatically identify clusters in the presence of large number of outlier items in the data set without any parameters setting by the user. The suitability of our proposal has been demonstrated through an empirical study using synthetic and real data sets.