In traditional data clustering, the similarity of a cluster of objects is measured by the pairwise similarity of the objects in that cluster. We argue that such measures are not appropriate for transactions, which are sets of items. We propose the notion of large items, i.e., items contained in at least some minimum fraction of the transactions in a cluster, to measure the similarity of a cluster of transactions. The intuition behind our clustering criterion is that there should be many large items within a cluster and little overlap of such items across clusters. We discuss the rationale behind our approach and its implications for providing a better solution to the clustering problem. We present a clustering algorithm based on the new clustering criterion and evaluate its effectiveness.
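The notion of large items can be illustrated with a minimal Python sketch. It is not the algorithm or the exact criterion from the paper: the function names `large_items` and `summarize`, the `min_support` parameter, the use of "at least" for the threshold, and the toy market-basket data are all assumptions made for illustration. The sketch only computes, for a given clustering, the large items of each cluster, how many there are in total, and how many are shared between pairs of clusters.

```python
from collections import Counter
from itertools import combinations


def large_items(cluster, min_support):
    """Return the items contained in at least `min_support` (a fraction
    between 0 and 1) of the transactions in `cluster`.

    `cluster` is a list of transactions; each transaction is a set of items.
    The name and signature are illustrative assumptions.
    """
    # Count in how many transactions each item occurs.
    counts = Counter(item for transaction in cluster for item in set(transaction))
    threshold = min_support * len(cluster)
    return {item for item, count in counts.items() if count >= threshold}


def summarize(clusters, min_support):
    """Compute per-cluster large items, their total number, and the number
    of large items shared between pairs of clusters.

    Under the stated intuition, a good clustering has a high total and a
    low overlap. How the two are combined into one score is left open here.
    """
    larges = [large_items(cluster, min_support) for cluster in clusters]
    total_large = sum(len(items) for items in larges)
    overlap = sum(len(a & b) for a, b in combinations(larges, 2))
    return larges, total_large, overlap


# Hypothetical toy data: two clusters of market-basket transactions.
clusters = [
    [{"bread", "milk"}, {"bread", "milk", "eggs"}, {"bread", "butter"}],
    [{"beer", "chips"}, {"beer", "chips", "salsa"}, {"beer", "milk"}],
]

larges, total_large, overlap = summarize(clusters, min_support=0.6)
print(larges, total_large, overlap)
```

With a minimum support of 0.6, "milk" is large in the first cluster (two of three transactions) but not in the second (one of three), so the two clusters share no large items in this toy example.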