Mining association rules between sets of items in large databases
SIGMOD '93 Proceedings of the 1993 ACM SIGMOD international conference on Management of data
Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
Learning to classify text from labeled and unlabeled documents
AAAI '98/IAAI '98 Proceedings of the fifteenth national/tenth conference on Artificial intelligence/Innovative applications of artificial intelligence
Pruning and summarizing the discovered associations
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Probabilistic latent semantic indexing
Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Scalable association-based text classification
Proceedings of the ninth international conference on Information and knowledge management
IEEE Transactions on Pattern Analysis and Machine Intelligence
ACM SIGIR Forum
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Using conjunction of attribute values for classification
Proceedings of the eleventh international conference on Information and knowledge management
High-performing feature selection for text classification
Proceedings of the eleventh international conference on Information and knowledge management
Scoring the Data Using Association Rules
Applied Intelligence
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
A Probabilistic Analysis of the Rocchio Algorithm with TFIDF for Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
CMAR: Accurate and Efficient Classification Based on Multiple Class-Association Rules
ICDM '01 Proceedings of the 2001 IEEE International Conference on Data Mining
Building Hierarchical Classifiers Using Class Proximity
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Text Document Categorization by Term Association
ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
The Journal of Machine Learning Research
An extensive empirical study of feature selection metrics for text classification
The Journal of Machine Learning Research
Feature Extraction Based on ICA for Binary Classification Problems
IEEE Transactions on Knowledge and Data Engineering
Mining Strong Affinity Association Patterns in Data Sets with Skewed Support Distribution
ICDM '03 Proceedings of the Third IEEE International Conference on Data Mining
ICML '04 Proceedings of the twenty-first international conference on Machine learning
SAT-MOD: moderate itemset fittest for text classification
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
OCFS: optimal orthogonal centroid feature selection for text categorization
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Data Mining and Knowledge Discovery
2-PS based associative text classification
DaWaK'05 Proceedings of the 7th international conference on Data Warehousing and Knowledge Discovery
Class normalization in centroid-based text categorization
Information Sciences: an International Journal
Top-down mining of frequent closed patterns from very high dimensional data
Information Sciences: an International Journal
Automatic text categorization based on content analysis with cognitive situation models
Information Sciences: an International Journal
Mining correlated subgraphs in graph databases
PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Abstracting for Dimensionality Reduction in Text Classification
International Journal of Intelligent Systems
Hi-index | 0.07 |
The use of association patterns for text categorization has attracted great interest and a variety of useful methods have been developed. However, the key characteristics of pattern-based text categorization remain unclear. Indeed, there are still no concrete answers for the following two questions: Firstly, what kind of association pattern is the best candidate for pattern-based text categorization? Secondly, what is the most desirable way to use patterns for text categorization? In this paper, we focus on answering the above two questions. More specifically, we show that hyperclique patterns are more desirable than frequent patterns for text categorization. Along this line, we develop an algorithm for text categorization using hyperclique patterns. As demonstrated by our experimental results on various real-world text documents, our method provides much better computational performance than state-of-the-art methods while retaining classification accuracy.