SAT-MOD: moderate itemset fittest for text classification

Authors:
Jianlin Feng;Huijun Liu;Jing Zou
Affiliations:
Huazhong Univ. of Sci. & Tech., Hubei, China;Huazhong Univ. of Sci. & Tech., Hubei, China;Huazhong Univ. of Sci. & Tech., Hubei, China
Venue:
WWW '05 Special interest tracks and posters of the 14th international conference on World Wide Web
Year:
2005

Citing 7
Cited 5

Inductive learning algorithms and representations for text categorization

Proceedings of the seventh international conference on Information and knowledge management
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
FOIL: A Midterm Report

ECML '93 Proceedings of the European Conference on Machine Learning
Fast Algorithms for Mining Association Rules in Large Databases

VLDB '94 Proceedings of the 20th International Conference on Very Large Data Bases
Text Document Categorization by Term Association

ICDM '02 Proceedings of the 2002 IEEE International Conference on Data Mining
eMailSift: mining-based approaches to email classification

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval

Adapting association patterns for text categorization: weaknesses and enhancements

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
On the strength of hyperclique patterns for text categorization

Information Sciences: an International Journal
Word co-occurrence features for text classification

Information Systems
Sentential association based text classification systems

APWeb'05 Proceedings of the 7th Asia-Pacific web conference on Web Technologies Research and Development
Feature selection, rule extraction, and score model: making ATC competitive with SVM

RSKT'06 Proceedings of the First international conference on Rough Sets and Knowledge Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present a novel association-based method called SAT-MOD for text classification. SAT-MOD views a sentence rather than a document as a transaction, and uses a novel heuristic called MODFIT to select the most significant itemsets for constructing a category classifier. The effectiveness of SAT-MOD has been demonstrated comparable to well-known alternatives such as LinearSVM and much better than current document-level words association based methods on the Reuters corpus.