Text classification with tournament methods

Authors:
Louise Guthrie;Wei Liu;Yunqing Xia
Affiliations:
Department of Computer Science, University of Sheffield, Sheffield, UK;Department of Computer Science, University of Sheffield, Sheffield, UK;Department of Systems Engineering and Engineering Management, The Chinese University of Hong Kong, Hong Kong
Venue:
TSD'05 Proceedings of the 8th international conference on Text, Speech and Dialogue
Year:
2005

Citing 7
Cited 0

C4.5: programs for machine learning

C4.5: programs for machine learning
An Evaluation of Statistical Approaches to Text Categorization

Information Retrieval
A statistical learning learning model of text classification for support vector machines

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Naive (Bayes) at Forty: The Independence Assumption in Information Retrieval

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Document classification by machine: theory and practice

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Email categorization with tournament methods

NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
FASiL adaptive email categorization system

CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper compares the effectiveness of n-way (n2) classification using a probabilistic classifier to the use of multiple binary probabilistic classifiers. We describe the use of binary classifiers in both Round Robin and Elimination tournaments, and compare both tournament methods and n-way classification when determining the language of origin of speakers (both native and non-native English speakers) speaking English. We conducted hundreds of experiments by varying the number of categories as well as the categories themselves. In all experiments the tournament methods performed better than the n-way classifier, and of these tournament methods, on average, Round Robin performs slightly better than the Elimination tournament.