Case-Sensitivity of Classifiers for WSD: Complex Systems Disambiguate Tough Words Better

Authors:
Harri M. Saarikoski;Steve Legrand;Alexander Gelbukh
Affiliations:
KIT Language Technology Doctorate School, Helsinki University, Finland;Department of Computer Science, University of Jyväskylä, Finland;Instituto Politecnico Nacional, Mexico City, Mexico
Venue:
CICLing '07 Proceedings of the 8th International Conference on Computational Linguistics and Intelligent Text Processing
Year:
2009

Citing 17
Cited 0

Instance-Based Learning Algorithms

Machine Learning
Generalizing from case studies: a case study

ML92 Proceedings of the ninth international workshop on Machine learning
C4.5: programs for machine learning

C4.5: programs for machine learning
The nature of statistical learning theory

The nature of statistical learning theory
Characterizing Model Erros and Differences

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Introduction to the special issue on evaluating word sense disambiguation systems

Natural Language Engineering
Evaluating sense disambiguation across diverse parameter spaces

Natural Language Engineering
Parameter optimization for machine-learning of word sense disambiguation

Natural Language Engineering
Word sense disambiguation with pattern learning and automatic feature selection

Natural Language Engineering
Learning from little: comparison of classifiers given little training

PKDD '04 Proceedings of the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases
A dynamically growing self-organizing tree (DGSOT) for hierarchical clustering gene expression profiles

Bioinformatics
Assessing system agreement and instance difficulty in the lexical sample tasks of SENSEVAL-2

WSD '02 Proceedings of the ACL-02 workshop on Word sense disambiguation: recent successes and future directions - Volume 8
YALE: rapid prototyping for complex data mining tasks

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Estimating continuous distributions in Bayesian classifiers

UAI'95 Proceedings of the Eleventh conference on Uncertainty in artificial intelligence
Building an optimal WSD ensemble using per-word selection of best system

CIARP'06 Proceedings of the 11th Iberoamerican conference on Progress in Pattern Recognition, Image Analysis and Applications
Combining heterogeneous classifiers for word-sense disambiguation

SENSEVAL '01 The Proceedings of the Second International Workshop on Evaluating Word Sense Disambiguation Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a novel method for improving disambiguation accuracy by building an optimal ensemble (OE) of systems where we predict the best available system for target word using a priori case factors (e.g. amount of training per sense). We report promising results of a series of best-system prediction tests (best prediction accuracy is 0.92) and show that complex/simple systems disambiguate tough/easy words better. The method provides the following benefits: (1) higher disambiguation accuracy for virtually any base systems (current best OE yields close to 2% accuracy gain over Senseval-3 state of the art) and (2) economical way of building more effective ensembles of all types (e.g. optimal, weighted voting and cross-validation based). The method is also highly scalable in that it utilizes readily available factors available for any ambiguous word in any language for estimating word difficulty and defines classifier complexity using known properties only.