An empirical study of the behavior of active learning for word sense disambiguation

Authors:
Jinying Chen;Andrew Schein;Lyle Ungar;Martha Palmer
Affiliations:
University of Pennsylvania, Philadelphia, PA;University of Pennsylvania, Philadelphia, PA;University of Pennsylvania, Philadelphia, PA;University of Colorado, Boulder, CO
Venue:
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Year:
2006

Abstract

This paper shows that two uncertainty-based active learning methods, combined with a maximum entropy model, work well for learning English verb senses. Analysis of the learning process at both the instance and feature levels suggests that careful feature extraction is important for active learning to be useful for word sense disambiguation (WSD). Based on this analysis, the overfitting observed during active learning is identified as classic machine-learning overfitting.
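
To make the idea of uncertainty-based selection concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not name the two methods, so entropy and margin scoring are used here only as two common uncertainty criteria. The function names and the example sense distributions are hypothetical. In practice the probabilities would come from a trained probabilistic classifier such as a maximum entropy model.

```python
import math


def entropy_score(probs):
    """Shannon entropy of a predicted sense distribution (higher = more uncertain)."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def margin_score(probs):
    """Gap between the two most probable senses (smaller = more uncertain).

    Assumes at least two candidate senses.
    """
    top = sorted(probs, reverse=True)
    return top[0] - top[1]


def select_queries(pool_probs, k, method="entropy"):
    """Return indices of the k most uncertain unlabeled instances."""
    indices = range(len(pool_probs))
    if method == "entropy":
        # Most uncertain first: largest entropy.
        ranked = sorted(indices, key=lambda i: -entropy_score(pool_probs[i]))
    elif method == "margin":
        # Most uncertain first: smallest margin.
        ranked = sorted(indices, key=lambda i: margin_score(pool_probs[i]))
    else:
        raise ValueError(f"unknown method: {method}")
    return ranked[:k]


# Hypothetical predicted sense distributions for three unlabeled instances.
pool = [
    [0.90, 0.05, 0.05],  # confident prediction
    [0.40, 0.35, 0.25],  # probability spread over all three senses
    [0.50, 0.49, 0.01],  # near tie between the top two senses
]

print(select_queries(pool, 1, method="entropy"))
print(select_queries(pool, 1, method="margin"))
```

On this toy pool the two criteria pick different instances first. Entropy favors the instance whose probability mass is spread across all senses, while margin favors the near tie between the top two senses.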