Assessing agreement on classification tasks: the kappa statistic
Computational Linguistics
Feature Extraction, Construction and Selection: A Data Mining Perspective
Feature Extraction, Construction and Selection: A Data Mining Perspective
Applied morphological processing of English
Natural Language Engineering
High precision extraction of grammatical relations
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Using an ontology to determine English countability
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Transformation-based learning in the fast lane
NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Learning the countability of English nouns from corpus data
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Introduction to the CoNLL-2000 shared task: chunking
ConLL '00 Proceedings of the 2nd workshop on Learning language in logic and the 4th conference on Computational natural language learning - Volume 7
Learning the countability of English nouns from corpus data
ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Reinforcing English countability prediction with one countability per discourse property
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
A Method for Reinforcing Noun Countability Prediction
IEICE - Transactions on Information and Systems
A Computer Science Text Corpus/Search Engine X-Tec and Its Applications
Proceedings of the 2006 conference on Information Modelling and Knowledge Bases XVII
Bootstrapping deep lexical resources: resources for courses
DeepLA '05 Proceedings of the ACL-SIGLEX Workshop on Deep Lexical Acquisition
Deep lexical acquisition of verb-particle constructions
Computer Speech and Language
Detecting article errors based on the mass count distinction
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Hi-index | 0.00 |
This paper compares a range of methods for classifying words based on linguistic diagnostics, focusing on the task of learning countabilities for English nouns. We propose two basic approaches to feature representation: distribution-based representation, which simply looks at the distribution of features in the corpus data, and agreement-based representation which analyses the level of token-wise agreement between multiple preprocessor systems. We additionally compare a single multiclass classifier architecture with a suite of binary classifiers, and combine analyses from multiple preprocessors. Finally, we present and evaluate a feature selection method.