Learning methods to combine linguistic indicators: improving aspectual classification and revealing linguistic insights

Authors:
Eric V. Siegel;Kathleen R. McKeown
Affiliations:
Columbia University;Columbia University
Venue:
Computational Linguistics
Year:
2000

Citing 41
Cited 12

An experiment in computational discrimination of English word senses

IBM Journal of Research and Development
Temporal ontology and temporal reference

Computational Linguistics - Special issue on tense and aspect
A computational model of the semantics of tense and aspect

Computational Linguistics - Special issue on tense and aspect
Machine learning of natural language

Machine learning of natural language
Adaptation in natural and artificial systems

Adaptation in natural and artificial systems
Genetic programming: on the programming of computers by means of natural selection

Genetic programming: on the programming of computers by means of natural selection
On the Handling of Continuous-Valued Attributes in Decision Tree Generation

Machine Learning
Dimensions of meaning

Proceedings of the 1992 ACM/IEEE conference on Supercomputing
Original Contribution: Stacked generalization

Neural Networks
C4.5: programs for machine learning

C4.5: programs for machine learning
The generative lexicon

Computational Linguistics
The donut problem: scalability, generalization and breeding policies in genetic programming

Advances in genetic programming
Competitively evolving decision trees against fixed training cases for natural language processing

Advances in genetic programming
Optimizing confidence of text classification by evolution of symbolic expressions

Advances in genetic programming
Natural language understanding (2nd ed.)

Natural language understanding (2nd ed.)
Classifying cue phrases in text and speech using machine learning

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Emergent linguistic rules from inducing decision trees: disambiguating discourse clue words

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Guest Editors‘ Introduction: Machine Learning and Natural Language

Machine Learning - Special issue on natural language learning
Genetic Algorithms in Search, Optimization and Machine Learning

Genetic Algorithms in Search, Optimization and Machine Learning
Machine Learning

Machine Learning
Applying classification algorithms in practice

Statistics and Computing
Induction of Decision Trees

Machine Learning
A Representation for the Adaptive Generation of Simple Sequential Programs

Proceedings of the 1st International Conference on Genetic Algorithms
Improving Minority Class Prediction Using Case-Specific Feature Weights

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Uniform Crossover in Genetic Algorithms

Proceedings of the 3rd International Conference on Genetic Algorithms
Genetic Programming for Feature Discovery and Image Discrimination

Proceedings of the 5th International Conference on Genetic Algorithms
Toward Multi-Strategy Parallel & Distributed Learning in Sequence Analysis

Proceedings of the 1st International Conference on Intelligent Systems for Molecular Biology
A Two-Level Knowledge Representation for Machine Translation: Lexical Semantics and Tense/Aspect

Proceedings of the First SIGLEX Workshop on Lexical Semantics and Knowledge Representation
Slot Grammar: A System for Simpler Construction of Practical Natural Language Grammars

Proceedings of the International Symposium on Natural Language and Logic
Linguistic indicators for language understanding: using machine learning methods to combine corpus-based indicators for aspectual classification of clauses

Linguistic indicators for language understanding: using machine learning methods to combine corpus-based indicators for aspectual classification of clauses
Automatic acquisition of lexical semantic knowledge from large corpora: the identification of semantically related words, markedness, polarity, and antonymy

Automatic acquisition of lexical semantic knowledge from large corpora: the identification of semantically related words, markedness, polarity, and antonymy
A stochastic parts program and noun phrase parser for unrestricted text

ANLC '88 Proceedings of the second conference on Applied natural language processing
Towards the automatic identification of adjectival scales: clustering adjectives according to meaning

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Distributional clustering of English words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Statistical sense disambiguation with relatively small corpora using dictionary definitions

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
A quantitative evaluation of linguistic tests for the automatic prediction of semantic markedness

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Decision lists for lexical ambiguity resolution: application to accent restoration in Spanish and French

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Degrees of stativity: the lexical representation of verb aspect

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 4
Corpus-based linguistic indicators for aspectual classification

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Semantic classes and syntactic ambiguity

HLT '93 Proceedings of the workshop on Human Language Technology
Filling knowledge gaps in a broad coverage machine translation system

IJCAI'95 Proceedings of the 14th international joint conference on Artificial intelligence - Volume 2

Identifying semantic relations in text

Exploring artificial intelligence in the new millennium
A probabilistic account of logical metonymy

Computational Linguistics
A multilingual paradigm for automatic verb classification

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Learning the countability of English nouns from corpus data

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Experiments on the Automatic Induction of German Semantic Verb Classes

Computational Linguistics
Applying machine learning to Chinese temporal relation resolution

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Combining linguistic features with weighted Bayesian classifier for temporal reference processing

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Automatic verb classification using multilingual resources

ConLL '01 Proceedings of the 2001 workshop on Computational Natural Language Learning - Volume 7
Identification of event mentions and their semantic class

EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Detecting experiences from weblogs

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Using query patterns to learn the duration of events

IWCS '11 Proceedings of the Ninth International Conference on Computational Semantics
Aspectual type and temporal relation classification

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Aspectual classification maps verbs to a small set of primitive categories in order to reason about time. This classification is necessary for interpreting temporal modifiers and assessing temporal relationships, and is therefore a required component for many natural language applications.A verb's aspectual category can be predicted by co-occurrence frequencies between the verb and certain linguistic modifiers. These frequency measures, called linguistic indicators, are chosen by linguistic insights. However, linguistic indicators used in isolation are predictively incomplete, and are therefore insufficient when used individually.In this article, we compare three supervised machine learning methods for combining multiple linguistic indicators for aspectual classification: decision trees, genetic programming, and logistic regression. A set of 14 indicators are combined for classification according to two aspectual distinctions. This approach improves the classification performance for both distinctions, as evaluated over unrestricted sets of verbs occurring across two corpora. This demonstrates the effectiveness of the linguistic indicators and provides a much-needed full-scale method for automatic aspectual classification. Moreover, the models resulting from learning reveal several linguistic insights that are relevant to aspectual classification. We also compare supervised learning methods with an unsupervised method for this task.