The disambiguation of nominalizations

Authors:
Maria Lapata
Affiliations:
Division of Informatics, University of Edinburgh, 2 Buccleuch Place, Edinburgh EH8 9LW, UK
Venue:
Computational Linguistics
Year:
2002

Citing 22
Cited 35

Interpretation as abduction

Artificial Intelligence - Special volume on natural language processing
Class-based n-gram models of natural language

Computational Linguistics
Selection and information: a class-based approach to lexical relationships

Selection and information: a class-based approach to lexical relationships
Emergent linguistic rules from inducing decision trees: disambiguating discourse clue words

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Assessing agreement on classification tasks: the kappa statistic

Computational Linguistics
Similarity-Based Models of Word Cooccurrence Probabilities

Machine Learning - Special issue on natural language learning
Understanding noun compounds

Understanding noun compounds
Structural ambiguity and lexical relations

Computational Linguistics - Special issue on using large corpora: I
Discourse segmentation by human and automated means

Computational Linguistics
Generalizing case frames using a thesaurus and the MDL principle

Computational Linguistics
Automatic verb classification using distributions of grammatical features

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Integrating symbolic and statistical representations: the lexicon pragmatics interface

ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
A computational analysis of complex noun phrases in Navy messages

ACL '84 Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics
Another look at nominal compounds

ACL '84 Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics
Distributional clustering of English words

ACL '93 Proceedings of the 31st annual meeting on Association for Computational Linguistics
Generalizing automatically generated selectional patterns

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Algorithm for automatic interpretation of noun sequences

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
Measures of distributional similarity

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Distributional similarity models: clustering vs. nearest neighbors

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Corpus-based linguistic indicators for aspectual classification

ACL '99 Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics
Identifying user corrections automatically in spoken dialogue systems

NAACL '01 Proceedings of the second meeting of the North American Chapter of the Association for Computational Linguistics on Language technologies
Learning trees and rules with set-valued features

AAAI'96 Proceedings of the thirteenth national conference on Artificial intelligence - Volume 1

Using the web to obtain frequencies for unseen bigrams

Computational Linguistics - Special issue on web as corpus
A comparison of parsing technologies for the biomedical domain

Natural Language Engineering
Deverbal compound noun analysis based on lexical conceptual structure

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
On the robustness of entropy-based similarity measures in evaluation of subcategorization acquisition systems

COLING-02 proceedings of the 6th conference on Natural language learning - Volume 20
Automatic Discovery of Part-Whole Relations

Computational Linguistics
Expressing implicit semantic relations without supervision

ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
Semantic role labeling of nominalized predicates in Chinese

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Using shallow linguistic analysis to improve search on Danish compounds

Natural Language Engineering
Automatically extracting nominal mentions of events with a bootstrapped probabilistic classifier

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Interpreting semantic relations in noun compounds via verb semantics

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Labeling chinese predicates with semantic roles

Computational Linguistics
On the time-dependent occupancy distribution of the g/g/1 queuing system

Probability in the Engineering and Informational Sciences
Models for the semantic classification of noun phrases

CLS '04 Proceedings of the HLT-NAACL Workshop on Computational Lexical Semantics
The syntax and semantics of prepositions in the task of automatic interpretation of nominal phrases and compounds: A cross-linguistic study

Computational Linguistics
Semantic role assignment for event nominalisations by leveraging verbal data

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Interpretation of compound nominalisations using corpus and web statistics

MWE '06 Proceedings of the Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties
Parsing arguments of nominalizations in English and Chinese

HLT-NAACL-Short '04 Proceedings of HLT-NAACL 2004: Short Papers
Automatic semantic relation extraction with multiple boundary generation

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 2
SemEval-2007 task 04: classification of semantic relations between nominals

SemEval '07 Proceedings of the 4th International Workshop on Semantic Evaluations
SemEval-2010 task 8: multi-way classification of semantic relations between pairs of nominals

DEW '09 Proceedings of the Workshop on Semantic Evaluations: Recent Achievements and Future Directions
Making sense of Japanese relative clause constructions

TextMean '04 Proceedings of the 2nd Workshop on Text Meaning and Interpretation
From annotator agreement to noise models

Computational Linguistics
Mining of parsed data to derive deverbal argument structure

GEAF '09 Proceedings of the 2009 Workshop on Grammar Engineering Across Frameworks
Automatic identification of semantic relations in Italian complex nominals

IWCS-8 '09 Proceedings of the Eighth International Conference on Computational Semantics
Schema Normalization for Improving Schema Matching

ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
Bootstrapping location relations from text

Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem - Volume 47
"Nut case: what does it mean?": understanding semantic relationship between nouns in noun compounds through paraphrasing and ranking the paraphrases

Proceedings of the 1st international workshop on Search and mining entity-relationship data
Using verbs to characterize noun-noun relations

AIMSA'06 Proceedings of the 12th international conference on Artificial Intelligence: methodology, Systems, and Applications
Large-scale noun compound interpretation using bootstrapping and the web as a corpus

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Automatic interpretation of noun compounds using wordnet similarity

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Tools for nominalization: an alternative for lexical normalization

PROPOR'06 Proceedings of the 7th international conference on Computational Processing of the Portuguese Language
Empirical methods for the study of denotation in nominalizations in spanish

Computational Linguistics
Word sense and semantic relations in noun compounds

ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 2
Semantic interpretation of noun compounds using verbal and other paraphrases

ACM Transactions on Speech and Language Processing (TSLP) - Special issue on multiword expressions: From theory to practice and use, part 2
Using part---whole relations for automatic deduction of compound-internal relations in GermaNet

Language Resources and Evaluation

Quantified Score

Hi-index	0.00

Visualization

Abstract

This article addresses the interpretation of nominalizations, a particular class of compound nouns whose head noun is derived from a verb and whose modifier is interpreted as an argument of this verb. Any attempt to automatically interpret nominalizations needs to take into account: (a) the selectional constraints imposed by the nominalized compound head, (b) the fact that the relation of the modifier and the head noun can be ambiguous, and (c) the fact that these constraints can be easily overridden by contextual or pragmatic factors. The interpretation of nominalizations poses a further challenge for probabilistic approaches since the argument relations between a head and its modifier are not readily available in the corpus. Even an approximation that maps the compound head to its underlying verb provides insufficient evidence. We present an approach that treats the interpretation task as a disambiguation problem and show how we can "re-create" the missing distributional evidence by exploiting partial parsing, smoothing techniques, and contextual information. We combine these distinct information sources using Ripper, a system that learns sets of rules from data, and achieve an accuracy of 86.1% (over a baseline of 61.5%) on the British National Corpus.