From trees to predicate-argument structures

Authors:
Maria Liakata;Stephen Pulman
Affiliations:
Oxford University, Oxford;Oxford University, Oxford
Venue:
COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Year:
2002

Citing 6
Cited 7

WordNet: a lexical database for English

Communications of the ACM
The Berkeley FrameNet Project

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Shake-and-bake translation

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Automatic predicate argument analysis of the Penn TreeBank

HLT '01 Proceedings of the first international conference on Human language technology research
Automatic labeling of semantic roles

ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
The Penn Treebank: annotating predicate argument structure

HLT '94 Proceedings of the workshop on Human Language Technology

Learning theories from text

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Wide-coverage semantic representations from a CCG parser

COLING '04 Proceedings of the 20th international conference on Computational Linguistics
Deriving generalized knowledge from corpora using WordNet abstraction

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Automatic fine-grained semantic classification for domain adaptation

STEP '08 Proceedings of the 2008 Conference on Semantics in Text Processing
Open knowledge extraction through compositional language processing

STEP '08 Proceedings of the 2008 Conference on Semantics in Text Processing
Analysis of a probabilistic model of redundancy in unsupervised information extraction

Artificial Intelligence
Deterministic statistical mapping of sentences to underspecified semantics

IWCS '11 Proceedings of the Ninth International Conference on Computational Semantics

Quantified Score

Hi-index	0.00

Visualization

Abstract

The Penn Treebank encodes valuable information such as grammatical function, semantic roles, and identification of traces. The addition of such information was intended to facilitate the process of predicate-argument extraction. However, even with the enriched annotation this task is far from trivial and, to our knowledge, no complete set of predicate argument structures derived from the Treebank exists. Our paper describes a method for retrieving predicate-argument structures that circumvents the complexity of the tree structures in the corpus, while employing few template rules. Our system operates on a flattened, morphologically enriched version of the corpus. This flattened representation allows access to all levels of the tree simultaneously and thus enables the detection of the main sentence constituents by means of simple template rules. A small number of rules apply to identify the head words of each constituent and the latter fill in the constituent templates, to build the logical forms representative of the predicate argument structure. The system is robust in the face of incomplete syntactic coverage.