Probabilistic tree-adjoining grammar as a framework for statistical natural language processing

Authors:
Philip Resnik
Affiliations:
University of Pennsylvania, Philadelphia, Pennsylvania
Venue:
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Year:
1992



Abstract

In this paper, I argue for the use of a probabilistic form of tree-adjoining grammar (TAG) in statistical natural language processing. I first discuss two previous statistical approaches: one that concentrates on the probabilities of structural operations, and another that emphasizes co-occurrence relationships between words. I argue that a purely structural approach, exemplified by probabilistic context-free grammar, lacks sufficient sensitivity to lexical context, and, conversely, that lexical co-occurrence analyses require a richer notion of locality that is best provided by importing some notion of structure. I then propose probabilistic TAG as a framework for statistical language modelling, arguing that it provides an advantageous combination of structure, locality, and lexical sensitivity. Issues in the acquisition of probabilistic TAG and parameter estimation are briefly considered.
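
The claim that a purely structural model is insensitive to lexical context can be illustrated with a minimal sketch. The toy grammar, its probabilities, the example words, and the helper name `derivation_probability` below are all assumptions made for illustration and are not taken from the abstract. The sketch scores a derivation in a probabilistic context-free grammar as the product of the probabilities of the rules it uses.

```python
from math import prod

# Hypothetical toy PCFG: (left-hand side, right-hand side) -> probability.
# All rules and values are invented for illustration only.
RULES = {
    ("S", ("NP", "VP")): 1.0,
    ("VP", ("V", "NP")): 1.0,
    ("NP", ("N",)): 0.6,
    ("NP", ("A", "N")): 0.4,
    ("N", ("people",)): 0.5,
    ("N", ("peanuts",)): 0.5,
    ("V", ("eat",)): 1.0,
    ("A", ("roasted",)): 1.0,
}


def derivation_probability(tree):
    """Return the product of rule probabilities for a tree.

    A tree is a tuple (label, child, ...); a child is either a word
    (a string) or another tree.
    """
    label, *children = tree
    if all(isinstance(child, str) for child in children):
        # Lexical rule: the right-hand side is the word itself.
        return RULES[(label, tuple(children))]
    # Structural rule: the right-hand side is the sequence of child labels.
    rhs = tuple(child[0] for child in children)
    return RULES[(label, rhs)] * prod(derivation_probability(c) for c in children)


# Two trees with identical structure and the two nouns swapped.
plausible = ("S",
             ("NP", ("N", "people")),
             ("VP", ("V", "eat"),
                    ("NP", ("A", "roasted"), ("N", "peanuts"))))
implausible = ("S",
               ("NP", ("N", "peanuts")),
               ("VP", ("V", "eat"),
                      ("NP", ("A", "roasted"), ("N", "people"))))

p_plausible = derivation_probability(plausible)
p_implausible = derivation_probability(implausible)
print(p_plausible == p_implausible)
```

Both derivations use exactly the same rules, so this toy model cannot prefer one word order over the other. Any preference would have to come from conditioning on which words co-occur in which structural positions, which is the kind of lexical sensitivity the abstract argues for.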