Supertagging: an approach to almost parsing

Authors: Srinivas Bangalore; Aravind K. Joshi
Affiliations: AT&T Labs -- Research; University of Pennsylvania
Venue: Computational Linguistics
Year: 1999

Abstract

In this paper, we have proposed novel methods for robust parsing that integrate the flexibility of linguistically motivated lexical descriptions with the robustness of statistical techniques. Our thesis is that the computation of linguistic structure can be localized if lexical items are associated with rich descriptions (supertags) that impose complex constraints in a local context. The supertags are designed such that only those elements on which the lexical item imposes constraints appear within a given supertag. Further, each lexical item is associated with as many supertags as the number of different syntactic contexts in which the lexical item can appear. This makes the number of different descriptions for each lexical item much larger than when the descriptions are less complex, thus increasing the local ambiguity for a parser. But this local ambiguity can be resolved by using statistical distributions of supertag co-occurrences collected from a corpus of parses. We have explored these ideas in the context of the Lexicalized Tree-Adjoining Grammar (LTAG) framework. The supertags in LTAG combine both phrase structure information and dependency information in a single representation. Supertag disambiguation results in a representation that is effectively a parse (an almost parse), and the parser need "only" combine the individual supertags. This method of parsing can also be used to parse sentence fragments such as in spoken utterances where the disambiguated supertag sequence may not combine into a single structure.
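
The disambiguation step described in the abstract can be illustrated with a minimal sketch. It assumes a bigram model over supertags with add-one smoothing and a Viterbi search; the abstract only says that co-occurrence statistics from a corpus of parses are used, so the model order, the smoothing, the omission of word-given-supertag probabilities, and every name and count below (`LEXICON`, `BIGRAM_COUNTS`, `disambiguate`, the supertag labels) are hypothetical choices made for illustration, not the authors' implementation.

```python
import math

# Hypothetical toy lexicon: each word lists the supertags it may take.
# The labels are illustrative names, not trees from a real LTAG grammar.
LEXICON = {
    "the": ["Det"],
    "price": ["N_head", "V_trans"],
    "includes": ["V_trans", "V_intrans"],
    "tax": ["N_head", "V_trans"],
}

# Hypothetical supertag bigram counts, standing in for co-occurrence
# statistics collected from a corpus of parses.
BIGRAM_COUNTS = {
    ("<s>", "Det"): 8,
    ("Det", "N_head"): 9,
    ("Det", "V_trans"): 1,
    ("N_head", "V_trans"): 6,
    ("N_head", "V_intrans"): 2,
    ("V_trans", "N_head"): 7,
    ("V_trans", "V_trans"): 1,
}


def log_prob(prev, tag, counts, tagset_size):
    """Add-one smoothed log P(tag | prev) from bigram counts."""
    total = sum(c for (p, _), c in counts.items() if p == prev)
    return math.log((counts.get((prev, tag), 0) + 1) / (total + tagset_size))


def disambiguate(words, lexicon, counts):
    """Viterbi search for the most probable supertag sequence.

    Simplification: only supertag-to-supertag transitions are scored;
    the lexicon acts as a hard filter on which supertags a word may take.
    """
    tagset_size = len({t for tags in lexicon.values() for t in tags})
    # best maps the last supertag to (log score, best path ending in it)
    best = {"<s>": (0.0, [])}
    for word in words:
        nxt = {}
        for tag in lexicon[word]:
            score, path = max(
                (
                    (s + log_prob(prev, tag, counts, tagset_size), p)
                    for prev, (s, p) in best.items()
                ),
                key=lambda item: item[0],
            )
            nxt[tag] = (score, path + [tag])
        best = nxt
    return max(best.values(), key=lambda item: item[0])[1]


if __name__ == "__main__":
    sentence = ["the", "price", "includes", "tax"]
    print(disambiguate(sentence, LEXICON, BIGRAM_COUNTS))
```

Each word starts with several candidate supertags (the local ambiguity the abstract mentions), and the search keeps, for each candidate, only the best-scoring sequence ending in it. The output is one supertag per word; combining those supertags into a full structure is the separate step left to the parser.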