An efficient augmented-context-free parsing algorithm

Authors:
Masaru Tomita
Affiliations:
Carnegie-Mellon University, Pittsburgh, PA
Venue:
Computational Linguistics
Year:
1987

References: 17
Cited by: 59

Sentence disambiguation by asking

Computers and Translation
LR Parsing

ACM Computing Surveys (CSUR)
An efficient context-free parsing algorithm

Communications of the ACM
Simple LR(k) grammars

Communications of the ACM
Transition network grammars for natural language analysis

Communications of the ACM
Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems

Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems
The Theory of Parsing, Translation, and Compiling

The Theory of Parsing, Translation, and Compiling
Functional Unification Grammar: a formalism for machine translation

ACL '84 Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics
LR parsers for natural languages

ACL '84 Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics
The design of a computer language for linguistic information

ACL '84 Proceedings of the 10th International Conference on Computational Linguistics and 22nd annual meeting on Association for Computational Linguistics
A structure-sharing representation for unification-based grammar formalisms

ACL '85 Proceedings of the 23rd annual meeting on Association for Computational Linguistics
Using restriction to extend parsing algorithms for complex-feature-based formalisms

ACL '85 Proceedings of the 23rd annual meeting on Association for Computational Linguistics
Sentence disambiguation by a shift-reduce parsing technique

ACL '83 Proceedings of the 21st annual meeting on Association for Computational Linguistics
Menu-based natural language understanding

ACL '83 Proceedings of the 21st annual meeting on Association for Computational Linguistics
Parsing spoken language: a semantic caseframe approach

COLING '86 Proceedings of the 11th conference on Computational linguistics
Principles of Compiler Design (Addison-Wesley series in computer science and information processing)

Principles of Compiler Design (Addison-Wesley series in computer science and information processing)
Language As a Cognitive Process: Syntax

Language As a Cognitive Process: Syntax

Unification: a multidisciplinary survey

ACM Computing Surveys (CSUR)
Parsers and printers as stream destructors and constructors embedded in functional languages

FPCA '89 Proceedings of the fourth international conference on Functional programming languages and computer architecture
Two recent developments in tree adjoining grammars: semantics and efficient processing

HLT '90 Proceedings of the workshop on Speech and Natural Language
Communicative facial displays as a new conversational modality

CHI '93 Proceedings of the INTERACT '93 and CHI '93 Conference on Human Factors in Computing Systems
Storing logical form in a shared-packed forest

Computational Linguistics
Robustness and Portability Issues in Multilingual Speech Processing

Machine Translation
Conflicts

ACM SIGPLAN Notices
Automatic Indexing and Content-Based Retrieval of Captioned Images

Computer
Generalized probabilistic LR parsing of natural language (Corpora) with unification-based grammars

Computational Linguistics - Special issue on using large corpora: I
An efficient implementation of the head-corner parser

Computational Linguistics
Surface-marker-based dialog modelling: A progress report on the MAREDI project

Natural Language Engineering
Generalized left-corner parsing

EACL '93 Proceedings of the sixth conference on European chapter of the Association for Computational Linguistics
Polynomial time and space shift-reduce parsing of arbitrary context-free grammars

ACL '91 Proceedings of the 29th annual meeting on Association for Computational Linguistics
The structure of shared forests in ambiguous parsing

ACL '89 Proceedings of the 27th annual meeting on Association for Computational Linguistics
Relating complexity to practical performance in parsing with wide-coverage unification grammars

ACL '94 Proceedings of the 32nd annual meeting on Association for Computational Linguistics
Structural disambiguation with constraint propagation

ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
Deterministic left to right parsing of Tree Adjoining Languages

ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
Graph-structured Stack and natural language parsing

ACL '88 Proceedings of the 26th annual meeting on Association for Computational Linguistics
Linear encodings of linguistic analyses

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
Efficiency considerations for LFG-parsers: incremental and table-lookup techniques

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 1
Knowledge integration in a robust and efficient morpho-syntactic analyzer for French

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 1
Robust parsing of severely corrupted spoken utterances

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 1
Parsing incomplete sentences

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 1
Parsing noisy sentences

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 2
Combining lexicon-driven parsing and phrase-structure-based parsing

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 2
An English-to-Korean machine translator: MATES/EK

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
Parsing Turkish using the lexical functional grammar formalism

COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
High-probability syntactic links

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
The generalized LR parser/compiler V8-4: a software package for practical NL projects

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 1
Multi-lingual translation of spontaneously spoken language in a limited domain

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
High-probability syntactic links

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
A new parallel algorithm for generalized LR parsing

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2
Chinese syntactic parsing based on extended GLR parsing algorithm with PCFG

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 2
On the applicability of Global Index Grammars

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 2
Source transformation, analysis and generation in TXL

Proceedings of the 2006 ACM SIGPLAN symposium on Partial evaluation and semantics-based program manipulation
Optimal ambiguity packing in context-free parsers with interleaved unification

New developments in parsing technology
Soup: a parser for real-world spontaneous speech

New developments in parsing technology
The TXL source transformation language

Science of Computer Programming - The fourth workshop on language descriptions, tools, and applications (LDTA'04)
Book review:

Computational Linguistics
A best-first probabilistic shift-reduce parser

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Xoc, an extension-oriented compiler for systems programming

Proceedings of the 13th international conference on Architectural support for programming languages and operating systems
SASyLF: an educational proof assistant for language theory

Proceedings of the 2008 international workshop on Functional and declarative programming in education
Fast, Accurate Creation of Data Validation Formats by End-User Developers

IS-EUD '09 Proceedings of the 2nd International Symposium on End-User Development
A higher-order strategy for eliminating common subexpressions

Computer Languages, Systems and Structures
Semi-supervised training of a statistical parser from unlabeled partially-bracketed data

IWPT '07 Proceedings of the 10th International Conference on Parsing Technologies
Coordinated morphological and syntactic analysis of Japanese language

IJCAI'91 Proceedings of the 12th international joint conference on Artificial intelligence - Volume 2
Fast translation rule matching for syntax-based statistical machine translation

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
LR parsing for global index languages (GILs)

CIAA'03 Proceedings of the 8th international conference on Implementation and application of automata
GIGs: restricted context-sensitive descriptive power in bounded polynomial-time

CICLing'03 Proceedings of the 4th international conference on Computational linguistics and intelligent text processing
Convolution kernel over packed parse forest

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Non-isomorphic forest pair translation

EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
ATREUS: a comparative study of continuous speech recognition systems at ATR

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
A neural network controlled adaptive search strategy for HMM-based speech recognition

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
LR parsing for boolean grammars

DLT'05 Proceedings of the 9th international conference on Developments in Language Theory
Improved GLR parsing algorithm

ICIC'05 Proceedings of the 2005 international conference on Advances in Intelligent Computing - Volume Part II
Machine translation based on constraint-based synchronous grammar

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Systematic processing of long sentences in rule based portuguese-chinese machine translation

CICLing'10 Proceedings of the 11th international conference on Computational Linguistics and Intelligent Text Processing
DynGenPar: a dynamic generalized parser for common mathematical language

CICM'12 Proceedings of the 11th international conference on Intelligent Computer Mathematics
Efficient large-scale parsing: a survey

Proceedings of the COLING-2000 Workshop on Efficiency In Large-Scale Parsing Systems

Abstract

An efficient parsing algorithm for augmented context-free grammars is introduced, and its application to on-line natural language interfaces is discussed. The algorithm is a generalized LR parsing algorithm, which precomputes an LR shift-reduce parsing table (possibly with multiple entries) from a given augmented context-free grammar. Unlike the standard LR parsing algorithm, it can handle arbitrary context-free grammars, including ambiguous grammars, while most of the LR efficiency is preserved by introducing the concept of a "graph-structured stack". The graph-structured stack allows an LR shift-reduce parser to maintain multiple parses without parsing any part of the input twice in the same way. We can also view our parsing algorithm as an extended chart parsing algorithm efficiently guided by LR parsing tables. The algorithm is fast, due to the LR table precomputation. In several experiments with different English grammars and sentences, timings indicate a five- to tenfold speed advantage over Earley's context-free parsing algorithm.

The algorithm parses a sentence strictly from left to right on-line; that is, it starts parsing as soon as the user types in the first word of a sentence, without waiting for completion of the sentence. A practical on-line parser based on the algorithm has been implemented in Common Lisp and runs on Symbolics and HP AI workstations. The parser is used in the multi-lingual machine translation project at CMU. Also, a commercial on-line parser for the Japanese language is being built by Intelligent Technology Incorporation, based on the technique developed at CMU.
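
The idea of a graph-structured stack can be illustrated with a minimal Python sketch. It is not the paper's implementation: the names Node, shift, and paths, and the tiny goto mapping standing in for an LR table row, are all assumptions made for illustration, and reduce actions, table construction, and forest packing are omitted. It shows only how stack tops that reach the same LR state are merged, so that later input is processed once for all of the parses that share that top.

```python
from dataclasses import dataclass, field


@dataclass(eq=False)
class Node:
    """One vertex of the graph-structured stack: an LR state plus its predecessors."""
    state: int
    preds: list = field(default_factory=list)


def shift(tops, goto):
    """Shift one word from every active stack top.

    `goto` maps a current state to the state entered after the shift
    (a hypothetical stand-in for one column of an LR parsing table).
    Tops that land in the same state are merged into a single new node.
    """
    merged = {}
    for top in tops:
        target = goto[top.state]
        node = merged.setdefault(target, Node(target))
        node.preds.append(top)
    return list(merged.values())


def paths(node, length):
    """Enumerate every path of `length` edges leading back from `node`.

    A reduce by a rule with `length` right-hand-side symbols would be
    applied once per path returned here.
    """
    if length == 0:
        return [[node]]
    return [[node] + rest for p in node.preds for rest in paths(p, length - 1)]


# Two competing parses share the bottom of the stack, then diverge.
bottom = Node(0)
a, b = Node(3, [bottom]), Node(4, [bottom])

# Both shift the next word into state 7, so they merge into one top.
tops = shift([a, b], {3: 7, 4: 7})
states_on_paths = [[n.state for n in p] for p in paths(tops[0], 2)]
```

After the shift there is a single top node in state 7 with two predecessors, and walking back two edges recovers both underlying stacks. Keeping the alternatives as paths in one graph, rather than as separate copies of the stack, is what lets the parser avoid repeating the same work for each parse.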