Recognition is not parsing - SPPF-style parsing from cubic recognisers

Authors:
Elizabeth Scott;Adrian Johnstone
Affiliations:
Department of Computer Science, Royal Holloway, University of London, Egham, Surrey, United Kingdom;Department of Computer Science, Royal Holloway, University of London, Egham, Surrey, United Kingdom
Venue:
Science of Computer Programming
Year:
2010

Citing 13
Cited 2

Compilers: principles, techniques, and tools

Compilers: principles, techniques, and tools
Efficient Computation of LALR(1) Look-Ahead Sets

ACM Transactions on Programming Languages and Systems (TOPLAS)
An efficient context-free parsing algorithm

Communications of the ACM
Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems

Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems
The Java Language Specification

The Java Language Specification
Even faster generalized LR parsing

Acta Informatica
Faster Generalized LR Parsing

CC '99 Proceedings of the 8th International Conference on Compiler Construction, Held as Part of the European Joint Conferences on the Theory and Practice of Software, ETAPS'99
Java(TM) Language Specification, The (3rd Edition) (Java (Addison-Wesley))

Java(TM) Language Specification, The (3rd Edition) (Java (Addison-Wesley))
Generalized Bottom Up Parsers With Reduced Stack Activity

The Computer Journal
Right nulled GLR parsers

ACM Transactions on Programming Languages and Systems (TOPLAS)
BRNGLR: a cubic Tomita-style GLR parsing algorithm

Acta Informatica
Proofs and pedagogy; science and systems: The grammar tool box

Science of Computer Programming
SPPF-Style Parsing From Earley Recognisers

Electronic Notes in Theoretical Computer Science (ENTCS)

The Semantics of Parsing with Semantic Actions

LICS '12 Proceedings of the 2012 27th Annual IEEE/ACM Symposium on Logic in Computer Science
GLL parse-tree generation

Science of Computer Programming

Quantified Score

Hi-index	0.02

Visualization

Abstract

In their recogniser forms, the Earley and RIGLR algorithms for testing whether a string can be derived from a grammar are worst-case cubic on general context free grammars (CFG). Earley gave an outline of a method for turning his recognisers into parsers, but it turns out that this method is incorrect. Tomita's GLR parser returns a shared packed parse forest (SPPF) representation of all derivations of a given string from a given CFG but is worst-case unbounded polynomial order. The parser version of the RIGLR algorithm constructs Tomita-style SPPFs and thus is also worst-case unbounded polynomial order. We have given a modified worst-case cubic GLR algorithm, that, for any string and any CFG, returns a binarised SPPF representation of all possible derivations of a given string. In this paper we apply similar techniques to develop worst-case cubic Earley and RIGLR parsing algorithms.