Recognizing substrings of LR(k) languages in linear time

Authors:
Joseph Bates;Alon Lavie
Affiliations:
-;-
Venue:
POPL '92 Proceedings of the 19th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Year:
1992

Citing 6
Cited 0

An efficient incremental LR parser for grammars with epislon productions

Acta Informatica
Noncorrecting syntax error recovery

ACM Transactions on Programming Languages and Systems (TOPLAS)
Compilers: principles, techniques, and tools

Compilers: principles, techniques, and tools
An LR substring parser for noncorrecting syntax error recovery

PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems

Efficient Parsing for Natural Language: A Fast Algorithm for Practical Systems
The Theory of Parsing, Translation, and Compiling

The Theory of Parsing, Translation, and Compiling

Quantified Score

Hi-index	0.00

Visualization

Abstract

LR parsing techniques have long been studied as efficient and powerful methods for processing context free languages. A linear time algorithm for recognizing languages representable by LR(k) grammars has long been known. Recognizing substrings of a context-free language is at least as hard as recognizing full strings of the language, as the latter problem easily reduces to the former. In this paper we present a linear time algorithm for recognizing substrings of LR(k) languages, thus showing that the substring recognition problem for these languages is no harder than the full string recognition problem. An interesting data structure, the Forest Structured Stack, allows the algorithm to track all possible parses of a substring without loosing the efficiency of the original LR parser. We present the algorithm, prove its correctness, analyze its complexity, and mention several applications that have been constructed.