Efficient divide-and-conquer parsing of practical context-free languages

  • Authors:
  • Jean-Philippe Bernardy;Koen Claessen

  • Affiliations:
  • Chalmers University of Technology and University of Gothenburg, Gothenburg, Sweden;Chalmers University of Technology and University of Gothenburg, Gothenburg, Sweden

  • Venue:
  • Proceedings of the 18th ACM SIGPLAN international conference on Functional programming
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a divide-and-conquer algorithm for parsing context-free languages efficiently. Our algorithm is an instance of Valiant's (1975), who reduced the problem of parsing to matrix multiplications. We show that, while the conquer step of Valiant's is O(n3) in the worst case, it improves to O(logn3), under certain conditions satisfied by many useful inputs. These conditions occur for example in program texts written by humans. The improvement happens because the multiplications involve an overwhelming majority of empty matrices. This result is relevant to modern computing: divide-and-conquer algorithms can be parallelized relatively easily.