Efficient matrix-encoded grammars and low latency parallelization strategies for CYK

  • Authors:
  • Aaron Dunlop; Nathan Bodenstab; Brian Roark

  • Affiliations:
  • Oregon Health & Science University, Portland, OR; Oregon Health & Science University, Portland, OR; Oregon Health & Science University, Portland, OR

  • Venue:
  • IWPT '11 Proceedings of the 12th International Conference on Parsing Technologies
  • Year:
  • 2011

Abstract

We present a matrix encoding of context-free grammars, motivated by hardware-level efficiency considerations. We find efficiency gains of 2.5x to 9x for exhaustive inference and approximately 2x for pruned inference, resulting in high-accuracy parsing at over 20 sentences per second. Our grammar encoding allows fine-grained parallelism during chart cell population; we present a controlled study of several methods of parallel parsing, and find near-optimal latency reductions as core count increases.
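
To make the idea of a matrix-encoded grammar concrete, the following is a minimal sketch, not the authors' implementation. It assumes one possible layout: a dense matrix whose rows are parent nonterminals and whose columns are (left child, right child) pairs, with Viterbi scores as log-probabilities. Populating a chart cell then becomes a max-plus matrix-vector product, and because each output row is computed independently, the rows could in principle be split across cores. The names `build_grammar_matrix` and `cyk_viterbi` and the rule dictionaries are hypothetical, introduced only for this illustration.

```python
import math

NEG_INF = -math.inf


def build_grammar_matrix(nonterminals, binary_rules):
    """Encode binary rules as a dense |V| x |V|^2 matrix of log-probabilities.

    Row = parent index; column = left_index * |V| + right_index.
    (Assumed layout for illustration; a real parser would store this sparsely.)
    """
    index = {nt: i for i, nt in enumerate(nonterminals)}
    n = len(nonterminals)
    matrix = [[NEG_INF] * (n * n) for _ in range(n)]
    for (parent, left, right), logp in binary_rules.items():
        matrix[index[parent]][index[left] * n + index[right]] = logp
    return index, matrix


def cyk_viterbi(words, nonterminals, binary_rules, lexical_rules):
    """Exhaustive Viterbi CYK over a non-empty sentence.

    Returns the score vector of the root cell and the nonterminal index map.
    """
    index, matrix = build_grammar_matrix(nonterminals, binary_rules)
    n = len(nonterminals)
    length = len(words)
    chart = {}  # (start, end) -> vector of best log-probabilities

    # Span-1 cells come from lexical rules (nonterminal -> word).
    for i, word in enumerate(words):
        cell = [NEG_INF] * n
        for (nt, w), logp in lexical_rules.items():
            if w == word:
                cell[index[nt]] = max(cell[index[nt]], logp)
        chart[(i, i + 1)] = cell

    for span in range(2, length + 1):
        for start in range(length - span + 1):
            end = start + span
            # Child-pair vector: best score of each (left, right) pair
            # over all midpoints of this span.
            pairs = [NEG_INF] * (n * n)
            for mid in range(start + 1, end):
                left_cell = chart[(start, mid)]
                right_cell = chart[(mid, end)]
                for l in range(n):
                    if left_cell[l] == NEG_INF:
                        continue
                    for r in range(n):
                        score = left_cell[l] + right_cell[r]
                        if score > pairs[l * n + r]:
                            pairs[l * n + r] = score
            # Max-plus matrix-vector product. Each row is independent,
            # so rows could be assigned to separate threads.
            chart[(start, end)] = [
                max(g + p for g, p in zip(row, pairs)) for row in matrix
            ]

    return chart[(0, length)], index
```

For a toy grammar with the single binary rule S → NP VP and lexical rules NP → "she" and VP → "runs", calling `cyk_viterbi(["she", "runs"], ...)` returns a root vector whose S entry is the sum of the three rule log-probabilities, while entries for nonterminals that cannot span the sentence stay at negative infinity.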