Multilevel coarse-to-fine PCFG parsing

  • Authors:
  • Eugene Charniak;Mark Johnson;Micha Elsner;Joseph Austerweil;David Ellis;Isaac Haxton;Catherine Hill;R. Shrivaths;Jeremy Moore;Michael Pozar;Theresa Vu

  • Affiliations:
  • Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI;Brown University Providence, RI

  • Venue:
  • HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a PCFG parsing algorithm that uses a multilevel coarse-to-fine (mlctf) scheme to improve the efficiency of search for the best parse. Our approach requires the user to specify a sequence of nested partitions or equivalence classes of the PCFG nonterminals. We define a sequence of PCFGs corresponding to each partition, where the nonterminals of each PCFG are clusters of nonterminals of the original source PCFG. We use the results of parsing at a coarser level (i.e., grammar defined in terms of a coarser partition) to prune the next finer level. We present experiments showing that with our algorithm the work load (as measured by the total number of constituents processed) is decreased by a factor of ten with no decrease in parsing accuracy compared to standard CKY parsing with the original PCFG. We suggest that the search space over mlctf algorithms is almost totally unexplored so that future work should be able to improve significantly on these results.