Using web-scale N-grams to improve base NP parsing performance

  • Authors:
  • Emily Pitler;Shane Bergsma;Dekang Lin;Kenneth Church

  • Affiliations:
  • University of Pennsylvania;University of Alberta;Google, Inc.;Johns Hopkins University

  • Venue:
  • COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We use web-scale N-grams in a base NP parser that correctly analyzes 95.4% of the base NPs in natural text. Web-scale data improves performance. That is, there is no data like more data. Performance scales log-linearly with the number of parameters in the model (the number of unique N-grams). The web-scale N-grams are particularly helpful in harder cases, such as NPs that contain conjunctions.