A robust and hybrid deep-linguistic theory applied to large-scale parsing

  • Authors:
  • Gerold Schneider;James Dowdall;Fabio Rinaldi

  • Affiliations:
  • University of Zurich;University of Zurich;University of Zurich

  • Venue:
  • ROMAND '04 Proceedings of the 3rd Workshop on RObust Methods in Analysis of Natural Language Data
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Modern statistical parsers are robust and quite fast, but their output is relatively shallow when compared to formal grammar parsers. We suggest to extend statistical approaches to a more deep-linguistic analysis while at the same time keeping the speed and low complexity of a statistical parser. The resulting parsing architecture suggested, implemented and evaluated here is highly robust and hybrid on a number of levels, combining statistical and rule-based approaches, constituency and dependency grammar, shallow and deep processing, full and near-full parsing. With its parsing speed of about 300,000 words per hour and state-of-the-art performance the parser is reliable for a number of large-scale applications discussed in the article.