Using a hybrid system of corpus and knowledge-based techniques to automate the induction of a lexical sublanguage grammar

  • Authors:
  • Geert Jan Wilms

  • Affiliations:
  • Union University, Jackson, TN

  • Venue:
  • COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
  • Year:
  • 1996

Abstract

Porting a Natural Language Processing (NLP) system to a new domain remains one of the bottlenecks in syntactic parsing, because of the amount of effort required to fix gaps in the lexicon and to attune the existing grammar to the idiosyncrasies of the new sublanguage. This paper shows how the process of fitting a lexicalized grammar to a domain can be automated to a great extent by using a hybrid system that combines traditional knowledge-based techniques with a corpus-based approach.