Tree k-Grammar Models for Natural Language Modelling and Parsing

  • Authors:
  • Jose L. Verdú-Mas;Mikel L. Forcada;Rafael C. Carrasco;Jorge Calera-Rubio

  • Affiliations:
  • -;-;-;-

  • Venue:
  • Proceedings of the Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we compare three different approaches to build a probabilistic context-free grammar for natural language parsing from a tree bank corpus: (1) a model that simply extracts the rules contained in the corpus and counts the number of occurrences of each rule; (2) a model that also stores information about the parent node's category, and (3) a model that estimates the probabilities according to a generalized k-gram scheme for trees with k = 3. The last model allows for faster parsing and decreases considerably the perplexity of test samples.