Bayesian induction of syntactic language models for brazilian portuguese

  • Authors:
  • Daniel Emilio Beck;Helena de Medeiros Caseli

  • Affiliations:
  • Department of Computer Science --- LaLiC/NILC, Federal University of São Carlos (UFSCar), São Carlos, SP, Brazil;Department of Computer Science --- LaLiC/NILC, Federal University of São Carlos (UFSCar), São Carlos, SP, Brazil

  • Venue:
  • PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Recent approaches for building syntactic language models include the combination of Probabilistic Tree Substitution Grammars (PTSGs) and Bayesian learning methods. While PTSGs have appealing features for syntax modeling, Bayesian methods provide a framework for inducing compact grammars that do not overfit the training corpus. In this paper, we apply these approaches to learn syntactic language models from a Brazilian Portuguese treebank.