Searching for Part of Speech Tags That Improve Parsing Models

  • Authors:
  • Martín Ariel Domínguez; Gabriel Infante-Lopez

  • Affiliations:
  • Grupo de Procesamiento de Lenguaje Natural, Universidad Nacional de Córdoba, Argentina; Grupo de Procesamiento de Lenguaje Natural, Universidad Nacional de Córdoba, Argentina and Consejo Nacional de Investigaciones Científicas y Técnicas,

  • Venue:
  • GoTAL '08 Proceedings of the 6th international conference on Advances in Natural Language Processing
  • Year:
  • 2008

Abstract

We introduce a technique for inducing a refinement of the set of part-of-speech tags related to verbs. We cluster verbs according to their syntactic behavior in a dependency structure setting. The set of clusters is determined automatically by means of a quality measure over the probabilistic automata that describe words in a bilexical grammar. Each of the resulting clusters defines a new part-of-speech tag. We evaluate the resulting tag set in a state-of-the-art phrase structure parser and show that the induced part-of-speech tags significantly improve the accuracy of the parser.
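
The general idea of clustering verbs by syntactic behavior and letting a quality measure decide how many clusters to keep can be illustrated with a small sketch. This is not the authors' method: the abstract does not give the quality measure, so the sketch assumes a much simpler stand-in (the log-likelihood of each cluster's pooled dependent-tag counts under a unigram model, minus a fixed penalty per cluster) in place of probabilistic automata. The toy counts, the names `VERB_DEPENDENTS`, `cluster_verbs`, and `refine_tags`, the `penalty` value, and the `VB-<n>` tag format are all hypothetical.

```python
import math
from collections import Counter
from itertools import combinations

# Hypothetical toy data: how often each verb takes a dependent with a given tag.
VERB_DEPENDENTS = {
    "eat":   Counter({"NN": 8, "RB": 2}),
    "read":  Counter({"NN": 7, "RB": 3}),
    "sleep": Counter({"RB": 9, "NN": 1}),
    "run":   Counter({"RB": 8, "NN": 2}),
    "say":   Counter({"SBAR": 9, "NN": 1}),
    "think": Counter({"SBAR": 8, "RB": 2}),
}


def log_likelihood(counts):
    """Log-likelihood of counts under their own maximum-likelihood unigram model."""
    total = sum(counts.values())
    return sum(c * math.log(c / total) for c in counts.values() if c > 0)


def quality(clusters, penalty):
    """Assumed quality measure: pooled log-likelihood minus a per-cluster penalty."""
    score = 0.0
    for cluster in clusters:
        pooled = Counter()
        for verb in cluster:
            pooled += VERB_DEPENDENTS[verb]
        score += log_likelihood(pooled)
    return score - penalty * len(clusters)


def cluster_verbs(verbs, penalty=3.0):
    """Greedily merge clusters while the quality measure keeps improving."""
    clusters = [frozenset([v]) for v in verbs]
    best = quality(clusters, penalty)
    while len(clusters) > 1:
        candidates = []
        for a, b in combinations(clusters, 2):
            merged = [c for c in clusters if c not in (a, b)] + [a | b]
            candidates.append((quality(merged, penalty), merged))
        top_score, top_clusters = max(candidates, key=lambda pair: pair[0])
        if top_score <= best:
            break  # no merge improves quality, so the cluster count is fixed here
        best, clusters = top_score, top_clusters
    return clusters


def refine_tags(clusters, base_tag="VB"):
    """Give every cluster its own tag, so each verb maps to a refined tag."""
    ordered = sorted(clusters, key=lambda c: sorted(c))
    return {verb: f"{base_tag}-{i}" for i, c in enumerate(ordered) for verb in c}


clusters = cluster_verbs(list(VERB_DEPENDENTS))
tags = refine_tags(clusters)
```

In this sketch the number of clusters is never supplied by hand: merging stops as soon as no pairwise merge raises the assumed quality score, which mirrors, in a very reduced form, the abstract's point that the set of clusters is determined automatically. A corpus would then be retagged by replacing each verb's original tag with `tags[verb]` before training the parser.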