A feasibility study on low level techniques for improving parsing accuracy for spanish using maltparser

  • Authors:
  • Miguel Ballesteros;Jesús Herrera;Virginia Francisco;Pablo Gervás

  • Affiliations:
  • Departamento de Ingeniería del Software e Inteligencia Artificial;Departamento de Ingeniería del Software e Inteligencia Artificial;Instituto de Tecnología del Conocimiento, Universidad Complutense de Madrid, Madrid, Spain;Instituto de Tecnología del Conocimiento, Universidad Complutense de Madrid, Madrid, Spain

  • Venue:
  • SETN'10 Proceedings of the 6th Hellenic conference on Artificial Intelligence: theories, models and applications
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

In the last years dependency parsing has been accomplished by machine learning–based systems showing great accuracy but usually under 90% for Labelled Attachment Score (LAS) Maltparser is one of such systems Machine learning allows to obtain parsers for every language having an adequate training corpus Since generally such systems can not be modified the following question arises: Can we beat this 90% LAS by using better training corpora? Some previous work points that high level techniques are not sufficient for building more accurate training corpora Thus, by analyzing the words that are more frequently incorrectly attached or labelled, we study the feasibility of some low level techniques, based on n–version parsing models, in order to obtain better parsing accuracy.