Serial combination of rules and statistics: a case study in Czech tagging

  • Authors:
  • Jan Hajič;Pavel Krbec;Pavel Květoň;Karel Oliva;Vladimír Petkevič

  • Affiliations:
  • IFAL, MFF UK, Prague, Czechia;IFAL, MFF UK, Prague, Czechia;ICNC, FF UK, Prague, Czechia;Univ. of Saarland, Germany;ITCL, FF UK, Prague, Czechia

  • Venue:
  • ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
  • Year:
  • 2001

Abstract

A hybrid system is described which combines the strengths of manual rule-writing and statistical learning, obtaining results superior to those of either method applied separately. The combination of the rule-based system and the statistical one is serial rather than parallel: the rule-based system, which performs partial disambiguation with recall close to 100%, is applied first, and a trigram HMM tagger then runs on its output. An experiment in Czech tagging has been performed with encouraging results.
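
The serial arrangement described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`apply_rules`, `viterbi_trigram`), the rule signature, the toy English sentence, the probability tables, and the `floor` smoothing constant are all assumptions made for illustration. Stage 1 only removes candidate tags and never empties a candidate set, which is one simple way to keep recall high; stage 2 is a standard trigram Viterbi search restricted to the tags that survive stage 1.

```python
import math

START = "<s>"  # hypothetical padding tag for the sentence start


def apply_rules(candidates, rules):
    """Stage 1 (sketch): rules delete tags but never empty a candidate set."""
    pruned = [set(c) for c in candidates]
    for rule in rules:
        for i in range(len(pruned)):
            remaining = {t for t in pruned[i] if not rule(pruned, i, t)}
            if remaining:  # keep the old set if a rule would remove everything
                pruned[i] = remaining
    return pruned


def viterbi_trigram(words, candidates, trans, emit, floor=1e-6):
    """Stage 2 (sketch): trigram Viterbi over the surviving candidate tags.

    trans maps (t1, t2, t3) to P(t3 | t1, t2); emit maps (word, tag) to
    P(word | tag). Unseen events get the assumed probability `floor`.
    """
    # A search state is the pair (previous tag, current tag).
    best = {(START, START): (0.0, [])}
    for word, tags in zip(words, candidates):
        new_best = {}
        for (t1, t2), (score, path) in best.items():
            for t3 in tags:
                s = (score
                     + math.log(trans.get((t1, t2, t3), floor))
                     + math.log(emit.get((word, t3), floor)))
                key = (t2, t3)
                if key not in new_best or s > new_best[key][0]:
                    new_best[key] = (s, path + [t3])
        best = new_best
    return max(best.values(), key=lambda v: v[0])[1]


# Hypothetical rule: a VERB reading is removed right after an unambiguous DET.
def no_verb_after_det(sets, i, tag):
    return tag == "VERB" and i > 0 and sets[i - 1] == {"DET"}


# Toy data (invented for this sketch, not taken from the paper).
words = ["the", "can", "rusts"]
candidates = [{"DET"}, {"NOUN", "VERB"}, {"NOUN", "VERB"}]
trans = {
    (START, START, "DET"): 0.9,
    (START, "DET", "NOUN"): 0.8,
    ("DET", "NOUN", "VERB"): 0.7,
    ("DET", "NOUN", "NOUN"): 0.2,
}
emit = {
    ("the", "DET"): 0.9,
    ("can", "NOUN"): 0.3,
    ("rusts", "VERB"): 0.4,
    ("rusts", "NOUN"): 0.1,
}

pruned = apply_rules(candidates, [no_verb_after_det])
tags = viterbi_trigram(words, pruned, trans, emit)
print(pruned)
print(tags)
```

In this toy run the rule resolves the second word on its own, and the statistical stage only has to choose between the two readings left for the third word. That division of labour is the point of the serial design: the tagger searches a smaller space, and it cannot reintroduce a reading the rules have already excluded.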