Part-of-speech tagging using parallel weighted finite-state transducers

  • Authors:
  • Miikka Silfverberg; Krister Lindén

  • Affiliations:
  • Department of Modern Languages, University of Helsinki, Helsinki, Finland; Department of Modern Languages, University of Helsinki, Helsinki, Finland

  • Venue:
  • IceTAL'10: Proceedings of the 7th International Conference on Advances in Natural Language Processing
  • Year:
  • 2010

Abstract

We use parallel weighted finite-state transducers to implement a part-of-speech tagger, which obtains state-of-the-art accuracy when used to tag the Europarl corpora for Finnish, Swedish and English. Our system consists of a weighted lexicon and a guesser combined with a bigram model factored into two weighted transducers. We use both lemmas and tag sequences in the bigram model, which guarantees reliable bigram estimates.
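
As an illustration of how lexical weights, a guesser and bigram weights can be combined, here is a minimal sketch in Python. It is not the authors' implementation: plain dictionaries and a Viterbi search stand in for the weighted transducers, the bigram model uses tags only (no lemmas), and every name and number (`LEXICON`, `GUESSER`, `BIGRAM`, `UNSEEN`, all costs) is a made-up assumption. Weights are treated as costs (negative log probabilities) that are added along a path, and the cheapest path wins.

```python
# Toy lexicon: word -> {tag: cost}. All costs are invented for illustration.
LEXICON = {
    "the": {"DET": 0.0},
    "dog": {"NOUN": 0.1, "VERB": 2.5},
    "barks": {"VERB": 0.4, "NOUN": 1.2},
}

# Hypothetical suffix guesser for words missing from the lexicon.
# The empty suffix is the final fallback.
GUESSER = {
    "s": {"NOUN": 0.7, "VERB": 0.9},
    "": {"NOUN": 1.0, "VERB": 1.5, "DET": 3.0},
}

# Toy tag-bigram costs: (previous tag, tag) -> cost.
BIGRAM = {
    ("<s>", "DET"): 0.2, ("<s>", "NOUN"): 1.0, ("<s>", "VERB"): 2.0,
    ("DET", "NOUN"): 0.1, ("DET", "VERB"): 3.0,
    ("NOUN", "VERB"): 0.3, ("NOUN", "NOUN"): 1.5,
    ("VERB", "NOUN"): 1.0, ("VERB", "DET"): 0.5,
}
UNSEEN = 5.0  # flat penalty for tag pairs absent from BIGRAM


def analyses(word):
    """Return {tag: cost} from the lexicon, else from the longest matching suffix."""
    if word in LEXICON:
        return LEXICON[word]
    for i in range(len(word)):
        suffix = word[i:]
        if suffix in GUESSER:
            return GUESSER[suffix]
    return GUESSER[""]


def tag(words):
    """Viterbi search: return (total cost, tag sequence) of the cheapest path."""
    best = {"<s>": (0.0, [])}  # tag -> (cost so far, tags so far)
    for word in words:
        step = {}
        for t, lex_cost in analyses(word).items():
            # Cheapest way to reach tag t from any previous tag.
            cost, path = min(
                ((c + BIGRAM.get((p, t), UNSEEN) + lex_cost, prev_path)
                 for p, (c, prev_path) in best.items()),
                key=lambda item: item[0],
            )
            step[t] = (cost, path + [t])
        best = step
    return min(best.values(), key=lambda item: item[0])


if __name__ == "__main__":
    print(tag(["the", "dog", "barks"]))
    print(tag(["the", "cats"]))  # "cats" is unknown and falls back to the guesser
```

In this sketch the lexicon and guesser supply per-word costs and the bigram table supplies transition costs. Adding the two along a path is analogous to combining weighted transducers, although the paper's parallel, factored bigram model is not reproduced here.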