A statistical tagger for morphological tagging of Russian language texts

  • Authors:
  • V. V. Petrochenkov; A. O. Kazennikov

  • Affiliations:
  • Institute for Information Transmission Problems (Kharkevich Institute), Russian Academy of Sciences, Moscow, Russia; Moscow State Institute of Radiotechnics, Electronics, and Automation, Moscow, Russia

  • Venue:
  • Automation and Remote Control
  • Year:
  • 2013

Abstract

We consider a method for constructing a statistical tagger for the automated morphological tagging of Russian-language texts. In this method, each word is assigned a tag that encodes its part of speech and the full set of its morphological characteristics. We use the set of morphological characteristics adopted in the SynTagRus corpus, whose material was used to train the tagger. The tagger is based on the support vector machine (SVM) approach. The developed tagger has proven to be efficient and has shown high tagging quality.
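
The abstract does not describe the feature set, the tag encoding, or the SVM variant, so the following is only a minimal sketch of the general idea: treat each full tag (part of speech plus morphological characteristics) as one class label and train a linear one-vs-rest classifier with a hinge loss. Everything in it is an assumption made for illustration: the toy words, the `POS|FEATURE|...` tag strings (which are not the SynTagRus notation), the suffix features, and the names `extract_features`, `train`, `predict`, and `split_tag`.

```python
# Toy training data: (word, full tag). The tag strings are hypothetical
# placeholders, not the actual SynTagRus tag notation.
TRAIN = [
    ("кошка", "S|NOM|SG|FEM"),
    ("собака", "S|NOM|SG|FEM"),
    ("читает", "V|PRES|3P|SG"),
    ("думает", "V|PRES|3P|SG"),
    ("красный", "A|NOM|SG|MASC"),
    ("новый", "A|NOM|SG|MASC"),
]


def extract_features(word):
    """Return binary suffix features (an assumed feature set, not the paper's)."""
    w = word.lower()
    return ["suf2=" + w[-2:], "suf3=" + w[-3:]]


def train(data, epochs=20, lr=0.1, lam=0.01):
    """Train one linear hinge-loss classifier per tag (one-vs-rest)."""
    tags = sorted({tag for _, tag in data})
    weights = {tag: {} for tag in tags}
    for _ in range(epochs):
        for word, gold in data:
            feats = extract_features(word)
            for tag, w in weights.items():
                target = 1.0 if tag == gold else -1.0
                # L2 regularization: shrink every weight slightly.
                for f in w:
                    w[f] *= 1.0 - lr * lam
                score = sum(w.get(f, 0.0) for f in feats)
                # Hinge-loss subgradient step when the margin is violated.
                if target * score < 1.0:
                    for f in feats:
                        w[f] = w.get(f, 0.0) + lr * target
    return weights


def predict(weights, word):
    """Return the tag whose classifier gives the highest score."""
    feats = extract_features(word)
    return max(weights, key=lambda t: sum(weights[t].get(f, 0.0) for f in feats))


def split_tag(tag):
    """Split a full tag into its part of speech and morphological features."""
    pos, *morph = tag.split("|")
    return pos, morph


if __name__ == "__main__":
    model = train(TRAIN)
    for word in ["мышка", "знает", "старый"]:
        tag = predict(model, word)
        print(word, tag, split_tag(tag))
```

Treating the whole tag as a single class keeps the sketch short, but it also means the classifier can only output tags seen in training; a real tagger would additionally use context features from neighboring words, which are omitted here.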