Building an inflectional stemmer for Bulgarian

  • Authors:
  • Preslav Nakov

  • Affiliations:
  • Department of Electrical Engineering and Computer Science, University of California at Berkeley

  • Venue:
  • CompSysTech '03 Proceedings of the 4th international conference conference on Computer systems and technologies: e-Learning
  • Year:
  • 2003

Quantified Score

Hi-index 0.00

Visualization

Abstract

The paper starts with an overview of the most important approaches to stemming for English as well as for some Slavic languages. Then, the design, implementation and evaluation of an inflectional stemmer for Bulgarian are described. The problem is addressed as a machine-learning task from a large morphological dictionary. A detailed automatic evaluation for different parameter values in terms of under-stemming, over-stemming and coverage is provided.