Shallow language processing architecture for Bulgarian

  • Authors:
  • Hristo Tanev; Ruslan Mitkov

  • Affiliations:
  • Centro per la Ricerca Scientifica e Tecnologica, Trento, Italy; Languages and Social Studies, Wolverhampton, UK

  • Venue:
  • COLING '02: Proceedings of the 19th International Conference on Computational Linguistics, Volume 1
  • Year:
  • 2002

Abstract

This paper describes LINGUA, an architecture for text processing in Bulgarian. First, the pre-processing modules for tokenisation, sentence splitting, paragraph segmentation, part-of-speech tagging, clause chunking and noun phrase extraction are outlined. Next, the anaphora resolution module is described in more detail. Evaluation results are reported for each processing task.