The paper begins with an overview of the main approaches to stemming for English and for some Slavic languages. It then describes the design, implementation, and evaluation of an inflectional stemmer for Bulgarian. The problem is treated as a machine-learning task, with the stemmer learned from a large morphological dictionary. A detailed automatic evaluation reports under-stemming, over-stemming, and coverage for different parameter values.
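
As a rough illustration of the three evaluation measures named above, the following Python sketch scores a stemmer against a dictionary that maps each lemma to its inflected forms. The definitions used here are assumptions made for illustration, not necessarily those of the paper: under-stemming is the share of same-lemma word pairs that receive different stems, over-stemming is the share of different-lemma pairs that receive the same stem, and coverage is the share of word forms for which the stemmer returns a stem at all. The names evaluate_stemmer, toy_stem, and toy_dictionary are hypothetical, and the toy data is English rather than Bulgarian.

```python
from itertools import combinations


def evaluate_stemmer(stem, lemma_to_forms):
    """Score a stemmer against a lemma -> word forms dictionary.

    `stem` takes a word and returns its stem, or None when no rule applies.
    The three measures follow the assumed definitions in the lead-in text.
    """
    stems = {}
    for lemma, forms in lemma_to_forms.items():
        for form in forms:
            stems[(lemma, form)] = stem(form)

    items = list(stems.items())
    # Only forms the stemmer actually handled take part in the pair counts.
    covered = [(key, s) for key, s in items if s is not None]

    same_total = same_split = diff_total = diff_merged = 0
    for ((lemma_a, _), stem_a), ((lemma_b, _), stem_b) in combinations(covered, 2):
        if lemma_a == lemma_b:
            same_total += 1
            same_split += stem_a != stem_b   # forms of one lemma kept apart
        else:
            diff_total += 1
            diff_merged += stem_a == stem_b  # forms of two lemmas conflated

    return {
        "coverage": len(covered) / len(items) if items else 0.0,
        "under_stemming": same_split / same_total if same_total else 0.0,
        "over_stemming": diff_merged / diff_total if diff_total else 0.0,
    }


def toy_stem(word):
    """Hypothetical suffix stripper, used only to exercise the evaluation."""
    for suffix in ("ing", "ed", "s"):
        if word.endswith(suffix) and len(word) > len(suffix) + 2:
            return word[: -len(suffix)]
    return None  # no rule applies, so the word counts as not covered


# Hypothetical toy dictionary (English, for readability).
toy_dictionary = {
    "walk": ["walks", "walked", "walking"],
    "wall": ["walls", "walled"],
    "run": ["runs", "running", "ran"],
}

scores = evaluate_stemmer(toy_stem, toy_dictionary)
print(scores)
```

On this toy data the stemmer covers 7 of 8 forms, splits "runs" and "running" into different stems (one under-stemming error among five same-lemma pairs), and conflates no forms of different lemmas. Counting errors over word pairs is one common design choice; a per-lemma or per-rule count would be an equally plausible alternative.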