Dictionary-based amharic: english information retrieval

  • Authors:
  • Atelach Alemu Argaw;Lars Asker;Rickard Cöster;Jussi Karlgren

  • Affiliations:
  • Department of Computer and Systems Sciences, Stockholm University/Royal Institute of Technology, Stockholm;Department of Computer and Systems Sciences, Stockholm University/Royal Institute of Technology, Stockholm;Swedish Institute of Computer Science, Stockholm;Swedish Institute of Computer Science, Stockholm

  • Venue:
  • CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present two approaches to the Amharic – English bilingual track in CLEF 2004. Both experiments use a dictionary based approach to translate the Amharic queries into English Bags-of-words, but while one approach removes non-content bearing words from the Amharic queries based on their IDF value, the other uses a list of English stop words to perform the same task. The resulting translated (English) terms are then submitted to a retrieval engine that supports the Boolean and vector-space models. In our experiments, the second approach (based on a list of English stop words) performs slightly better than the one based on IDF values for the Amharic terms.