Monolingual, bilingual, and GIRT information retrieval at CLEF-2005

Authors:
Jacques Savoy;Pierre-Yves Berger
Affiliations:
Institut interfacultaire d’informatique, Université de Neuchâtel, Neuchâtel, Switzerland;Institut interfacultaire d’informatique, Université de Neuchâtel, Neuchâtel, Switzerland
Venue:
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Year:
2005

Citing 6
Cited 2

Statistical inference in retrieval effectiveness evaluation

Information Processing and Management: an International Journal
Experimentation as a way of life: Okapi at TREC

Information Processing and Management: an International Journal - The sixth text REtrieval conference (TREC-6)
Probabilistic models of information retrieval based on measuring the divergence from randomness

ACM Transactions on Information Systems (TOIS)
Fusion Via a Linear Combination of Scores

Information Retrieval
Combining Multiple Strategies for Effective Monolingual and Cross-Language Retrieval

Information Retrieval
Data fusion for effective european monolingual information retrieval

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images

Benefits of resource-based stemming in hungarian information retrieval

CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
Experiments with monolingual, bilingual, and robust retrieval

CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

For our fifth participation in the CLEF evaluation campaigns, our first objective was to propose an effective and general stopword list as well as a light stemming procedure for the Hungarian, Bulgarian and Portuguese (Brazilian) languages. Our second objective was to obtain a better picture of the relative merit of various search engines when processing documents in those languages. To do so we evaluated our scheme using two probabilistic models and five vector-processing approaches. In the bilingual track, we evaluated both the machine translation and bilingual dictionary approaches applied to automatically translate a query submitted in English into various target languages. Finally, using the GIRT corpora (available in English, German and Russian), we investigated the variations in retrieval effectiveness that resulted when we included or excluded manually assigned keywords attached to the bibliographic records (mainly comprising a title and an abstract).