SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Searching distributed collections with inference networks
SIGIR '95 Proceedings of the 18th annual international ACM SIGIR conference on Research and development in information retrieval
Database merging strategy based on logistic regression
Information Processing and Management: an International Journal
Cross-Language Information Retrieval
Cross-Language Information Retrieval
A study of learning a merge model for multilingual information retrieval
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
On the Selection of the Best Retrieval Result Per Query ---An Alternative Approach to Data Fusion---
FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Ranking multilingual documents using minimal language dependent resources
CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Learning a merge model for multilingual information retrieval
Information Processing and Management: an International Journal
Bilingual and multilingual experiments with the IR-n system
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
SINAI at CLEF 2005: multi-8 two-years-on and multi-8 merging-only tasks
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Architecture and evaluation of BRUJA, a multilingual question answering system
Information Retrieval
SINAI at CLEF 2006 ad hoc robust multilingual track: query expansion using the Google search engine
CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
Hi-index | 0.00 |
A usual strategy to implement CLIR (Cross-Language Information Retrieval) systems is the so-called query translation approach. The user query is translated for each language present in the multilingual collection in order to compute an independent monolingual information retrieval process per language. Thus, this approach divides documents according to language. In this way, we obtain as many different collections as languages. After searching in these corpora and obtaining a result list per language, we must merge them in order to provide a single list of retrieved articles.In this paper, we propose an approach to obtain a single list of relevant documents for CLIR systems driven by query translation. This approach, which we call 2-step RSV (RSV: Retrieval Status Value), is based on the re-indexing of the retrieval documents according to the query vocabulary, and it performs noticeably better than traditional methods.The proposed method requires query vocabulary alignment: given a word for a given query, we must know the translation or translations to the other languages. Because this is not always possible, we have researched on a mixed model. This mixed model is applied in order to deal with queries with partial word-level alignment. The results prove that even in this scenario, 2-step RSV performs better than traditional merging methods.