Using Translation Heuristics to Improve a Multimodal and Multilingual Information Retrieval System

Authors:
Miguel Ángel García-Cumbreras;Maria Teresa Martín-Valdivia;Luis Alfonso Ureña-López;Manuel Carlos Díaz-Galiano;Arturo Montejo-Ráez
Affiliations:
University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain;University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain;University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain;University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain;University of Jaén. Departamento de Informática, Grupo Sistemas Inteligentes de Acceso a la Información, Campus Las Lagunillas, Ed. A3, E-23071, Jaén, Spain
Venue:
WILF '07 Proceedings of the 7th international workshop on Fuzzy Logic and Applications: Applications of Fuzzy Sets Theory
Year:
2007

Citing 4
Cited 0

An algorithm for suffix stripping

Readings in information retrieval
Improving cross language retrieval with triangulated translation

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
The CLEF 2005 cross–language image retrieval track

CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
ImageCLEF 2004: combining image and multi-lingual search for medical image retrieval

CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images

Quantified Score

Hi-index	0.00

Visualization

Abstract

Nowadays, the multimodal nature of the World Wide Web is an evidence. Web sites which include video files, pictures, music and text have become widespread. Furthermore, multimodal collections in several languages demand to apply multilingual information retrieval strategies. This paper describes a new retrieval technique applied on a multimodal and multilingual system that have been tested on two different multilingual image collections. The system applies several machine translators and implements some novel heuristics. These heuristics explore a variety of ways to combine the translations obtained from the given set of translators, and the configuration of the retrieval model by using different weighting functions, and also studying the effect of pseudo-relevance feedback(PRF) on this domain. Our results show interesting effects by these variations, allowing the determination of the parameters for the best retrieval model on this data and reporting the loss in performance on each language.