Using information gain to improve multi-modal information retrieval systems

Authors:
M. T. Martín-Valdivia;M. C. Díaz-Galiano;A. Montejo-Raez;L. A. Ureña-López
Affiliations:
Departamento de Informática, Campus Las Lagunillas, s/n., University of Jaén, Jaén E-23071, Spain;Departamento de Informática, Campus Las Lagunillas, s/n., University of Jaén, Jaén E-23071, Spain;Departamento de Informática, Campus Las Lagunillas, s/n., University of Jaén, Jaén E-23071, Spain;Departamento de Informática, Campus Las Lagunillas, s/n., University of Jaén, Jaén E-23071, Spain
Venue:
Information Processing and Management: an International Journal
Year:
2008

Citing 8
Cited 3

Evaluating evaluation measure stability

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Content-based query of image databases: inspirations from text retrieval

Pattern Recognition Letters - Selected papers from the 11th scandinavian conference on image analysis
Induction of Decision Trees

Machine Learning
A Comparative Study on Feature Selection in Text Categorization

ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Information-Theoretic Measures for Anomaly Detection

SP '01 Proceedings of the 2001 IEEE Symposium on Security and Privacy
Content-based multimedia information retrieval: State of the art and challenges

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
The CLEF 2005 cross–language image retrieval track

CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Overview of the ImageCLEF 2006 photographic retrieval and object annotation tasks

CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval

Using web sources for improving video categorization

Journal of Intelligent Information Systems
Generating web-based corpora for video transcripts categorization

Expert Systems with Applications: An International Journal
Statistical cross-language Web content quality assessment

Knowledge-Based Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Nowadays, access to information requires managing multimedia databases effectively, and so, multi-modal retrieval techniques (particularly images retrieval) have become an active research direction. In the past few years, a lot of content-based image retrieval (CBIR) systems have been developed. However, despite the progress achieved in the CBIR, the retrieval accuracy of current systems is still limited and often worse than only textual information retrieval systems. In this paper, we propose to combine content-based and text-based approaches to multi-modal retrieval in order to achieve better results and overcome the lacks of these techniques when they are taken separately. For this purpose, we use a medical collection that includes both images and non-structured text. We retrieve images from a CBIR system and textual information through a traditional information retrieval system. Then, we combine the results obtained from both systems in order to improve the final performance. Furthermore, we use the information gain (IG) measure to reduce and improve the textual information included in multi-modal information retrieval systems. We have carried out several experiments that combine this reduction technique with a visual and textual information merger. The results obtained are highly promising and show the profit obtained when textual information is managed to improve conventional multi-modal systems.