Multimodal fusion for image retrieval using matrix factorization

Authors:
Juan C. Caicedo;Fabio A. González
Affiliations:
Universidad Nacional de Colombia;Universidad Nacional de Colombia
Venue:
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Year:
2012

Citing 5
Cited 0

Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Image retrieval: Ideas, influences, and trends of the new age

ACM Computing Surveys (CSUR)
Semantic spaces revisited: investigating the performance of auto-annotation and semantic retrieval using semantic spaces

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
The MIR flickr retrieval evaluation

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Multimodal representation, indexing, automated annotation and retrieval of image collections via non-negative matrix factorization

Neurocomputing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Image collections on the internet and other sources of information can naturally include attached text descriptions. This work considers the problem of fusing two data modalities: visual content and text keywords, to allow a flexible image indexing scheme. The proposed strategy learns multimodal relationships using matrix reconstruction principles and factorization algorithms, allowing one data modality to be represented in another modality space. We further exploit this exchangeability property, to fuse the modalities in any of the representation spaces by backprojecting predicted data to the input space. An experimental evaluation was carried out on the Corel 5K and MIRFlickr data sets using example images without text as query paradigm. Experimental results demonstrate the ability of the proposed strategy to find multimodal links between data and make them useful to improve the image retrieval performance.