Late fusion of heterogeneous methods for multimedia image retrieval

Authors:
Hugo Jair Escalante;Carlos A. Hérnadez;Luis Enrique Sucar;Manuel Montes
Affiliations:
National Institute of Astrophysics, Optics and Electronics, Puebla, Mexico;National Institute of Astrophysics, Optics and Electronics, Puebla, Mexico;National Institute of Astrophysics, Optics and Electronics, Puebla, Mexico;National Institute of Astrophysics, Optics and Electronics, Puebla, Mexico
Venue:
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Year:
2008

Citing 13
Cited 11

Unifying textual and visual cues for content-based image retrieval on the World Wide Web

Computer Vision and Image Understanding - Special issue on content-based access for image and video libraries
Modern Information Retrieval

Modern Information Retrieval
Analysing the performance of visual, concept and text features in content-based video retrieval

Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval
Early versus late fusion in semantic video analysis

Proceedings of the 13th annual ACM international conference on Multimedia
Early versus late fusion in semantic video analysis

Proceedings of the 13th annual ACM international conference on Multimedia
Proceedings of the international workshop on TRECVID video summarization

The 15th ACM International Conference on Multimedia 2007
Image retrieval: Ideas, influences, and trends of the new age

ACM Computing Surveys (CSUR)
Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task

Advances in Multilingual and Multimodal Information Retrieval
FIRE in ImageCLEF 2007: Support Vector Machines and Logistic Models to Fuse Image Descriptors for Photo Retrieval

Advances in Multilingual and Multimodal Information Retrieval
Towards Annotation-Based Query and Document Expansion for Image Retrieval

Advances in Multilingual and Multimodal Information Retrieval
Markov random fields and spatial information to improve automatic image annotation

PSIVT'07 Proceedings of the 2nd Pacific Rim conference on Advances in image and video technology
UNED at ImageCLEF 2005: automatically structured queries with named entities over metadata

CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Approaches of using a word-image ontology and an annotated image corpus as intermedia for cross-language image retrieval

CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval

SNDocRank: a social network-based video search ranking framework

Proceedings of the international conference on Multimedia information retrieval
Annotation-based expansion and late fusion of mixed methods for multimedia image retrieval

CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
A new approach to cross-modal multimedia retrieval

Proceedings of the international conference on Multimedia
A relevant image search engine with late fusion: mixing the roles of textual and visual descriptors

Proceedings of the 16th international conference on Intelligent user interfaces
Semantic combination of textual and visual information in multimedia retrieval

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
The effects of heterogeneous information combination on large scale social image search

Proceedings of the Third International Conference on Internet Multimedia Computing and Service
Multimodal indexing based on semantic cohesion for image retrieval

Information Retrieval
Effective heterogeneous similarity measure with nearest neighbors for cross-media retrieval

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Leveraging high-level and low-level features for multimedia event detection

Proceedings of the 20th ACM international conference on Multimedia
Distributional semantics with eyes: using image analysis to improve computational representations of word meaning

Proceedings of the 20th ACM international conference on Multimedia
Parallel field alignment for cross media retrieval

Proceedings of the 21st ACM international conference on Multimedia

Quantified Score

Hi-index	0.00

Visualization

Abstract

Late fusion of independent retrieval methods is the simpler approach and a widely used one for combining visual and textual information for the search process. Usually each retrieval method is based on a single modality, or even, when several methods are considered per modality, all of them use the same information for indexing/querying. The latter reduces the diversity and complementariness of documents considered for the fusion, as a consequence the performance of the fusion approach is poor. In this paper we study the combination of multiple heterogeneous methods for image retrieval in annotated collections. Heterogeneousness is considered in terms of i) the modality in which the methods are based on, ii) in the information they use for indexing/querying and iii) in the individual performance of the methods. Different settings for the fusion are considered including weighted, global, per-modality and hierarchical. We report experimental results, in an image retrieval benchmark, that show that the proposed combination outperforms significantly any of the individual methods we consider. Retrieval performance is comparable to the best performance obtained in the context of ImageCLEF2007. An interesting result is that even methods that perform poor (individually) resulted very useful to the fusion strategy. Furthermore, opposed to work reported in the literature, better results were obtained by assigning a low weight to text-based methods. The main contribution of this paper is experimental, several interesting findings are reported that motivate further research on diverse subjects.