Unifying textual and visual cues for content-based image retrieval on the World Wide Web
Computer Vision and Image Understanding - Special issue on content-based access for image and video libraries
Modern Information Retrieval
Analysing the performance of visual, concept and text features in content-based video retrieval
Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval
Early versus late fusion in semantic video analysis
Proceedings of the 13th annual ACM international conference on Multimedia
Early versus late fusion in semantic video analysis
Proceedings of the 13th annual ACM international conference on Multimedia
Proceedings of the international workshop on TRECVID video summarization
The 15th ACM International Conference on Multimedia 2007
Image retrieval: Ideas, influences, and trends of the new age
ACM Computing Surveys (CSUR)
Overview of the ImageCLEFphoto 2007 Photographic Retrieval Task
Advances in Multilingual and Multimodal Information Retrieval
Advances in Multilingual and Multimodal Information Retrieval
Towards Annotation-Based Query and Document Expansion for Image Retrieval
Advances in Multilingual and Multimodal Information Retrieval
Markov random fields and spatial information to improve automatic image annotation
PSIVT'07 Proceedings of the 2nd Pacific Rim conference on Advances in image and video technology
UNED at ImageCLEF 2005: automatically structured queries with named entities over metadata
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
SNDocRank: a social network-based video search ranking framework
Proceedings of the international conference on Multimedia information retrieval
Annotation-based expansion and late fusion of mixed methods for multimedia image retrieval
CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
A new approach to cross-modal multimedia retrieval
Proceedings of the international conference on Multimedia
A relevant image search engine with late fusion: mixing the roles of textual and visual descriptors
Proceedings of the 16th international conference on Intelligent user interfaces
Semantic combination of textual and visual information in multimedia retrieval
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
The effects of heterogeneous information combination on large scale social image search
Proceedings of the Third International Conference on Internet Multimedia Computing and Service
Multimodal indexing based on semantic cohesion for image retrieval
Information Retrieval
Effective heterogeneous similarity measure with nearest neighbors for cross-media retrieval
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Leveraging high-level and low-level features for multimedia event detection
Proceedings of the 20th ACM international conference on Multimedia
Proceedings of the 20th ACM international conference on Multimedia
Parallel field alignment for cross media retrieval
Proceedings of the 21st ACM international conference on Multimedia
Hi-index | 0.00 |
Late fusion of independent retrieval methods is the simpler approach and a widely used one for combining visual and textual information for the search process. Usually each retrieval method is based on a single modality, or even, when several methods are considered per modality, all of them use the same information for indexing/querying. The latter reduces the diversity and complementariness of documents considered for the fusion, as a consequence the performance of the fusion approach is poor. In this paper we study the combination of multiple heterogeneous methods for image retrieval in annotated collections. Heterogeneousness is considered in terms of i) the modality in which the methods are based on, ii) in the information they use for indexing/querying and iii) in the individual performance of the methods. Different settings for the fusion are considered including weighted, global, per-modality and hierarchical. We report experimental results, in an image retrieval benchmark, that show that the proposed combination outperforms significantly any of the individual methods we consider. Retrieval performance is comparable to the best performance obtained in the context of ImageCLEF2007. An interesting result is that even methods that perform poor (individually) resulted very useful to the fusion strategy. Furthermore, opposed to work reported in the literature, better results were obtained by assigning a low weight to text-based methods. The main contribution of this paper is experimental, several interesting findings are reported that motivate further research on diverse subjects.