Short Communication: A multimedia retrieval framework highlighting agents and coordinating their interactions to address the semantic gap

Authors:
M. Belkhatir;H. C. Thiem
Affiliations:
Lyon Institute of Technology, Université de Lyon I, Campus de la Doua, 69100, France;Lyon Institute of Technology, Université de Lyon I, Campus de la Doua, 69100, France
Venue:
Expert Systems with Applications: An International Journal
Year:
2010

Citing 10
Cited 0

Unifying Keywords and Visual Contents in Image Retrieval

IEEE MultiMedia
Automatic Annotation and Retrieval of Images

World Wide Web
Modeling annotated data

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Evaluating the impact of selection noise in community-based web search

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Image annotations by combining multiple evidence & wordNet

Proceedings of the 13th annual ACM international conference on Multimedia
A survey of content-based image retrieval with high-level semantics

Pattern Recognition
Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories

Computer Vision and Image Understanding
The Google Similarity Distance

IEEE Transactions on Knowledge and Data Engineering
Supervised Learning of Semantic Classes for Image Annotation and Retrieval

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiple Bernoulli relevance models for image and video annotation

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Quantified Score

Hi-index	12.05

Visualization

Abstract

State-of-the-art approaches for the semantic characterization of the visual content rely on segmentation techniques outputting static entities such as rectangular regions, blobs or multimedia objects. Once these entities are highlighted, additional external information allowing to refine this characterization within knowledge bases (e.g. the fact that semantic concepts sky and sun corresponding to these entities often co-occur) is ignored due to the difficulty of integrating it a priori within these static approaches. We formulate the hypothesis that this information is important in the process of highlighting the semantic visual content and propose an architecture based on image agents, abstract structures representing the visual entities of the image content, which integrate it dynamically. For this, we investigate the formation of semantic concepts within a population of multimedia agents. We first propose a learning framework linking the automatically-extracted visual content to these agents. We then develop an architecture allowing the communication of image agents about the perceived semantic concepts by taking into account the external information incrementally. We validate our proposition in a system-based evaluation framework on a corpus of real world color photographs.