Combining global, regional and contextual features for automatic image annotation

  • Authors:
  • Yong Wang;Tao Mei;Shaogang Gong;Xian-Sheng Hua

  • Affiliations:
  • Department of Computer Science, Queen Mary, University of London, London E1 4NS, UK;Microsoft Research Asia, Beijing 100190, PR China;Department of Computer Science, Queen Mary, University of London, London E1 4NS, UK;Microsoft Research Asia, Beijing 100190, PR China

  • Venue:
  • Pattern Recognition
  • Year:
  • 2009

Abstract

This paper presents a novel approach to automatic image annotation which combines global, regional, and contextual features through an extended cross-media relevance model. Unlike typical image annotation methods, which use either global or regional features exclusively and neglect the textual context among the annotated words, the proposed approach incorporates all three kinds of information, each helpful for describing image semantics, and annotates images by estimating their joint probability. Specifically, we describe the global features as a distribution vector of visual topics and model the textual context as a multinomial distribution. The global features capture the distribution of visual topics over an image, while the textual context relaxes the assumption of mutual independence among annotated words that is commonly adopted in most existing methods. Both the global features and the textual context are learned from the training data by a probabilistic latent semantic analysis (pLSA) approach. Experiments on the 5k Corel image set show that combining these three kinds of information is beneficial for image annotation.
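The abstract describes the model only at a high level. The following Python sketch illustrates the general idea of a cross-media relevance-model-style word scorer extended with a global topic-similarity term: the toy corpus, the Bhattacharyya similarity used to compare pLSA topic distributions, the Jelinek-Mercer smoothing parameter `lam`, and all function names are assumptions made for illustration, not the paper's exact estimator; the multinomial word-context term is omitted for brevity.

```python
# Minimal sketch (assumed formulation, not the authors' exact model):
# score a candidate word for a test image by summing, over training images,
# a global topic-similarity term times smoothed word and blob probabilities.
import numpy as np

def smoothed_prob(count_in_image, image_len, count_in_corpus, corpus_len, lam):
    """Jelinek-Mercer smoothing of a unigram probability."""
    return (1 - lam) * count_in_image / max(image_len, 1) + lam * count_in_corpus / corpus_len

def score_word(word, query_blobs, query_topics, train, corpus_word_counts,
               corpus_blob_counts, total_words, total_blobs, lam=0.5):
    """Joint score of (word, test image) under a CMRM-style sum over training images."""
    score = 0.0
    for img in train:
        # Global term: Bhattacharyya coefficient between pLSA topic distributions
        # (a stand-in similarity; the paper may use a different form).
        global_term = float(np.sum(np.sqrt(query_topics * img["topics"])))
        # Textual term: smoothed P(word | training image).
        p_w = smoothed_prob(img["words"].count(word), len(img["words"]),
                            corpus_word_counts[word], total_words, lam)
        # Regional term: product of smoothed P(blob | training image).
        p_blobs = 1.0
        for b in query_blobs:
            p_blobs *= smoothed_prob(img["blobs"].count(b), len(img["blobs"]),
                                     corpus_blob_counts.get(b, 0), total_blobs, lam)
        score += global_term * p_w * p_blobs / len(train)  # uniform prior P(J)
    return score

# Toy corpus: two training images with word labels, discrete region (blob) ids,
# and a hypothetical 3-topic pLSA distribution each.
train = [
    {"words": ["sky", "sea"], "blobs": [0, 1, 1], "topics": np.array([0.7, 0.2, 0.1])},
    {"words": ["tiger", "grass"], "blobs": [2, 3], "topics": np.array([0.1, 0.3, 0.6])},
]
vocab = sorted({w for img in train for w in img["words"]})
blob_ids = sorted({b for img in train for b in img["blobs"]})
corpus_word_counts = {w: sum(img["words"].count(w) for img in train) for w in vocab}
corpus_blob_counts = {b: sum(img["blobs"].count(b) for img in train) for b in blob_ids}
total_words = sum(corpus_word_counts.values())
total_blobs = sum(corpus_blob_counts.values())

query_blobs = [0, 1]                      # discrete regions of the test image
query_topics = np.array([0.6, 0.3, 0.1])  # its global visual-topic distribution

scores = {w: score_word(w, query_blobs, query_topics, train, corpus_word_counts,
                        corpus_blob_counts, total_words, total_blobs) for w in vocab}
print(sorted(scores.items(), key=lambda kv: -kv[1]))  # 'sky'/'sea' rank highest here
```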