Latent visual context learning for web image applications

  • Authors:
  • Wengang Zhou;Qi Tian;Yijuan Lu;Linjun Yang;Houqiang Li

  • Affiliations:
  • University of Science and Technology of China, Hefei, China;University of Texas at San Antonio, San Antonio, TX, USA;Texas State University, TX, USA;Microsoft Research Asia, Beijing, China;University of Science and Technology of China, Hefei, China

  • Venue:
  • Pattern Recognition
  • Year:
  • 2011

Abstract

Recently, image representation based on the bag-of-visual-words (BoW) model has been widely applied in image and vision domains. In BoW, a visual codebook of visual words is built, usually by clustering local features, and any novel image is represented by the occurrences of the visual words it contains. Given a set of images, we argue that the significance of each image is determined by the significance of the visual words it contains. Traditionally, the significance of visual words is defined by term frequency-inverse document frequency (tf-idf), which does not necessarily capture the intrinsic visual context. In this paper, we propose a new scheme of latent visual context learning (LVCL). The visual context among images and visual words is formulated through latent semantic context and visual link graph analysis. With LVCL, the importance of individual visual words and images can be distinguished, which facilitates image-level applications such as image re-ranking and canonical image selection. We validate our approach on text-query-based search results returned by Google Image Search. Experimental results demonstrate the effectiveness and potential of LVCL for image re-ranking and canonical image selection over state-of-the-art approaches.
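
For context, the tf-idf baseline that the abstract contrasts LVCL against can be sketched as follows. This is a minimal illustration of the standard weighting scheme, not the proposed LVCL method; the array shapes, the `tfidf_weights` helper, and the score-by-sum step at the end are assumptions chosen purely for illustration.

```python
import numpy as np

def tfidf_weights(tf):
    """Compute tf-idf weights for a bag-of-visual-words matrix.

    tf: (n_images, n_visual_words) array of visual-word occurrence counts.
    Returns an array of the same shape with tf-idf weighted entries.
    """
    n_images = tf.shape[0]
    # Document frequency: number of images containing each visual word.
    df = np.count_nonzero(tf, axis=0)
    # Inverse document frequency; +1 in the denominator guards against
    # visual words that appear in no image.
    idf = np.log(n_images / (df + 1.0)) + 1.0
    return tf * idf

# Toy example (hypothetical data): 3 images described by 4 visual words.
tf = np.array([[2, 0, 1, 0],
               [0, 3, 1, 0],
               [1, 1, 1, 5]], dtype=float)

weights = tfidf_weights(tf)

# Under the baseline, an image's significance could be scored from the
# significance of its visual words, e.g. by summing its tf-idf weights.
# The abstract argues this weighting misses the intrinsic visual context,
# which LVCL instead derives from latent semantic context and a visual
# link graph.
image_scores = weights.sum(axis=1)
print(image_scores)
```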