Video retrieval based on object discovery
Computer Vision and Image Understanding
Unsupervised modeling of objects and their hierarchical contextual interactions
Journal on Image and Video Processing - Special issue on patches in vision
Foreground Focus: Unsupervised Learning from Partially Matching Images
International Journal of Computer Vision
Learning natural scene categories by selective multi-scale feature extraction
Image and Vision Computing
Unsupervised identification of multiple objects of interest from multiple images: dISCOVER
ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part II
A spatially aware generative model for image classification, topic discovery and segmentation
ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Semantics extraction from images
Knowledge-driven multimedia information extraction and ontology evolution
Probabilistic semantic component descriptor
Multimedia Tools and Applications
Multimedia Tools and Applications
Hi-index | 0.00 |
The bag of visual words representation has attracted a lot of attention in the computer vision community. In particular, Probabilistic Latent Semantic Analysis (PLSA) has been applied to object recognition as an unsupervised technique built on top of the bag of visual words representation. PLSA, however, does not explicitly consider the spatial information of the visual words. In this paper, we propose an iterative technique, where a modified form of PLSA provides location and scale estimates of the foreground object through the estimated latent semantic. In return, the updated location and scale estimates will improve the estimate of the latent semantic. We call this iterative algorithm Semantic-Shift. We show results with significant improvements over PLSA.