Continuous visual vocabulary models for pLSA-based scene recognition

  • Authors:
  • Eva Hörster; Rainer Lienhart; Malcolm Slaney

  • Affiliations:
  • University of Augsburg, Augsburg, Germany; University of Augsburg, Augsburg, Germany; Yahoo! Research, Santa Clara, CA, USA

  • Venue:
  • CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
  • Year:
  • 2008


Abstract

Topic models such as probabilistic Latent Semantic Analysis (pLSA) and Latent Dirichlet Allocation (LDA) have been shown to perform well in various image content analysis tasks. However, because these models originate in the text domain, almost all prior work uses discrete vocabularies even when the models are applied to images. Thus, in these works the continuous local features used to describe an image must be quantized to fit the model. In this work we propose and evaluate three different extensions to the pLSA framework in which words are modeled as continuous feature-vector distributions rather than crudely quantized high-dimensional descriptors. The performance of these continuous vocabulary models is compared on an automatic scene recognition task. Our experiments clearly show that the continuous approaches outperform the standard pLSA model. All equations required for parameter estimation and inference are given for each of the three models.
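The contrast the abstract draws can be sketched in code: standard pLSA requires quantizing continuous local descriptors into a discrete vocabulary (typically via k-means), whereas a continuous-vocabulary model treats each visual word as a distribution (e.g. a Gaussian) and assigns descriptors soft posteriors. The following is a minimal illustrative sketch, not the paper's actual models; the toy 2-D descriptors, the isotropic variance, and all function names are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for local image descriptors (e.g. 128-D SIFT); 2-D for brevity.
descriptors = rng.normal(size=(500, 2))

# --- Discrete vocabulary: hard k-means quantization (standard pLSA input) ---
def kmeans_quantize(X, k, iters=20, seed=0):
    """Cluster descriptors; return hard visual-word indices and centers."""
    r = np.random.default_rng(seed)
    centers = X[r.choice(len(X), size=k, replace=False)].copy()
    for _ in range(iters):
        # Squared Euclidean distance from every descriptor to every center.
        d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(-1)
        labels = d2.argmin(1)
        for j in range(k):
            pts = X[labels == j]
            if len(pts):
                centers[j] = pts.mean(0)
    return labels, centers

labels, centers = kmeans_quantize(descriptors, k=8)

# --- Continuous vocabulary: each "word" is a Gaussian; a descriptor gets a
#     soft posterior p(word | x) instead of a single quantized index. ---
def gaussian_word_posteriors(X, means, var=0.5):
    """Responsibilities p(w | x) under isotropic Gaussians; rows sum to 1."""
    d2 = ((X[:, None, :] - means[None, :, :]) ** 2).sum(-1)
    log_p = -0.5 * d2 / var
    log_p -= log_p.max(1, keepdims=True)  # subtract max for numerical stability
    p = np.exp(log_p)
    return p / p.sum(1, keepdims=True)

posteriors = gaussian_word_posteriors(descriptors, centers)
```

In the discrete case each descriptor contributes one count to a single word; in the continuous case its probability mass is spread over all words, which avoids the quantization error the paper's extensions are designed to remove.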