Scene classification via pLSA

Authors:
Anna Bosch;Andrew Zisserman;Xavier Muñoz
Affiliations:
Computer Vision and Robotics Group, University of Girona, Girona;Robotics Research Group, University of Oxford, Oxford;Computer Vision and Robotics Group, University of Girona, Girona
Venue:
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Year:
2006

Citing 14
Cited 125

Probabilistic latent semantic indexing

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Unsupervised learning by probabilistic latent semantic analysis

Machine Learning
Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons

International Journal of Computer Vision
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

International Journal of Computer Vision
Indoor-Outdoor Image Classification

CAIVD '98 Proceedings of the 1998 International Workshop on Content-Based Access of Image and Video Databases (CAIVD '98)
Video Google: A Text Retrieval Approach to Object Matching in Videos

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Scale & Affine Invariant Interest Point Detectors

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Histograms of Oriented Gradients for Human Detection

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
A Bayesian Hierarchical Model for Learning Natural Scene Categories

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Discovering Objects and their Localization in Images

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Modeling Scenes with Local Descriptors and Latent Aspects

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Hidden semantic concept discovery in region based image retrieval

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Image classification for content-based indexing

IEEE Transactions on Image Processing

Review: Which is the best way to organize/classify images by content?

Image and Vision Computing
Categorization of natural scenes: Local versus global information and the role of color

ACM Transactions on Applied Perception (TAP)
Image retrieval on large-scale image databases

Proceedings of the 6th ACM international conference on Image and video retrieval
Multi-level local descriptor quantization for bag-of-visterms image representation

Proceedings of the 6th ACM international conference on Image and video retrieval
Representing shape with a spatial pyramid kernel

Proceedings of the 6th ACM international conference on Image and video retrieval
TV ad video categorization with probabilistic latent concept learning

Proceedings of the international workshop on Workshop on multimedia information retrieval
Object categorization

Foundations and Trends® in Computer Graphics and Vision
Describing Visual Scenes Using Transformed Objects and Parts

International Journal of Computer Vision
Content visualization and management of geo-located image databases

CHI '08 Extended Abstracts on Human Factors in Computing Systems
World-scale mining of objects and events from community photo collections

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Scene modeling in global-local view for scene classification

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Language modeling for bag-of-visual words image categorization

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Continuous visual vocabulary modelsfor pLSA-based scene recognition

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Comparing Local Feature Descriptors in pLSA-Based Image Models

Proceedings of the 30th DAGM symposium on Pattern Recognition
Content-based mood classification for photos and music: a generic multi-modal classification framework and evaluation approach

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Performance evaluation of local colour invariants

Computer Vision and Image Understanding
Natural Versus Artificial Scene Classification by Ordering Discrete Fourier Power Spectra

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
A Multimodal Constellation Model for Object Category Recognition

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Spatial Hierarchy of Textons Distributions for Scene Classification

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Annotating images and image objects using a hierarchical dirichlet process model

Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008
Region and constellations based categorization of images with unsupervised graph learning

Image and Vision Computing
Contextual classification of image patches with latent aspect models

Journal on Image and Video Processing - Special issue on patches in vision
PLSI: The True Fisher Kernel and beyond

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Reducing Keypoint Database Size

ICIAP '09 Proceedings of the 15th International Conference on Image Analysis and Processing
Scene classification using pLSA with visterm spatial location

IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Semantic concept annotation based on audio PLSA model

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Places clustering of full-length film key-framesusing latent aspect modeling over SIFT matches

IEEE Transactions on Circuits and Systems for Video Technology
Multilayer pLSA for multimodal image retrieval

Proceedings of the ACM International Conference on Image and Video Retrieval
Image categorization via robust pLSA

Pattern Recognition Letters
Scene Categorization by Introducing Contextual Information to the Visual Words

ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part I
Filtering adult image content with topic models

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Applying pLSA to region-based image categorization with soft vector quantization

Proceedings of the First International Conference on Internet Multimedia Computing and Service
Scene categorization via contextual visual words

Pattern Recognition
Progressive randomization: Seeing the unseen

Computer Vision and Image Understanding
Distances and weighting schemes for bag of visual words image retrieval

Proceedings of the international conference on Multimedia information retrieval
OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning

International Journal of Computer Vision
Learning natural scene categories by selective multi-scale feature extraction

Image and Vision Computing
Expression microarray classification using topic models

Proceedings of the 2010 ACM Symposium on Applied Computing
Semi-latent Dirichlet allocation: a hierarchical model for human action recognition

Proceedings of the 2nd conference on Human motion: understanding, modeling, capture and animation
Scene classification based on multi-resolution orientation histogram of Gabor features

ICVS'08 Proceedings of the 6th international conference on Computer vision systems
Geo-located image grouping using latent descriptions

ICVS'08 Proceedings of the 6th international conference on Computer vision systems
Model-based subspace clustering of non-Gaussian data

Neurocomputing
A hybrid unsupervised image re-ranking approach with latent topic contents

Proceedings of the ACM International Conference on Image and Video Retrieval
Multi modal semantic indexing for image retrieval

Proceedings of the ACM International Conference on Image and Video Retrieval
Scene classification based on local autocorrelation of similarities with subspaces

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
A Dirichlet process mixture of generalized Dirichlet distributions for proportional data modeling

IEEE Transactions on Neural Networks
Semantic modeling of natural scenes based on contextual Bayesian networks

Pattern Recognition
IPSILON: incremental parsing for semantic indexing of latent concepts

IEEE Transactions on Image Processing
Scene classification using multiple features in a two-stage probabilistic classification framework

Neurocomputing
The third eye: mining the visual cognition across multi-language communities

Proceedings of the international conference on Multimedia
An efficient face recognition through combining local features and statistical feature extraction

PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
Information theoretical Kernels for generative embeddings based on hidden Markov models

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Biologically-aware latent dirichlet allocation (BaLDA) for the classification of expression microarray

PRIB'10 Proceedings of the 5th IAPR international conference on Pattern recognition in bioinformatics
Why did the person cross the road (there)? scene understanding using probabilistic logic models and common sense reasoning

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Towards optimal naive bayes nearest neighbor

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Constrained spectral clustering via exhaustive and efficient constraint propagation

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Local autocorrelation of similarities with subspaces for shift invariant scene classification

Pattern Recognition
Characteristic pattern discovery in videos

Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing
Correlated PLSA for image clustering

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Generating representative views of landmarks via scenic theme detection

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Regularized semi-supervised latent dirichlet allocation for visual concept learning

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
A multimodal constellation model for object image classification

Journal on Image and Video Processing - Special issue on selected papers from multimedia modeling conference 2009
Exploiting Textons distributions on spatial hierarchy for scene classification

Journal on Image and Video Processing - Special issue on selected papers from multimedia modeling conference 2009
Pyramid center-symmetric local binary/trinary patterns for effective pedestrian detection

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part IV
Integrated image representation based natural scene classification

Expert Systems with Applications: An International Journal
Semantics extraction from images

Knowledge-driven multimedia information extraction and ontology evolution
Hybrid generative-discriminative nucleus classification of renal cell carcinoma

SIMBAD'11 Proceedings of the First international conference on Similarity-based pattern recognition
ReVision: automated classification, analysis and redesign of chart images

Proceedings of the 24th annual ACM symposium on User interface software and technology
Combining latent semantic learning and reduced hypergraph learning for semi-supervised image categorization

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Renal cancer cell classification using generative embeddings and information theoretic kernels

PRIB'11 Proceedings of the 6th IAPR international conference on Pattern recognition in bioinformatics
A comparison on score spaces for expression microarray data classification

PRIB'11 Proceedings of the 6th IAPR international conference on Pattern recognition in bioinformatics
Sparse patch-histograms for object classification in cluttered images

DAGM'06 Proceedings of the 28th conference on Pattern Recognition
Discriminative compact pyramids for object and scene recognition

Pattern Recognition
Feature fusion within local region using localized maximum-margin learning for scene categorization

Pattern Recognition
Learning semantic features for action recognition via diffusion maps

Computer Vision and Image Understanding
Indoor Mobile Robotics at Grima, PUC

Journal of Intelligent and Robotic Systems
Single-Histogram class models for image segmentation

ICVGIP'06 Proceedings of the 5th Indian conference on Computer Vision, Graphics and Image Processing
Scene location guide by image-based retrieval

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Scene gist: a holistic generative model of natural image

ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part II
A variational statistical framework for object detection

ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part II
Supervising latent topic model for maximum-margin text classification and regression

PAKDD'10 Proceedings of the 14th Pacific-Asia conference on Advances in Knowledge Discovery and Data Mining - Volume Part I
Modulating Shape Features by Color Attention for Object Recognition

International Journal of Computer Vision
Novelty detection in wildlife scenes through semantic context modelling

Pattern Recognition
Local co-occurrence features in subspace obtained by KPCA of local blob visual words for scene classification

Pattern Recognition
Sports video classification using bag of words model

ACIIDS'12 Proceedings of the 4th Asian conference on Intelligent Information and Database Systems - Volume Part III
Halfway through the semantic gap: Prosemantic features for image retrieval

Information Sciences: an International Journal
Improving Image Classification Using Semantic Attributes

International Journal of Computer Vision
Use of color information for keypoints detection and descriptors construction

IScIDE'11 Proceedings of the Second Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering
Compact and adaptive spatial pyramids for scene recognition

Image and Vision Computing
Cross community news event summary generation based on collaborative ranking

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service
Stel Component Analysis: Joint Segmentation, Modeling and Recognition of Objects Classes

International Journal of Computer Vision
Spatial pooling of heterogeneous features for image applications

Proceedings of the 20th ACM international conference on Multimedia
Combining information theoretic kernels with generative embeddings for classification

Neurocomputing
Recognition of occluded objects by reducing feature interactions

Image and Vision Computing
Biologically inspired task oriented gist model for scene classification

Computer Vision and Image Understanding
Local log-euclidean covariance matrix (L2ECM) for image representation and its applications

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Simultaneous image classification and annotation via biased random walk on tri-relational graph

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Spring lattice counting grids: scene recognition using deformable positional constraints

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Visual categorization based on learning contextual probabilistic latent component tree

ICANN'12 Proceedings of the 22nd international conference on Artificial Neural Networks and Machine Learning - Volume Part I
Investigating Topic Models' Capabilities in Expression Microarray Data Classification

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Image retrieval with query-adaptive hashing

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Novel color Gabor-LBP-PHOG (GLP) descriptors for object and scene image classification

Proceedings of the Eighth Indian Conference on Computer Vision, Graphics and Image Processing
Scene classification based on category-specific representations created through prototype feature selection

Proceedings of the 27th Conference on Image and Vision Computing New Zealand
Semantic image clustering using object relation network

CVM'12 Proceedings of the First international conference on Computational Visual Media
Objects as attributes for scene classification

ECCV'10 Proceedings of the 11th European conference on Trends and Topics in Computer Vision - Volume Part I
Latent semantic learning with structured sparse representation for human action recognition

Pattern Recognition
Fusing color and shape for bag-of-words based object recognition

CCIW'13 Proceedings of the 4th international conference on Computational Color Imaging
Affine transforms between image space and color space for invariant local descriptors

Pattern Recognition
Enhanced local binary covariance matrices (ELBCM) for texture analysis and object tracking

Proceedings of the 6th International Conference on Computer Vision / Computer Graphics Collaboration Techniques and Applications
Modeling hidden topics with dual local consistency for image analysis

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part I
Multiple feature fusion based on co-training approach and time regularization for place classification in wearable video

Advances in Multimedia
Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection

Pattern Recognition
Medical image retrieval using bag of meaningful visual words: unsupervised visual vocabulary pruning with PLSA

Proceedings of the 1st ACM international workshop on Multimedia indexing and information retrieval for healthcare
Indoor scene recognition by a mobile robot through adaptive object detection

Robotics and Autonomous Systems
Human-inspired features for natural scene classification

Pattern Recognition Letters
Scene classification using multi-resolution low-level feature combination

Neurocomputing
Infinite Dirichlet mixture models learning via expectation propagation

Advances in Data Analysis and Classification
Online variational learning of generalized Dirichlet mixture models with feature selection

Neurocomputing
A bag-of-semantics model for image clustering

The Visual Computer: International Journal of Computer Graphics
Variational learning of finite Dirichlet mixture models using component splitting

Neurocomputing
New color GPHOG descriptors for object and scene image classification

Machine Vision and Applications
Continuous human action recognition in real time

Multimedia Tools and Applications
Object Bank: An Object-Level Image Representation for High-Level Visual Recognition

International Journal of Computer Vision
3D object retrieval via range image queries in a bag-of-visual-words context

The Visual Computer: International Journal of Computer Graphics
Coloring Action Recognition in Still Images

International Journal of Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

Given a set of images of scenes containing multiple object categories (e.g. grass, roads, buildings) our objective is to discover these objects in each image in an unsupervised manner, and to use this object distribution to perform scene classification. We achieve this discovery using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature, here applied to a bag of visual words representation for each image. The scene classification on the object distribution is carried out by a k-nearest neighbour classifier. We investigate the classification performance under changes in the visual vocabulary and number of latent topics learnt, and develop a novel vocabulary using colour SIFT descriptors. Classification performance is compared to the supervised approaches of Vogel & Schiele [19] and Oliva & Torralba [11], and the semi-supervised approach of Fei Fei & Perona [3] using their own datasets and testing protocols. In all cases the combination of (unsupervised) pLSA followed by (supervised) nearest neighbour classification achieves superior results. We show applications of this method to image retrieval with relevance feedback and to scene classification in videos.