Scene Classification Using a Hybrid Generative/Discriminative Approach

Authors:
Anna Bosch;Andrew Zisserman;Xavier Muñoz
Affiliations:
-;-;-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2008

Citing 0
Cited 87

3D Object Recognition Using Hyper-Graphs and Ranked Local Invariant Features

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Clustering Using Class Specific Hyper Graphs

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
A Novel Video Classification Method Based on Hybrid Generative/Discriminative Models

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
An Integrated Method for Multiple Object Detection and Localization

ISVC '08 Proceedings of the 4th International Symposium on Advances in Visual Computing, Part II
Spatial Weighting for Bag-of-Visual-Words and Its Application in Content-Based Image Retrieval

PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Reducing Keypoint Database Size

ICIAP '09 Proceedings of the 15th International Conference on Image Analysis and Processing
Red Eye Detection through Bag-of-Keypoints Classification

ICIAP '09 Proceedings of the 15th International Conference on Image Analysis and Processing
Scene classification using pLSA with visterm spatial location

IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Dense sampling low-level statistics of local features

Proceedings of the ACM International Conference on Image and Video Retrieval
Learning color names for real-world applications

IEEE Transactions on Image Processing
Randomized Probabilistic Latent Semantic Analysis for Scene Recognition

CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Scene Categorization by Introducing Contextual Information to the Visual Words

ISVC '09 Proceedings of the 5th International Symposium on Advances in Visual Computing: Part I
A novel approach to musical genre classification using probabilistic latent semantic analysis model

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Help-training semi-supervised LS-SVM

IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
A framework for attention-based personal photo manager

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Scene categorization via contextual visual words

Pattern Recognition
Comparing compact codebooks for visual categorization

Computer Vision and Image Understanding
Nearest neighbor based collection OCR

DAS '10 Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
Semantic modeling of natural scenes based on contextual Bayesian networks

Pattern Recognition
Topic models for image annotation and text illustration

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
IPSILON: incremental parsing for semantic indexing of latent concepts

IEEE Transactions on Image Processing
Dictionary learning based object detection and counting in traffic scenes

Proceedings of the Fourth ACM/IEEE International Conference on Distributed Smart Cameras
Towards extensible automatic image annotation with the bag-of-words approach

Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrieval
Scene categorization using boosted back-propagation neural networks

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Learning global and regional features for photo annotation

CLEF'09 Proceedings of the 10th international conference on Cross-language evaluation forum: multimedia experiments
Image-to-class distance metric learning for image classification

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Why did the person cross the road (there)? scene understanding using probabilistic logic models and common sense reasoning

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Improving local descriptors by embedding global and local spatial information

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Local autocorrelation of similarities with subspaces for shift invariant scene classification

Pattern Recognition
New colour SIFT descriptors for image classification with applications to biometrics

International Journal of Biometrics
Bayesian hybrid generative discriminative learning based on finite Liouville mixture models

Pattern Recognition
Boosted scene categorization approach by adjusting inner structures and outer weights of weak classifiers

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part I
Exploiting Textons distributions on spatial hierarchy for scene classification

Journal on Image and Video Processing - Special issue on selected papers from multimedia modeling conference 2009
Online learning for PLSA-based visual recognition

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Indoor scene classification using combined 3D and gist features

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part II
Image classification using spatial pyramid coding and visual word reweighting

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part III
Region Contextual Visual Words for scene categorization

Expert Systems with Applications: An International Journal
Help-Training for semi-supervised support vector machines

Pattern Recognition
Improved learning of I2C distance and accelerating the neighborhood search for image classification

Pattern Recognition
Learning sparse features on-line for image classification

ICIAR'11 Proceedings of the 8th international conference on Image analysis and recognition - Volume Part I
Locally discriminative topic modeling

Pattern Recognition
Building global image features for scene recognition

Pattern Recognition
Supervised visual vocabulary with category information

ACIVS'11 Proceedings of the 13th international conference on Advanced concepts for intelligent vision systems
Geometric Latent Dirichlet Allocation on a Matching Graph for Large-scale Image Datasets

International Journal of Computer Vision
On the spatial extents of SIFT descriptors for visual concept detection

ICVS'11 Proceedings of the 8th international conference on Computer vision systems
Artificial neural networks based war scene classification using various feature extraction methods: a comparative study

AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part III
Shared feature extraction for semi-supervised image classification

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multi-feature pLSA for combining visual features in image annotation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Discriminative compact pyramids for object and scene recognition

Pattern Recognition
Feature fusion within local region using localized maximum-margin learning for scene categorization

Pattern Recognition
Good match exploration using triangle constraint

Pattern Recognition Letters
Image classification based on weighted topics

ICONIP'11 Proceedings of the 18th international conference on Neural Information Processing - Volume Part II
Histogram of Oriented Uniform Patterns for robust place recognition and categorization

International Journal of Robotics Research
Modulating Shape Features by Color Attention for Object Recognition

International Journal of Computer Vision
Local co-occurrence features in subspace obtained by KPCA of local blob visual words for scene classification

Pattern Recognition
Nearest-Neighbor based Metric Functions for indoor scene recognition

Computer Vision and Image Understanding
Color CENTRIST: a color descriptor for scene categorization

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Active refinement of clone anomaly reports

Proceedings of the 34th International Conference on Software Engineering
Compact and adaptive spatial pyramids for scene recognition

Image and Vision Computing
MIFT: A framework for feature descriptors to be mirror reflection invariant

Image and Vision Computing
Scene classification using a multi-resolution bag-of-features model

Pattern Recognition
Allocating images and selecting image collections for distributed visual search

Proceedings of the 4th International Conference on Internet Multimedia Computing and Service
Distributional semantics with eyes: using image analysis to improve computational representations of word meaning

Proceedings of the 20th ACM international conference on Multimedia
Biologically inspired task oriented gist model for scene classification

Computer Vision and Image Understanding
Bag of spatio-visual words for context inference in scene classification

Pattern Recognition
Randomized spatial partition for scene recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part II
Learning hybrid part filters for scene recognition

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
A new biologically inspired color image descriptor

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V
ISABoost: A weak classifier inner structure adjusting based AdaBoost algorithm-ISABoost based application in scene categorization

Neurocomputing
Self organizing natural scene image retrieval

Expert Systems with Applications: An International Journal
Learning image-to-class distance metric for image classification

ACM Transactions on Intelligent Systems and Technology (TIST) - Special section on agent communication, trust in multiagent systems, intelligent tutoring and coaching systems
Fast multi-view segment graph kernel for object classification

Signal Processing
Image region description using orthogonal combination of local binary patterns enhanced with color information

Pattern Recognition
HSOG: a novel local descriptor based on histograms of second order gradients for object categorization

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Weakly supervised codebook learning by iterative label propagation with graph quantization

Signal Processing
A region-centered topic model for object discovery and category-based image segmentation

Pattern Recognition
VISOR: towards on-the-fly large-scale object category retrieval

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Discriminative histograms of local dominant orientation (D-HLDO) for biometric image feature extraction

Pattern Recognition
Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection

Pattern Recognition
Bubble space and place representation in topological maps

International Journal of Robotics Research
Scene image retrieval via re-ranking semantic and packed dense interestpoints

Neurocomputing
Learning semantic concepts from image database with hybrid generative/discriminative approach

Engineering Applications of Artificial Intelligence
Scene classification using multi-resolution low-level feature combination

Neurocomputing
Content-based copy detection through multimodal feature representation and temporal pyramid matching

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Histogram of visual words based on locally adaptive regression kernels descriptors for image feature extraction

Neurocomputing
Coloring Action Recognition in Still Images

International Journal of Computer Vision
HWVP: hierarchical wavelet packet descriptors and their applications in scene categorization and semantic concept retrieval

Multimedia Tools and Applications

Quantified Score

Hi-index	0.15

Visualization

Abstract

We investigate whether dimensionality reduction using a latent generative model is beneficial for the task of weakly supervised scene classification. In detail we are given a set of labelled images of scenes (e.g. coast, forest, city, river, etc) and our objective is to classify a new image into one of these categories. Our approach consists of first discovering latent "topics" using probabilistic Latent Semantic Analysis (pLSA), a generative model from the statistical text literature here applied to a bagof visual words representation for each image, and subsequently training a multi-way classifier on the topic distribution vector for each image. We compare this approach to that of representing each imageby a bag of visual words vector directly, and training a multi-way classifier on these vectors.To this end we introduce a novel vocabulary using dense colour SIFT descriptors, and then investigate the classification performance under changes in the size of the visual vocabulary, the number of latent topics learnt, and the type of discriminative classifier used (k-nearest neighbour or SVM). We achieve superior classification performance to recent publications that have used a bag of visual word representation, in all cases using the authors' own datasets and testing protocols. We also investigate the gain in adding spatial information. We show applications to image retrieval with relevance feedback and to scene classification in videos.