Biologically inspired task oriented gist model for scene classification

Authors:
Yina Han;Guizhong Liu
Affiliations:
School of Marine Engineering, Northwestern Polytechnical University, Xi'an 710072, China and State Key Laboratory of Acoustics, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190, ...;School of Electronic and Information Engineering, Xi'an Jiaotong University, Xi'an 710049, China
Venue:
Computer Vision and Image Understanding
Year:
2013

Citing 30

Nonlinear component analysis as a kernel eigenvalue problem

Neural Computation
A Model of Saliency-Based Visual Attention for Rapid Scene Analysis

IEEE Transactions on Pattern Analysis and Machine Intelligence
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

International Journal of Computer Vision
Context-based vision system for place and object recognition

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Learning the Kernel Matrix with Semidefinite Programming

The Journal of Machine Learning Research
A Bayesian Hierarchical Model for Learning Natural Scene Categories

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2
Perception Strategies in Hierarchical Vision Systems

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Rapid Biologically-Inspired Scene Classification Using Features Shared with Visual Attention

IEEE Transactions on Pattern Analysis and Machine Intelligence
Semantic Modeling of Natural Scenes for Content-Based Image Retrieval

International Journal of Computer Vision
Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories

Computer Vision and Image Understanding
Large Scale Multiple Kernel Learning

The Journal of Machine Learning Research
Robust Object Recognition with Cortex-Like Mechanisms

IEEE Transactions on Pattern Analysis and Machine Intelligence
Multiclass multiple kernel learning

Proceedings of the 24th international conference on Machine learning
Scene Classification Using a Hybrid Generative/Discriminative Approach

IEEE Transactions on Pattern Analysis and Machine Intelligence
Localized multiple kernel learning

Proceedings of the 25th international conference on Machine learning
Object Class Recognition and Localization Using Sparse Features with Limited Receptive Fields

International Journal of Computer Vision
Manifold models for signals and images

Computer Vision and Image Understanding
More generality in efficient multiple kernel learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Biologically inspired mobile robot vision localization

IEEE Transactions on Robotics
Using the forest to see the trees: exploiting context for visual object detection and localization

Communications of the ACM
Context based object categorization: A critical survey

Computer Vision and Image Understanding
Biologically inspired feature manifold for scene classification

IEEE Transactions on Image Processing
A Survey on Transfer Learning

IEEE Transactions on Knowledge and Data Engineering
A Hierarchical GIST Model Embedding Multiple Biological Feasibilities for Scene Classification

ICPR '10 Proceedings of the 2010 20th International Conference on Pattern Recognition
Towards a more discriminative and semantic visual vocabulary

Computer Vision and Image Understanding
Scene classification via pLSA

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
Image classification for content-based indexing

IEEE Transactions on Image Processing

Abstract

Capturing the scene gist accounts for rapid and accurate scene classification in the human visual system. This paper presents a biologically inspired task-oriented gist model (BT-Gist) that attempts to emulate two important attributes of biological gist: a holistic, scene-centered representation of spatial layout, and task-oriented determination of resolution. For the first attribute, we enrich the model of Oliva and Torralba by refining the low-level features in several biologically plausible ways, extending the spatial layout to multiple resolutions, and then applying perceptually meaningful manifold analysis to obtain a set of multi-resolution biologically inspired intrinsic manifold spatial layouts (BMSLs). Since the resolution that best represents the spatial layout varies from task to task, we embody the second attribute as learning the combination of multi-resolution BMSLs with respect to their optimal trade-off between discriminative power and invariance for the task at hand. We cast this in the SVM-based localized multiple kernel learning (LMKL) framework, in which the kernel of each scene gist is approximated as a local combination of the kernels associated with the multi-resolution BMSLs. By exploring the task-specific category distribution pattern over the BMSLs, we define the local model as a category distribution sensitive (CDS) kernel, which accommodates both the individuality of each specific BMSL and the universality shared across the whole category space. Via CDS-LMKL, both the optimal resolution for spatial layouts and the final classifier can be obtained efficiently and jointly. We evaluate BT-Gist on four natural scene databases and one cluttered indoor scene database against a range of comparisons: from different MKL methods to various biologically inspired models and BoF-based computer vision models. CDS-LMKL yields better results than several existing MKL algorithms. Given the two biological attributes that the framework has to follow, BT-Gist, despite its holistic nature, outperforms existing biologically inspired models and BoF-based computer vision models in natural scene classification, and competes with the object-segmentation-based ROI-Gist in cluttered indoor scene classification.
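
To make the idea of a "local combination of kernels" concrete, the following is a minimal sketch of the generic LMKL kernel, K[i, j] = sum_m eta_m(x_i) k_m(x_i, x_j) eta_m(x_j), with a softmax gating function. It is not the paper's CDS kernel, whose form the abstract does not give. The random features standing in for BMSLs, the feature dimensions, the choice of gating input, the RBF base kernels, and all function names are assumptions made for illustration only.

```python
import numpy as np

def rbf_kernel(X, Y, gamma):
    """Gaussian (RBF) kernel matrix between the rows of X and the rows of Y."""
    sq = (X ** 2).sum(1)[:, None] + (Y ** 2).sum(1)[None, :] - 2.0 * X @ Y.T
    return np.exp(-gamma * np.maximum(sq, 0.0))

def softmax_gating(G, V, v0):
    """Per-sample kernel weights: eta[i, m] = softmax over m of (<v_m, g_i> + v0_m)."""
    scores = G @ V.T + v0
    scores = scores - scores.max(axis=1, keepdims=True)  # numerical stability
    e = np.exp(scores)
    return e / e.sum(axis=1, keepdims=True)

def locally_combined_kernel(feats_a, feats_b, eta_a, eta_b, gammas):
    """K[i, j] = sum_m eta_a[i, m] * k_m(a_i, b_j) * eta_b[j, m]."""
    K = np.zeros((eta_a.shape[0], eta_b.shape[0]))
    for m, (A, B) in enumerate(zip(feats_a, feats_b)):
        K += eta_a[:, [m]] * rbf_kernel(A, B, gammas[m]) * eta_b[:, [m]].T
    return K

# Hypothetical stand-ins for three resolutions of spatial-layout features.
rng = np.random.default_rng(0)
n = 6
dims = [4, 16, 64]                       # assumed feature sizes, coarse to fine
feats = [rng.normal(size=(n, d)) for d in dims]

G = feats[0]                             # gating input (an assumption)
V = rng.normal(size=(len(dims), G.shape[1]))  # gating parameters; learned in real LMKL
v0 = np.zeros(len(dims))
eta = softmax_gating(G, V, v0)           # shape (n, number of resolutions)

gammas = [1.0 / d for d in dims]         # assumed RBF bandwidths
K = locally_combined_kernel(feats, feats, eta, eta, gammas)
```

In full LMKL the gating parameters are learned jointly with the SVM rather than drawn at random as they are here. The resulting matrix K is symmetric and positive semidefinite, so it could in principle be handed to any SVM solver that accepts a precomputed kernel.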