Object and Action Classification with Latent Window Parameters

Authors:
Hakan Bilen;Vinay P. Namboodiri;Luc J. Gool
Affiliations:
ESAT-PSI/iMinds, Ku Leuven, Heverlee, Belgium 3001;Alcatel-Lucent Bell Labs, Antwerp, Belgium 2018;ESAT-PSI/iMinds, Ku Leuven, Heverlee, Belgium 3001 and Computer Vision Laboratory, ETH Zürich, Zurich, Switzerland 8092
Venue:
International Journal of Computer Vision
Year:
2014

Citing 23
Cited 1

The concave-convex procedure

Neural Computation
Object Recognition from Local Scale-Invariant Features

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Space-time Interest Points

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Support vector machine learning for interdependent and structured output spaces

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Recognizing Human Actions: A Local SVM Approach

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
A support vector method for multivariate performance measures

ICML '05 Proceedings of the 22nd international conference on Machine learning
Learning structured prediction models: a large margin approach

ICML '05 Proceedings of the 22nd international conference on Machine learning
Generic Object Recognition with Boosting

IEEE Transactions on Pattern Analysis and Machine Intelligence
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
A support vector method for optimizing average precision

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Object categorization

Foundations and Trends® in Computer Graphics and Vision
Curriculum learning

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Learning structural SVMs with latent variables

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
The Pascal Visual Object Classes (VOC) Challenge

International Journal of Computer Vision
Object Detection with Discriminatively Trained Part-Based Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
Modeling the temporal extent of actions

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Improving the fisher kernel for large-scale image classification

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Image classification using super-vector coding of local image descriptors

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Complex loss optimization via dual decomposition

CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Discriminative spatial saliency for image classification

CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Similarity constrained latent support vector machine: an application to weakly supervised action classification

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VII

Objects, Actions, Places

International Journal of Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper we propose a generic framework to incorporate unobserved auxiliary information for classifying objects and actions. This framework allows us to automatically select a bounding box and its quadrants from which best to extract features. These spatial subdivisions are learnt as latent variables. The paper is an extended version of our earlier work Bilen et al. (Proceedings of The British Machine Vision Conference, 2011), complemented with additional ideas, experiments and analysis. We approach the classification problem in a discriminative setting, as learning a max-margin classifier that infers the class label along with the latent variables. Through this paper we make the following contributions: (a) we provide a method for incorporating latent variables into object and action classification; (b) these variables determine the relative focus on foreground versus background information that is taken account of; (c) we design an objective function to more effectively learn in unbalanced data sets; (d) we learn a better classifier by iterative expansion of the latent parameter space. We demonstrate the performance of our approach through experimental evaluation on a number of standard object and action recognition data sets.