A multi-scale learning framework for visual categorization

Authors:
Shao-Chuan Wang;Yu-Chiang Frank Wang
Affiliations:
Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan;Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan
Venue:
ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part I
Year:
2010

Citing 20
Cited 1

Learning the Kernel Matrix with Semidefinite Programming

The Journal of Machine Learning Research
Multiple kernel learning, conic duality, and the SMO algorithm

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Shape Matching and Object Recognition Using Low Distortion Correspondences

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Learning a Similarity Metric Discriminatively, with Application to Face Verification

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Creating Efficient Codebooks for Visual Recognition

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Learning a kernel function for classification with small training samples

ICML '06 Proceedings of the 23rd international conference on Machine learning
Multiclass Object Recognition with Sparse, Localized Features

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
A Visual Vocabulary for Flower Classification

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories

Computer Vision and Image Understanding
Large Scale Multiple Kernel Learning

The Journal of Machine Learning Research
Self-taught learning: transfer learning from unlabeled data

Proceedings of the 24th international conference on Machine learning
More efficiency in multiple kernel learning

Proceedings of the 24th international conference on Machine learning
Online dictionary learning for sparse coding

ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
Distance Metric Learning for Large Margin Nearest Neighbor Classification

The Journal of Machine Learning Research
An efficient algorithm for local distance metric learning

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Sparse Multiple Kernel Learning for Signal Processing Applications

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning distance functions for image retrieval

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition

Hierarchical spatial matching kernel for image categorization

ICIAR'11 Proceedings of the 8th international conference on Image analysis and recognition - Volume Part I

Quantified Score

Hi-index	0.00

Visualization

Abstract

Spatial pyramid matching has recently become a promising technique for image classification. Despite its success and popularity, no prior work has tackled the problem of learning the optimal spatial pyramid representation for the given image data and the associated object category. We propose a Multiple Scale Learning (MSL) framework to learn the best weights for each scale in the pyramid. Our MSL algorithm would produce class-specific spatial pyramid image representations and thus provide improved recognition performance. We approach the MSL problem as solving a multiple kernel learning (MKL) task, which defines the optimal combination of base kernels constructed at different pyramid levels. A wide range of experiments on Oxford flower and Caltech- 101 datasets are conducted, including the use of state-of-the-art feature encoding and pooling strategies. Finally, excellent empirical results reported on both datasets validate the feasibility of our proposed method.