Learning image-to-class distance metric for image classification

Authors:
Zhengxiang Wang;Yiqun Hu;Liang-Tien Chia
Affiliations:
Nanyang Technological University, Singapore;The University of Western Australia, Crawley, WA, Australia;Nanyang Technological University, Singapore
Venue:
ACM Transactions on Intelligent Systems and Technology (TIST) - Special section on agent communication, trust in multiagent systems, intelligent tutoring and coaching systems
Year:
2013

Citing 19
Cited 0

Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
A Bayesian Hierarchical Model for Learning Natural Scene Categories

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Learning a Mahalanobis Metric from Equivalence Constraints

The Journal of Machine Learning Research
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Features

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Learning Distance Metrics with Contextual Constraints for Image Retrieval

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Information-theoretic metric learning

Proceedings of the 24th international conference on Machine learning
Scene Classification Using a Hybrid Generative/Discriminative Approach

IEEE Transactions on Pattern Analysis and Machine Intelligence
Speeded-Up Robust Features (SURF)

Computer Vision and Image Understanding
Distance Metric Learning for Large Margin Nearest Neighbor Classification

The Journal of Machine Learning Research
An efficient algorithm for local distance metric learning

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Robust distance metric learning with auxiliary knowledge

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Learning instance-to-class distance for human action recognition

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Transfer metric learning by learning task relationships

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Image-to-class distance metric learning for image classification

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Towards optimal naive bayes nearest neighbor

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Face and Human Gait Recognition Using Image-to-Class Distance

IEEE Transactions on Circuits and Systems for Video Technology

Quantified Score

Hi-index	0.00

Visualization

Abstract

Image-To-Class (I2C) distance is a novel distance used for image classification and has successfully handled datasets with large intra-class variances. However, it uses Euclidean distance for measuring the distance between local features in different classes, which may not be the optimal distance metric in real image classification problems. In this article, we propose a distance metric learning method to improve the performance of I2C distance by learning per-class Mahalanobis metrics in a large margin framework. Our I2C distance is adaptive to different classes by combining with the learned metric for each class. These multiple per-class metrics are learned simultaneously by forming a convex optimization problem with the constraints that the I2C distance from each training image to its belonging class should be less than the distances to other classes by a large margin. A subgradient descent method is applied to efficiently solve this optimization problem. For efficiency and scalability to large-scale problems, we also show how to simplify the method to learn a diagonal matrix for each class. We show in experiments that our learned Mahalanobis I2C distance can significantly outperform the original Euclidean I2C distance as well as other distance metric learning methods in several prevalent image datasets, and our simplified diagonal matrices can preserve the performance but significantly speed up the metric learning procedure for large-scale datasets. We also show in experiment that our method is able to correct the class imbalance problem, which usually leads the NN-based methods toward classes containing more training images.