Efficient lp-norm multiple feature metric learning for image categorization

  • Authors:
  • Shuhui Wang;Qingming Huang;Shuqiang Jiang;Qi Tian

  • Affiliations:
  • Key Lab of Intell. Info. Process., Inst. of Comput. Tech., CAS, Beijing, 100190, China, Beijing, China;Graduate University, Chinese Academy of Sciences, Beijing, China;Key Lab of Intell. Info. Process., Inst. of Comput. Tech., CAS, Beijing, 100190, China, Beijing, China;Dept. of Computer Science, Univ. of Texas at San Antonio, San Antonio, TX, USA

  • Venue:
  • Proceedings of the 20th ACM international conference on Information and knowledge management
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Previous metric learning approaches are only able to learn the metric based on single concatenated multivariate feature representation. However, for many real world problems with multiple feature representation such as image categorization, the model trained by previous approaches will degrade because of sparsity brought by significant dimension growth and uncontrolled influence from each feature channel. In this paper, we propose an efficient distance metric learning model which adapts Distance Metric Learning on multiple feature representations. The aim is to learn the Mahalanobis matrices for each independent feature and their non-sparse lp-norm weight coefficients simultaneously by maximizing the margin of the overall learned distance metric among the pairs from the same class and the distance of pairs from different classes. We further extend this method to nonlinear kernel learning and category specific metric learning, which demonstrate the applicability of using many existing kernels for image data and exploring the hierarchical semantic structures for large scale image datasets. Experiments on various datasets demonstrate the promising power of our method.