Image classification by multimodal subspace learning

  • Authors:
  • Jun Yu;Feng Lin;Hock-Soon Seah;Cuihua Li;Ziyu Lin

  • Affiliations:
  • Computer Science Department, Xiamen University, Xiamen 361005, PR China;School of Computer Engineering, Nanyang Technological University, 639798 Singapore, Singapore;School of Computer Engineering, Nanyang Technological University, 639798 Singapore, Singapore;Computer Science Department, Xiamen University, Xiamen 361005, PR China;Computer Science Department, Xiamen University, Xiamen 361005, PR China

  • Venue:
  • Pattern Recognition Letters
  • Year:
  • 2012

Quantified Score

Hi-index 0.10

Visualization

Abstract

In recent years we witnessed a surge of interest in subspace learning for image classification. However, the previous methods lack of high accuracy since they do not consider multiple features of the images. For instance, we can represent a color image by finding a set of visual features to represent the information of its color, texture and shape. According to the ''Patch Alignment'' Framework, we developed a new subspace learning method, termed Semi-Supervised Multimodal Subspace Learning (SS-MMSL), in which we can encode different features from different modalities to build a meaningful subspace. In particular, the new method adopts the discriminative information from the labeled data to construct local patches and aligns these patches to get the optimal low dimensional subspace for each modality. For local patch construction, the data distribution revealed by unlabeled data is utilized to enhance the subspace learning. In order to find a low dimensional subspace wherein the distribution of each modality is sufficiently smooth, SS-MMSL adopts an alternating and iterative optimization algorithm to explore the complementary characteristics of different modalities. The iterative procedure reaches the global minimum of the criterion due to the strong convexity of the criterion. Our experiments of image classification and cartoon retrieval demonstrate the validity of the proposed method.