Cross-Media semantics mining based on sparse canonical correlation analysis and relevance feedback

  • Authors:
  • Hong Zhang;Xiaoming Liu

  • Affiliations:
  • College of Computer Science & Technology, Wuhan University of Science & Technology, China;College of Computer Science & Technology, Wuhan University of Science & Technology, China

  • Venue:
  • PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Cross-media learning is a new hot topic in multimedia content analysis and retrieval. Because multimedia data of different modalities are heterogeneous in feature space and there exists the well-know semantic gap, one of the most challenging issues for cross-media learning is to mine underlying semantics and estimate cross-media correlation. In this paper we propose a cross-media semantics mining approach based on Sparse Canonical Correlation Analysis and relevance feedback. First, we analyze sparse canonical correlation between low-level feature matrices of different modalities in training stage, and construct a Multimodal Sparse Subspace where both canonical correlation and most meaningful features are preserved; then based on geometric distance in the subspace we estimate cross-media correlation and enable cross-media retrieval; also we provide long-term relevance feedback strategy for performance optimization. Our approach is tested with general multimedia data, including image, audio and text. Experiment and comparison results are encouraging and show that the performance of our approach is effective.