Fusing inherent and external knowledge with nonlinear learning for cross-media retrieval

  • Authors:
  • Hong Zhang, Yun Liu, Zhigang Ma

  • Affiliations:
  • College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430081, China, and State Key Laboratory of Software Engineering, Wuhan University, Wuhan 430072, China
  • School of Electrical and Electronic Engineering, Nanyang Technological University, Nanyang Avenue, Singapore 639798, Singapore
  • Department of Information Engineering and Computer Science, University of Trento, Trento 38123, Italy

  • Venue:
  • Neurocomputing
  • Year:
  • 2013

Quantified Score

Hi-index 0.01

Abstract

Cross-media retrieval focuses on searching multimedia data of different modalities with content-based methods. However, most content-based methods are designed for retrieval within a single modality, such as image retrieval or audio retrieval. Although a few works have addressed cross-media retrieval, their performance is not yet satisfactory, and the potential of cross-media retrieval for boosting retrieval performance remains largely unexplored. Hence, in this paper we propose a novel cross-media retrieval approach for general multimedia data, such as images and audio. First, image and audio samples are mapped into an isomorphic feature subspace with a kernel-based method. Second, multimedia semantics is learned from inherent feature correlations by local linear regression, and a graph model is constructed to exploit external knowledge from relevance feedback. We then build a unified objective function that integrates the inherent and external learning results; solving this objective function yields a multimodal semantic space in which cross-media retrieval between images and audio is enabled. Extensive experiments validate the proposed method with encouraging results.
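The pipeline in the abstract (kernel mapping into an isomorphic subspace, semantic regression, a graph built from relevance feedback, and a unified objective) can be sketched roughly as follows. This is a minimal NumPy illustration, not the paper's actual formulation: the RBF kernel, the one-hot semantic targets, the same-label graph standing in for relevance feedback, and the closed-form objective ||KW - Y||² + α·tr(WᵀKLKW) + β·||W||² are all assumptions made for the sketch.

```python
import numpy as np

def rbf_kernel(X, Z, gamma=0.1):
    # Pairwise RBF kernel: exp(-gamma * ||x - z||^2)
    d = np.sum(X**2, 1)[:, None] + np.sum(Z**2, 1)[None, :] - 2 * X @ Z.T
    return np.exp(-gamma * d)

rng = np.random.default_rng(0)
n, d_img, d_aud, c = 40, 10, 6, 3          # toy sizes (assumed)
X_img = rng.normal(size=(n, d_img))        # image features
X_aud = rng.normal(size=(n, d_aud))        # audio features
labels = rng.integers(0, c, size=n)
Y = np.eye(c)[labels]                      # one-hot semantic targets (assumed)

# Step 1: kernel features map each modality into an isomorphic n-dim space.
K_img = rbf_kernel(X_img, X_img)
K_aud = rbf_kernel(X_aud, X_aud)

# Steps 2-3: a graph encoding "external knowledge"; here same-label pairs
# stand in for relevance-feedback links (an assumption).
Wg = (labels[:, None] == labels[None, :]).astype(float)
L = np.diag(Wg.sum(1)) - Wg                # graph Laplacian

def solve(K, Y, L, alpha=0.1, beta=0.1):
    # Closed-form minimizer of ||K W - Y||^2 + alpha tr(W^T K L K W) + beta ||W||^2.
    A = K.T @ K + alpha * K @ L @ K + beta * np.eye(len(K))
    return np.linalg.solve(A, K.T @ Y)

W_img = solve(K_img, Y, L)
W_aud = solve(K_aud, Y, L)

# Both modalities now project into one shared semantic space.
S_img = K_img @ W_img
S_aud = K_aud @ W_aud

# Cross-media retrieval: rank audio items by cosine similarity to an image query.
q = S_img[0] / np.linalg.norm(S_img[0])
sims = S_aud @ q / np.linalg.norm(S_aud, axis=1)
ranking = np.argsort(-sims)
```

The design point is that, once both modalities are regressed into the same semantic space, retrieval reduces to nearest-neighbor search there, regardless of which modality the query comes from.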