Laplacian Eigenmaps for dimensionality reduction and data representation
Neural Computation
Think globally, fit locally: unsupervised learning of low dimensional manifolds
The Journal of Machine Learning Research
Multimedia content processing through cross-modal association
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Unsupervised learning from a corpus for shape-based 3D model retrieval
MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
3D object retrieval using the 3D shape impact descriptor
Pattern Recognition
A robust digital audio watermarking based on statistics characteristics
Pattern Recognition
Ranking with local regression and global alignment for cross media retrieval
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Incremental Laplacian eigenmaps by preserving adjacent information between data points
Pattern Recognition Letters
3D model comparison using spatial structure circular descriptor
Pattern Recognition
Multi-modal Correlation Modeling and Ranking for Retrieval
PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Segmentation, indexing, and retrieval for environmental and natural sounds
IEEE Transactions on Audio, Speech, and Language Processing
Cross-media retrieval using query dependent search methods
Pattern Recognition
CEDD: color and edge directivity descriptor: a compact descriptor for image indexing and retrieval
ICVS'08 Proceedings of the 6th international conference on Computer vision systems
A 3D Shape Retrieval Framework Supporting Multimodal Queries
International Journal of Computer Vision
Evaluating Color Descriptors for Object and Scene Recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence
An approach to content-based image retrieval based on the Lucene search engine library
ECDL'10 Proceedings of the 14th European conference on Research and advanced technology for digital libraries
Location grounding in multimodal local search
International Conference on Multimodal Interfaces and the Workshop on Machine Learning for Multimodal Interaction
Nonlinear dimensionality reduction for efficient and effective audio similarity searching
Multimedia Tools and Applications
Manifold-ranking based retrieval using k-regular nearest neighbor graph
Pattern Recognition
Measuring multi-modality similarities via subspace learning for cross-media retrieval
PCM'06 Proceedings of the 7th Pacific Rim conference on Advances in Multimedia Information Processing
I-SEARCH: a unified framework for multimodal search and retrieval
The Future Internet
Hi-index | 0.01 |
In this paper, a unified framework for multimodal content retrieval is presented. The proposed framework supports retrieval of rich media objects as unified sets of different modalities (image, audio, 3D, video and text) by efficiently combining all monomodal heterogeneous similarities to a global one according to an automatic weighting scheme. Then, a multimodal space is constructed to capture the semantic correlations among multiple modalities. In contrast to existing techniques, the proposed method is also able to handle external multimodal queries, by embedding them to the already constructed multimodal space, following a space mapping procedure of a submanifold analysis. In our experiments with five real multimodal datasets, we show the superiority of the proposed approach against competitive methods.