A Multilinear Singular Value Decomposition
SIAM Journal on Matrix Analysis and Applications
Content-Based Image Retrieval at the End of the Early Years
IEEE Transactions on Pattern Analysis and Machine Intelligence
Unsupervised learning by probabilistic latent semantic analysis
Machine Learning
Multilinear Analysis of Image Ensembles: TensorFaces
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part I
The Journal of Machine Learning Research
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision
Multimodal Video Indexing: A Review of the State-of-the-art
Multimedia Tools and Applications
MMSS: Multi-Modal Story-Oriented Video Summarization
ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
A Probabilistic Semantic Model for Image Annotation and Multi-Modal Image Retrieva
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Modeling Scenes with Local Descriptors and Latent Aspects
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Multi-graph enabled active learning for multimodal web image retrieval
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Content-based multimedia information retrieval: State of the art and challenges
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Scalable Recognition with a Vocabulary Tree
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Handwritten digit classification using higher order singular value decomposition
Pattern Recognition
Video search by multi-modal and clustering analysis
Proceedings of the 6th ACM international conference on Image and video retrieval
Latent semantic fusion model for image retrieval and annotation
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Scenique: a multimodal image retrieval interface
AVI '08 Proceedings of the working conference on Advanced visual interfaces
Learning to reduce the semantic gap in web image retrieval and annotation
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Feature Extraction for Document Image Segmentation by pLSA Model
DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
A New Baseline for Image Annotation
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
Scalable Tensor Decompositions for Multi-aspect Data Mining
ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Multilayer pLSA for multimodal image retrieval
Proceedings of the ACM International Conference on Image and Video Retrieval
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV
A semantic model for cross-modal and multi-modal retrieval
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
High order pLSA for indexing tagged images
Signal Processing
Hi-index | 0.00 |
Popular image retrieval schemes generally rely only on a single mode, (either low level visual features or embedded text) for searching in multimedia databases. Many popular image collections (eg. those emerging over Internet) have associated tags, often for human consumption. A natural extension is to combine information from multiple modes for enhancing effectiveness in retrieval. In this paper, we propose two techniques: Multi-modal Latent Semantic Indexing (MMLSI) and Multi-Modal Probabilistic Latent Semantic Analysis (MMpLSA). These methods are obtained by directly extending their traditional single mode counter parts. Both these methods incorporate visual features and tags by generating simultaneous semantic contexts. The experimental results demonstrate an improved accuracy over other single and multi-modal methods.