Multi modal semantic indexing for image retrieval

Authors:
Pulla Chandrika;C. V. Jawahar
Affiliations:
International Institute of Information Technology, Hyderabad, India;International Institute of Information Technology, Hyderabad, India
Venue:
Proceedings of the ACM International Conference on Image and Video Retrieval
Year:
2010

Citing 23
Cited 2

A Multilinear Singular Value Decomposition

SIAM Journal on Matrix Analysis and Applications
Content-Based Image Retrieval at the End of the Early Years

IEEE Transactions on Pattern Analysis and Machine Intelligence
Unsupervised learning by probabilistic latent semantic analysis

Machine Learning
Multilinear Analysis of Image Ensembles: TensorFaces

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part I
Latent dirichlet allocation

The Journal of Machine Learning Research
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Multimodal Video Indexing: A Review of the State-of-the-art

Multimedia Tools and Applications
MMSS: Multi-Modal Story-Oriented Video Summarization

ICDM '04 Proceedings of the Fourth IEEE International Conference on Data Mining
A Probabilistic Semantic Model for Image Annotation and Multi-Modal Image Retrieva

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Modeling Scenes with Local Descriptors and Latent Aspects

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Multi-graph enabled active learning for multimodal web image retrieval

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Content-based multimedia information retrieval: State of the art and challenges

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Scalable Recognition with a Vocabulary Tree

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Handwritten digit classification using higher order singular value decomposition

Pattern Recognition
Video search by multi-modal and clustering analysis

Proceedings of the 6th ACM international conference on Image and video retrieval
Latent semantic fusion model for image retrieval and annotation

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Scenique: a multimodal image retrieval interface

AVI '08 Proceedings of the working conference on Advanced visual interfaces
Learning to reduce the semantic gap in web image retrieval and annotation

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Feature Extraction for Document Image Segmentation by pLSA Model

DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
A New Baseline for Image Annotation

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
Scalable Tensor Decompositions for Multi-aspect Data Mining

ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Multilayer pLSA for multimodal image retrieval

Proceedings of the ACM International Conference on Image and Video Retrieval
Scene classification via pLSA

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part IV

A semantic model for cross-modal and multi-modal retrieval

Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
High order pLSA for indexing tagged images

Signal Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Popular image retrieval schemes generally rely only on a single mode, (either low level visual features or embedded text) for searching in multimedia databases. Many popular image collections (eg. those emerging over Internet) have associated tags, often for human consumption. A natural extension is to combine information from multiple modes for enhancing effectiveness in retrieval. In this paper, we propose two techniques: Multi-modal Latent Semantic Indexing (MMLSI) and Multi-Modal Probabilistic Latent Semantic Analysis (MMpLSA). These methods are obtained by directly extending their traditional single mode counter parts. Both these methods incorporate visual features and tags by generating simultaneous semantic contexts. The experimental results demonstrate an improved accuracy over other single and multi-modal methods.