PLSA-based image auto-annotation: constraining the latent space

Authors:
Florent Monay; Daniel Gatica-Perez
Affiliations:
IDIAP Research Institute, Martigny, Switzerland (both authors)
Venue:
Proceedings of the 12th annual ACM international conference on Multimedia
Year:
2004

Abstract

We address the problem of unsupervised image auto-annotation with probabilistic latent space models. Unlike most previous work, which builds latent space representations assuming equal relevance for the text and visual modalities, we propose a new way of modeling multi-modal co-occurrences: the definition of the latent space is constrained to ensure its consistency in semantic terms (words), while the ability to jointly model visual information is retained. The concept is implemented by a linked pair of Probabilistic Latent Semantic Analysis (PLSA) models. On a 16,000-image collection, we show with extensive experiments that our approach significantly outperforms previous joint models.
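
As an illustration only, the "linked pair" of PLSA models can be sketched as follows. The sketch rests on assumptions that the abstract does not spell out: a first PLSA is fit on caption words alone, its per-image topic mixtures P(z|d) are then held fixed while a second PLSA estimates P(v|z) for quantized visual terms, and a new image is annotated by folding in its visual terms and ranking words by P(w|d) = sum over z of P(w|z) P(z|d). The function names (`plsa_em`, `fold_in`, `annotate`), the toy count matrices, and all parameter values are hypothetical.

```python
import numpy as np


def _normalize(a, axis):
    """Scale a non-negative array so that it sums to 1 along `axis`."""
    return a / np.maximum(a.sum(axis=axis, keepdims=True), 1e-12)


def plsa_em(counts, n_topics, p_z_d=None, n_iter=50, seed=0):
    """EM for PLSA on a (docs x terms) count matrix.

    If `p_z_d` is given it is held fixed and only P(term|z) is estimated
    (assumed here to be how the two models are linked).
    Returns (p_t_z, p_z_d) with shapes (topics, terms) and (docs, topics).
    """
    rng = np.random.default_rng(seed)
    fixed = p_z_d is not None
    if not fixed:
        p_z_d = _normalize(rng.random((counts.shape[0], n_topics)), axis=1)
    p_t_z = _normalize(rng.random((n_topics, counts.shape[1])), axis=1)
    for _ in range(n_iter):
        # E-step: P(z|d,t) proportional to P(z|d) P(t|z); shape (docs, topics, terms)
        post = _normalize(p_z_d[:, :, None] * p_t_z[None, :, :], axis=1)
        weighted = counts[:, None, :] * post
        # M-step: re-estimate P(t|z), and P(z|d) unless it is constrained
        p_t_z = _normalize(weighted.sum(axis=0), axis=1)
        if not fixed:
            p_z_d = _normalize(weighted.sum(axis=2), axis=1)
    return p_t_z, p_z_d


def fold_in(counts, p_t_z, n_iter=50, seed=0):
    """Estimate P(z|d) for unseen documents with P(term|z) kept fixed."""
    rng = np.random.default_rng(seed)
    p_z_d = _normalize(rng.random((counts.shape[0], p_t_z.shape[0])), axis=1)
    for _ in range(n_iter):
        post = _normalize(p_z_d[:, :, None] * p_t_z[None, :, :], axis=1)
        p_z_d = _normalize((counts[:, None, :] * post).sum(axis=2), axis=1)
    return p_z_d


def annotate(visual_counts, p_w_z, p_v_z, top_k=2):
    """Rank vocabulary words for images described only by visual terms."""
    p_z_d = fold_in(visual_counts, p_v_z)
    p_w_d = p_z_d @ p_w_z  # P(w|d) = sum_z P(z|d) P(w|z)
    return np.argsort(-p_w_d, axis=1)[:, :top_k], p_w_d


# Toy data (hypothetical): 4 images, 4 caption words, 5 visual terms.
words = np.array([[3, 1, 0, 0], [2, 2, 0, 0], [0, 0, 3, 1], [0, 0, 1, 3]], dtype=float)
visual = np.array([[4, 2, 0, 0, 1], [3, 3, 0, 1, 0], [0, 1, 4, 2, 0], [1, 0, 2, 4, 0]], dtype=float)

# Model 1: the latent space is defined by words only.
p_w_z, p_z_d = plsa_em(words, n_topics=2)
# Model 2: visual terms are explained within that fixed latent space.
p_v_z, _ = plsa_em(visual, n_topics=2, p_z_d=p_z_d)
# Annotation of an image from its visual terms alone.
top, p_w_d = annotate(visual[:1], p_w_z, p_v_z)
```

Holding P(z|d) fixed in the second fit is what keeps the topics anchored to word co-occurrences in this sketch; letting both modalities update P(z|d) would instead correspond to the equal-relevance joint modeling that the abstract contrasts against.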