Multiple Bernoulli relevance models for image and video annotation

Authors:
S. L. Feng;R. Manmatha;V. Lavrenko
Affiliations:
Center for Intelligent Information Retrieval, University of Massachusetts, Amherst, MA;Center for Intelligent Information Retrieval, University of Massachusetts, Amherst, MA;Center for Intelligent Information Retrieval, University of Massachusetts, Amherst, MA
Venue:
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Year:
2004

Citing 9
Cited 196

Example-Based Learning for View-Based Human Face Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning a Sparse Representation for Object Detection

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Automatic image annotation and retrieval using cross-media relevance models

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Modeling annotated data

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
A statistical approach to 3d object detection applied to faces and cars

A statistical approach to 3d object detection applied to faces and cars
Matching words and pictures

The Journal of Machine Learning Research
Why can't José read?: the problem of learning semantic associations in a robot environment

HLT-NAACL-LWM '04 Proceedings of the HLT-NAACL 2003 workshop on Learning word meaning from non-linguistic data - Volume 6

Hidden Markov models for automatic annotation and content-based retrieval of images and video

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Evaluating the impact of selection noise in community-based web search

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A database centric view of semantic image annotation and retrieval

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Joint visual-text modeling for automatic retrieval of multimedia documents

Proceedings of the 13th annual ACM international conference on Multimedia
Word sense disambiguation with pictures

Artificial Intelligence - Special volume on connecting language to the world
An adaptive graph model for automatic image annotation

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Scalable search-based image annotation of personal images

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Image annotation refinement using random walk with restarts

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Automatic video annotation by semi-supervised learning with kernel density estimation

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Toward bridging the annotation-retrieval gap in image search by a generative modeling approach

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Semantic Modeling of Natural Scenes for Content-Based Image Retrieval

International Journal of Computer Vision
Supervised Learning of Semantic Classes for Image Annotation and Retrieval

IEEE Transactions on Pattern Analysis and Machine Intelligence
Towards musical query-by-semantic-description using the CAL500 data set

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Evaluating the performance in automatic image annotation: Example case by adaptive fusion of global image features

Image Communication
Enhanced max margin learning on multimodal data mining in a multimedia database

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Automatic image annotation by an iterative approach: incorporating keyword correlations and region matching

Proceedings of the 6th ACM international conference on Image and video retrieval
Semantic facets: an in-depth analysis of a semantic image retrieval system

Proceedings of the 6th ACM international conference on Image and video retrieval
Using multiple segmentations for image auto-annotation

Proceedings of the 6th ACM international conference on Image and video retrieval
Information-theoretic semantic multimedia indexing

Proceedings of the 6th ACM international conference on Image and video retrieval
A learning state-space model for image retrieval

EURASIP Journal on Applied Signal Processing
Enhancing image annotation by integrating concept ontology and text-based bayesian learning model

Proceedings of the 15th international conference on Multimedia
Tagging over time: real-world image annotation by lightweight meta-learning

Proceedings of the 15th international conference on Multimedia
Bipartite graph reinforcement model for web image annotation

Proceedings of the 15th international conference on Multimedia
Dual cross-media relevance model for image annotation

Proceedings of the 15th international conference on Multimedia
Structure-sensitive manifold ranking for video concept detection

Proceedings of the 15th international conference on Multimedia
Optimizing multi-graph learning: towards a unified video annotation scheme

Proceedings of the 15th international conference on Multimedia
Modeling Semantic Aspects for Cross-Media Image Indexing

IEEE Transactions on Pattern Analysis and Machine Intelligence
Latent semantic fusion model for image retrieval and annotation

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Translating topics to words for image annotation

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
A graph-based image annotation framework

Pattern Recognition Letters
Fast image auto-annotation with discretized feature distance measures

Machine Graphics & Vision International Journal
Automatic medical image annotation and retrieval

Neurocomputing
Automatic image annotation via local multi-label classification

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Semantic spaces revisited: investigating the performance of auto-annotation and semantic retrieval using semantic spaces

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
A discrete direct retrieval model for image and video retrieval

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Learning to reduce the semantic gap in web image retrieval and annotation

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Resulted word counts optimization-A new approach for better automatic image annotation

Pattern Recognition
A survey of browsing models for content based image retrieval

Multimedia Tools and Applications
Automatic Image Annotation Using a Visual Dictionary Based on Reliable Image Segmentation

Adaptive Multimedial Retrieval: Retrieval, User, and Semantics
Combining global, regional and contextual features for automatic image annotation

Pattern Recognition
Exploring multimedia in a keyword space

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Annotating personal albums via web mining

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Multi-progressive model for web image annotation

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Automatic image tagging as a random walk with priors on the canonical correlation subspace

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Image retrieval using query by contextual example

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Distributed image search in camera sensor networks

Proceedings of the 6th ACM conference on Embedded network sensor systems
Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Classification and Automatic Annotation Extension of Images Using Bayesian Network

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Semi-supervised kernel density estimation for video annotation

Computer Vision and Image Understanding
Crossing textual and visual content in different application scenarios

Multimedia Tools and Applications
High-Performance Image Annotation and Retrieval for Weakly Labeled Images Using Latent Space Learning

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Web-Scale Image Annotation

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Adaptive Model for Integrating Different Types of Associated Texts for Automated Annotation of Web Images

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Region-based image retrieval using color-size features of watershed regions

Journal of Visual Communication and Image Representation
TSVM-HMM: Transductive SVM based hidden Markov model for automatic image annotation

Expert Systems with Applications: An International Journal
Automatic Web Image Annotation via Web-Scale Image Semantic Space Learning

APWeb/WAIM '09 Proceedings of the Joint International Conferences on Advances in Data and Web Management
Bayesian Mixture Hierarchies for Automatic Image Annotation

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Time warp football

Proceedings of the seventh european conference on European interactive television conference
Video semantic analysis based on structure-sensitive anisotropic manifold ranking

Signal Processing
Transductive Multi-Instance Multi-Label learning algorithm with application to automatic image annotation

Expert Systems with Applications: An International Journal
Canonical contextual distance for large-scale image annotation and retrieval

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Label to region by bi-layer sparsity priors

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Unified video annotation via multigraph learning

IEEE Transactions on Circuits and Systems for Video Technology
Word sense disambiguation with pictures

Artificial Intelligence - Special volume on connecting language to the world
Integrating spatial and color information in images using a statistical framework

Expert Systems with Applications: An International Journal
Global annotation on georeferenced photographs

Proceedings of the ACM International Conference on Image and Video Retrieval
Context-based multi-label image annotation

Proceedings of the ACM International Conference on Image and Video Retrieval
Style modeling for tagging personal photo collections

Proceedings of the ACM International Conference on Image and Video Retrieval
Exploring Flickr's related tags for semantic annotation of web images

Proceedings of the ACM International Conference on Image and Video Retrieval
Image categorization via robust pLSA

Pattern Recognition Letters
Generalized Relevance Models for Automatic Image Annotation

PCM '09 Proceedings of the 10th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Learning image semantics with latent aspect model

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Image annotation and retrieval based on efficient learning of contextual latent space

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
On cross-language image annotations

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Towards developing a unified multimodal image retrieval framework

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Robust image annotation refinement via graph-based learning

CCDC'09 Proceedings of the 21st annual international conference on Chinese control and decision conference
On the use of anti-word models for audio music annotation and retrieval

IEEE Transactions on Audio, Speech, and Language Processing
Investigating visual feature extraction methods for image annotation

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Improved Resulted Word Counts Optimizer for Automatic Image Annotation Problem

Fundamenta Informaticae - Advances in Artificial Intelligence and Applications
CLOVIS: towards precision-oriented text-based video retrieval through the unification of automatically-extracted concepts and relations of the visual and audio/speech contents

Journal of Intelligent Information Systems
Image annotation with tagprop on the MIRFLICKR set

Proceedings of the international conference on Multimedia information retrieval
Large Scale Online Learning of Image Similarity Through Ranking

The Journal of Machine Learning Research
Semantic feature selection for object discovery in high-resolution remote sensing imagery

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
MAP-based image tag recommendation using a visual folksonomy

Pattern Recognition Letters
Multi-modal multi-label semantic indexing of images based on hybrid ensemble learning

PCM'07 Proceedings of the multimedia 8th Pacific Rim conference on Advances in multimedia information processing
Empirical investigations on benchmark tasks for automatic image annotation

VISUAL'07 Proceedings of the 9th international conference on Advances in visual information systems
A new model for image annotation

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
Semantic video annotation by mining association patterns from visual and speech features

PAKDD'08 Proceedings of the 12th Pacific-Asia conference on Advances in knowledge discovery and data mining
A system that learns to tag videos by watching youtube

ICVS'08 Proceedings of the 6th international conference on Computer vision systems
Web image annotation based on automatically obtained noisy training set

APWeb'08 Proceedings of the 10th Asia-Pacific web conference on Progress in WWW research and development
Modeling, classifying and annotating weakly annotated images using Bayesian network

Journal of Visual Communication and Image Representation
Multi-label learning by Image-to-Class distance for scene classification and image annotation

Proceedings of the ACM International Conference on Image and Video Retrieval
Image retrieval using Markov Random Fields and global image features

Proceedings of the ACM International Conference on Image and Video Retrieval
Beyond tag relevance: integrating visual attention model and multi-instance learning for tag saliency ranking

Proceedings of the ACM International Conference on Image and Video Retrieval
A spectral method for context based disambiguation of image annotations

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Modeling latent aspects for automatic image annotation

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Automatic tag expansion using visual similarity for photo sharing websites

Multimedia Tools and Applications
Baselines for Image Annotation

International Journal of Computer Vision
Short Communication: A multimedia retrieval framework highlighting agents and coordinating their interactions to address the semantic gap

Expert Systems with Applications: An International Journal
An information-theoretic framework for semantic-multimedia retrieval

ACM Transactions on Information Systems (TOIS)
Visual information in semantic representation

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Topic models for image annotation and text illustration

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
How many words is a picture worth? Automatic caption generation for news images

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Fusing semantic aspects for image annotation and retrieval

Journal of Visual Communication and Image Representation
Unified tag analysis with multi-edge graph

Proceedings of the international conference on Multimedia
A new approach to cross-modal multimedia retrieval

Proceedings of the international conference on Multimedia
Context dependent SVMs for interconnected image network annotation

Proceedings of the international conference on Multimedia
Image annotation using multi-correlation probabilistic matrix factorization

Proceedings of the international conference on Multimedia
Concept detector refinement using social videos

Proceedings of the international workshop on Very-large-scale multimedia corpus, mining and retrieval
Time warp sports for internet television

ACM Transactions on Computer-Human Interaction (TOCHI)
Visual query expansion via incremental hypernetwork models of image and text

PRICAI'10 Proceedings of the 11th Pacific Rim international conference on Trends in artificial intelligence
Discovering phrase-level lexicon for image annotation

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Tag refinement in an image folksonomy using visual similarity and tag co-occurrence statistics

Image Communication
PATSI: photo annotation through finding similar images with multivariate Gaussian models

ICCVG'10 Proceedings of the 2010 international conference on Computer vision and graphics: Part II
Video annotation using hierarchical Dirichlet process mixture model

Expert Systems with Applications: An International Journal
Modeling continuous visual features for semantic image annotation and retrieval

Pattern Recognition Letters
Design and implementation of a system for finding appropriate tags to photos in Flickr from Web browsing behaviour

International Journal of Web and Grid Services
Image annotation with concept level feature using PLSA+CCA

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II
Leveraging auxiliary text terms for automatic image annotation

Proceedings of the 20th international conference companion on World wide web
Context-based support vector machines for interconnected image annotation

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part I
An HMM-SVM-based automatic image annotation approach

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part IV
ATLAS: a probabilistic algorithm for high dimensional similarity search

Proceedings of the 2011 ACM SIGMOD International Conference on Management of data
Graph-based methods for the automatic annotation and retrieval of art prints

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Effective term weighting in ALT text prediction for web image retrieval

APWeb'11 Proceedings of the 13th Asia-Pacific web conference on Web technologies and applications
Image similarities on the basis of visual content: an attempt to bridge the semantic gap

ACIIDS'11 Proceedings of the Third international conference on Intelligent information and database systems - Volume Part I
Semantics extraction from images

Knowledge-driven multimedia information extraction and ontology evolution
Mining partially annotated images

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
A review on automatic image annotation techniques

Pattern Recognition
Automatic Image Annotation Based on Generalized Relevance Models

Journal of Signal Processing Systems
Automated image annotation system based on an open source object database

IDEAL'11 Proceedings of the 12th international conference on Intelligent data engineering and automated learning
Two-probabilistic latent semantic model for image annotation and retrieval

ACCV'10 Proceedings of the 2010 international conference on Computer vision - Volume Part I
Automatic image tagging based on regions of interest

AICI'11 Proceedings of the Third international conference on Artificial intelligence and computational intelligence - Volume Part I
Tag recommendation for georeferenced photos

Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Location-Based Social Networks
Learning "verb-object" concepts for semantic image annotation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Ensemble approach based on conditional random field for multi-label image and video annotation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Fusion of region and image-based techniques for automatic image annotation

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Automatic refinement of keyword annotations for web image search

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Mining multiple visual appearances of semantics for image annotation

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Automatic video annotation and retrieval based on bayesian inference

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
A stratification-based approach to accurate and fast image annotation

WAIM'05 Proceedings of the 6th international conference on Advances in Web-Age Information Management
Automated image annotation using global features and robust nonparametric density estimation

CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Incorporating prior knowledge into multi-label boosting for cross-modal image annotation and retrieval

AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Using manual and automated annotations to search images by semantic similarity

Multimedia Tools and Applications
A framework for evaluating automatic image annotation algorithms

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Logistic regression of generic codebooks for semantic image retrieval

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Query by semantic example

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Recognizing objects and scenes in news videos

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Semi-supervised learning for image annotation based on conditional random fields

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Automatic annotation and retrieval for videos

PSIVT'06 Proceedings of the First Pacific Rim conference on Advances in Image and Video Technology
Automatic image annotation with cooperation of concept-specific and universal visual vocabularies

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Tag recommendation for flickr using web browsing behavior

ICCSA'10 Proceedings of the 2010 international conference on Computational Science and Its Applications - Volume Part II
Putting the user in the loop: visual resource discovery

AMR'05 Proceedings of the Third international conference on Adaptive Multimedia Retrieval: user, context, and feedback
A Probabilistic Model to Combine Tags and Acoustic Similarity for Music Retrieval

ACM Transactions on Information Systems (TOIS)
Combining image-level and segment-level models for automatic annotation

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
A novel multi-modal integration and propagation model for cross-media information retrieval

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
An annotation rule extraction algorithm for image retrieval

Pattern Recognition Letters
Combining visual attention model with multi-instance learning for tag ranking

Neurocomputing
Learning to summarize web image and text mutually

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Assistive tagging: A survey of multimedia tagging with human-computer joint exploration

ACM Computing Surveys (CSUR)
Collaborative visual modeling for automatic image annotation via sparse model coding

Neurocomputing
Accessing learning resources described in semantically enriched weblogs

International Journal of Metadata, Semantics and Ontologies
Fast Structured Prediction Using Large Margin Sigmoid Belief Networks

International Journal of Computer Vision
Improved Resulted Word Counts Optimizer for Automatic Image Annotation Problem

Fundamenta Informaticae - Advances in Artificial Intelligence and Applications
Movie keyframe retrieval based on cross-media correlation detection and context model

IEA/AIE'12 Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence
Label-to-region with continuity-biased bi-layer sparsity priors

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Multi-view learning from imperfect tagging

Proceedings of the 20th ACM international conference on Multimedia
Image annotation by semantic sparse recoding of visual content

Proceedings of the 20th ACM international conference on Multimedia
Interactive tool for image annotation using a semi-supervised and hierarchical approach

Computer Standards & Interfaces
Automatic image annotation using tag-related random search over visual neighbors

Proceedings of the 21st ACM international conference on Information and knowledge management
Semantic context learning with large-scale weakly-labeled image set

Proceedings of the 21st ACM international conference on Information and knowledge management
An efficient two-stage framework for image annotation

Pattern Recognition
Annotation propagation in large image databases via dense image correspondence

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Image annotation using metric learning in semantic neighbourhoods

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Random forest for image annotation

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Labeling images by integrating sparse multiple distance learning and semantic context modeling

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part IV
A novel image annotation feedback model based on internet-search

WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Translating related words to videos and back through latent topics

Proceedings of the sixth ACM international conference on Web search and data mining
K-Nearest Neighbors Relevance Annotation Model for Distance Education

International Journal of Distance Education Technologies
An interactive semi-supervised approach for automatic image annotation

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Nonlinear matrix factorization with unified embedding for social tag relevance learning

Neurocomputing
A spatio-temporal pyramid matching for video retrieval

Computer Vision and Image Understanding
Unsupervised language learning for discovered visual concepts

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part IV
MLRank: Multi-correlation Learning to Rank for image annotation

Pattern Recognition
Real web community based automatic image annotation

Computers and Electrical Engineering
Automatic image annotation using semantic relevance

Proceedings of the Fifth International Conference on Internet Multimedia Computing and Service
Towards efficient sparse coding for scalable image annotation

Proceedings of the 21st ACM international conference on Multimedia
Zero-shot video retrieval using content and concepts

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Correlation consistency constrained probabilistic matrix factorization for social tag refinement

Neurocomputing
A feature-word-topic model for image annotation and retrieval

ACM Transactions on the Web (TWEB)
Human computation: Image metadata acquisition based on a single-player annotation game

International Journal of Human-Computer Studies
Learning semantic concepts from image database with hybrid generative/discriminative approach

Engineering Applications of Artificial Intelligence
Multi-view embedding learning for incompletely labeled data

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Nonparametric bayesian upstream supervised multi-modal topic models

Proceedings of the 7th ACM international conference on Web search and data mining
ObjectPatchNet: Towards scalable and semantic image annotation and retrieval

Computer Vision and Image Understanding
Effective automatic image annotation via integrated discriminative and generative models

Information Sciences: an International Journal
Automatic annotation of image databases based on implicit crowdsourcing, visual concept modeling and evolution

Multimedia Tools and Applications

Quantified Score

Hi-index	0.01

Visualization

Abstract

Retrieving images in response to textual queries requires some knowledge of the semantics of the picture. Here, we show how we can do both automatic image annotation and retrieval (using one word queries) from images and videos using a multiple Bernoulli relevance model. The model assumes that a training set of images or videos along with keyword annotations is provided. Multiple keywords are provided for an image and the specific correspondence between a keyword and an image is not provided. Each image is partitioned into a set of rectangular regions and a real-valued feature vector is computed over these regions. The relevance model is a joint probability distribution of the word annotations and the image feature vectors and is computed using the training set. The word probabilities are estimated using a multiple Bernoulli model and the image feature probabilities using a non-parametric kernel density estimate. The model is then used to annotate images in a test set. We show experiments on both images from a standard Corel data set and a set of video key frames from NIST's Video Trec. Comparative experiments show that the model performs better than a model based on estimating word probabilities using the popular multinomial distribution. The results also show that our model significantly outperforms previously reported results on the task of image and video annotation.