Matching words and pictures

Authors:
Kobus Barnard;Pinar Duygulu;David Forsyth;Nando de Freitas;David M. Blei;Michael I. Jordan
Affiliations:
Computer Science Department, University of Arizona, Tucson, AZ;Department of Computer Engineering, Middle East Technical University, Ankara, Turkey;Computer Science Division, University of California, Berkeley, CA;Department of Computer Science, University of British Columbia, Vancouver, B.C. V6T 1Z4, Canada;Computer Science Division, University of California, Berkeley, CA;Computer Science Division and Department of Statistics, University of California, Berkeley, CA
Venue:
The Journal of Machine Learning Research
Year:
2003

Citing 21
Cited 353

A shortest augmenting path algorithm for dense and sparse linear assignment problems

Computing
Extracting visual information from text: using captions to label faces in newspaper photographs

Extracting visual information from text: using captions to label faces in newspaper photographs
Visual semantics: extracting visual information from text accompanying pictures

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Foundations of statistical natural language processing

Foundations of statistical natural language processing
Normalized Cuts and Image Segmentation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition

Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
Computer Vision: A Modern Approach

Computer Vision: A Modern Approach
End-User Searching Challenges Indexing Practices inthe Digital Newspaper Photo Archive

Information Retrieval
Browse and Search Patterns in a Digital Image Database

Information Retrieval
Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying

IEEE Transactions on Pattern Analysis and Machine Intelligence
Finding Naked People

ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume II - Volume II
Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Multiple-Instance Learning for Natural Scene Classification

ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Combining Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web

CBAIVL '98 Proceedings of the IEEE Workshop on Content - Based Access of Image and Video Libraries
Name-It: Association of Face and Name in Video

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Pedestrian Detection Using Wavelet Templates

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Statistical Models for Co-occurrence Data

Statistical Models for Co-occurrence Data
WebSeer: An Image Search Engine for the World Wide Web

WebSeer: An Image Search Engine for the World Wide Web
Learning from ambiguity

Learning from ambiguity
The mathematics of statistical machine translation: parameter estimation

Computational Linguistics - Special issue on using large corpora: II
Hierarchical browsing and search of large image databases

IEEE Transactions on Image Processing

Automatic image annotation and retrieval using cross-media relevance models

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Modeling annotated data

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
On image auto-annotation with latent space models

MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Automatic image annotation by using concept-sensitive salient objects for image content representation

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic multimedia cross-modal correlation discovery

Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Image Categorization by Learning and Reasoning with Regions

The Journal of Machine Learning Research
The story picturing engine: finding elite images to illustrate a story using mutual reinforcement

Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval
A semi-naïve Bayesian method incorporating clustering with pair-wise constraints for auto image annotation

Proceedings of the 12th annual ACM international conference on Multimedia
PLSA-based image auto-annotation: constraining the latent space

Proceedings of the 12th annual ACM international conference on Multimedia
Multi-level annotation of natural scenes using dominant image components and semantic concepts

Proceedings of the 12th annual ACM international conference on Multimedia
Efficient propagation for face annotation in family albums

Proceedings of the 12th annual ACM international conference on Multimedia
Regularizing translation models for better automatic image annotation

Proceedings of the thirteenth ACM international conference on Information and knowledge management
Automatic image annotation and retrieval using subspace clustering algorithm

Proceedings of the 2nd ACM international workshop on Multimedia databases
Retrieving lightly annotated images using image similarities

Proceedings of the 2005 ACM symposium on Applied computing
Essential Latent Knowledge for Protein-Protein Interactions: Analysis by an Unsupervised Learning Approach

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Hidden Markov models for automatic annotation and content-based retrieval of images and video

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A database centric view of semantic image annotation and retrieval

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Mining images on semantics via statistical learning

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Using dual cascading learning frameworks for image indexing

VIP '05 Proceedings of the Pan-Sydney area workshop on Visual information processing
Exploiting a sensed environment to improve human-agent communication

Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Combining Sequence and Time Series Expression Data to Learn Transcriptional Modules

IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Image region entropy: a measure of "visualness" of web images associated with one concept

Proceedings of the 13th annual ACM international conference on Multimedia
Region based image annotation through multiple-instance learning

Proceedings of the 13th annual ACM international conference on Multimedia
Learning an image-word embedding for image auto-annotation on the nonlinear latent space

Proceedings of the 13th annual ACM international conference on Multimedia
Two-scale image retrieval with significant meta-information feedback

Proceedings of the 13th annual ACM international conference on Multimedia
Image annotations by combining multiple evidence & wordNet

Proceedings of the 13th annual ACM international conference on Multimedia
Similarity space projection for web image search and annotation

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Probabilistic web image gathering

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Semantic image classification with hierarchical feature subset selection

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
A mutual semantic endorsement approach to image retrieval and context provision

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Hybrid visual and conceptual image representation within active relevance feedback context

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Evaluation strategies for image understanding and retrieval

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Localized content based image retrieval

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Content-based image retrieval: approaches and trends of the new age

Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Connecting language to the world

Artificial Intelligence - Special volume on connecting language to the world
Word sense disambiguation with pictures

Artificial Intelligence - Special volume on connecting language to the world
Protocols from perceptual observations

Artificial Intelligence - Special volume on connecting language to the world
Semiotic schemas: a framework for grounding language in action and perception

Artificial Intelligence - Special volume on connecting language to the world
Word sense disambiguation with pictures

HLT-NAACL-LWM '04 Proceedings of the HLT-NAACL 2003 workshop on Learning word meaning from non-linguistic data - Volume 6
Nearest-neighbor automatic sound annotation with a WordNet taxonomy

Journal of Intelligent Information Systems - Special issue: Intelligent multimedia applications
Ontological inference for image and video analysis

Machine Vision and Applications
The Story Picturing Engine---a system for automatic text illustration

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Automatic image annotation and retrieval using weighted feature selection

Multimedia Tools and Applications
A latent mixed membership model for relational data

Proceedings of the 3rd international workshop on Link discovery
Finding visual concepts by web image mining

Proceedings of the 15th international conference on World Wide Web
Qualitative evaluation of automatic assignment of keywords to images

Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
Visual ontology construction for digitized art image retrieval

Journal of Computer Science and Technology
HISA: a query system bridging the semantic gap for large image databases

VLDB '06 Proceedings of the 32nd international conference on Very large data bases
CLAIRE: A modular support vector image indexing and classification system

ACM Transactions on Information Systems (TOIS)
An adaptive graph model for automatic image annotation

MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Image annotation by large-scale content-based image retrieval

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Real-time computerized annotation of pictures

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Toward bridging the annotation-retrieval gap in image search by a generative modeling approach

MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Semantic Modeling of Natural Scenes for Content-Based Image Retrieval

International Journal of Computer Vision
Incorporating multiple SVMs for automatic image annotation

Pattern Recognition
Segmentation and description of natural outdoor scenes

Image and Vision Computing
Unsupervised learning of a finite discrete mixture: Applications to texture modeling and image databases summarization

Journal of Visual Communication and Image Representation
Evaluating the performance in automatic image annotation: Example case by adaptive fusion of global image features

Image Communication
Semantic-associative visual content labelling and retrieval: A multimodal approach

Image Communication
Enhanced max margin learning on multimodal data mining in a multimedia database

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Model-shared subspace boosting for multi-label classification

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Image retrieval on large-scale image databases

Proceedings of the 6th ACM international conference on Image and video retrieval
Automatic image annotation by an iterative approach: incorporating keyword correlations and region matching

Proceedings of the 6th ACM international conference on Image and video retrieval
Semantic facets: an in-depth analysis of a semantic image retrieval system

Proceedings of the 6th ACM international conference on Image and video retrieval
Refining image annotation using contextual relations between words

Proceedings of the 6th ACM international conference on Image and video retrieval
Using multiple segmentations for image auto-annotation

Proceedings of the 6th ACM international conference on Image and video retrieval
How many high-level concepts will fill the semantic gap in news video retrieval?

Proceedings of the 6th ACM international conference on Image and video retrieval
Semantic identification: balancing between complexity and validity

EURASIP Journal on Applied Signal Processing
Discovering recurrent image semantics from class discrimination

EURASIP Journal on Applied Signal Processing
An efficient manual image annotation approach based on tagging and browsing

Workshop on multimedia information retrieval on The many faces of multimedia semantics
Unsupervised content-based indexing of sports video

Proceedings of the international workshop on Workshop on multimedia information retrieval
Learning people annotation from the web via consistency learning

Proceedings of the international workshop on Workshop on multimedia information retrieval
A review of text and image retrieval approaches for broadcast news video

Information Retrieval
Tagging over time: real-world image annotation by lightweight meta-learning

Proceedings of the 15th international conference on Multimedia
SBIA: search-based image annotation by leveraging web-scale images

Proceedings of the 15th international conference on Multimedia
Unsupervised content-based indexing for sports video retrieval

Proceedings of the 15th international conference on Multimedia
Exploiting spatial context constraints for automatic image region annotation

Proceedings of the 15th international conference on Multimedia
Modeling Semantic Aspects for Cross-Media Image Indexing

IEEE Transactions on Pattern Analysis and Machine Intelligence
Translating topics to words for image annotation

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Effect of word density on measuring words association

COMPUTE '08 Proceedings of the 1st Bangalore Annual Compute Conference
Fast image auto-annotation with discretized feature distance measures

Machine Graphics & Vision International Journal
Describing Visual Scenes Using Transformed Objects and Parts

International Journal of Computer Vision
Evaluation of Localized Semantics: Data, Methodology, and Experiments

International Journal of Computer Vision
Image retrieval: Ideas, influences, and trends of the new age

ACM Computing Surveys (CSUR)
Content visualization and management of geo-located image databases

CHI '08 Extended Abstracts on Human Factors in Computing Systems
Unsupervised learning of individuals and categories from images

Neural Computation
Knowledge discovery in multimedia repositories: the role of metadata

MMACTE'05 Proceedings of the 7th WSEAS International Conference on Mathematical Methods and Computational Techniques In Electrical Engineering
Flickr tag recommendation based on collective knowledge

Proceedings of the 17th international conference on World Wide Web
Automatic medical image annotation and retrieval

Neurocomputing
The evolution of visual information retrieval

Journal of Information Science
Inferring generic activities and events from image content and bags of geo-tags

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Non-negative matrix factorisation for object class discovery and image auto-annotation

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Continuous visual vocabulary modelsfor pLSA-based scene recognition

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Semantic spaces revisited: investigating the performance of auto-annotation and semantic retrieval using semantic spaces

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
A discrete direct retrieval model for image and video retrieval

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Learning to sportscast: a test of grounded language acquisition

Proceedings of the 25th international conference on Machine learning
Learning to reduce the semantic gap in web image retrieval and annotation

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Inferring semantics from textual information in multimedia retrieval

Neurocomputing
Integrating multi-modal content analysis and hyperbolic visualization for large-scale news video retrieval and exploration

Image Communication
A survey of methods for image annotation

Journal of Visual Languages and Computing
Multi-Class Segmentation with Relative Location Prior

International Journal of Computer Vision
Automatic Image Annotation with Relevance Feedback and Latent Semantic Analysis

Adaptive Multimedial Retrieval: Retrieval, User, and Semantics
Comparing Local Feature Descriptors in pLSA-Based Image Models

Proceedings of the 30th DAGM symposium on Pattern Recognition
Learning Visual Compound Models from Parallel Image-Text Datasets

Proceedings of the 30th DAGM symposium on Pattern Recognition
Watch, Listen & Learn: Co-training on Captioned Images and Videos

ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Combining global, regional and contextual features for automatic image annotation

Pattern Recognition
Event recognition: viewing the world with a third eye

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Learning tag relevance by neighbor voting for social image retrieval

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Semantic lattices for multiple annotation of images

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Representing and playing user selected video narrative domains

SRMC '08 Proceedings of the 2nd ACM international workshop on Story representation, mechanism and context
Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Learning Spatial Context: Using Stuff to Find Things

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Morphological segmentation on learned boundaries

Image and Vision Computing
Language Label Learning for Visual Concepts Discovered from Video Sequences

Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
Classification and Automatic Annotation Extension of Images Using Bayesian Network

SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval

Computer Vision and Image Understanding
Crossing textual and visual content in different application scenarios

Multimedia Tools and Applications
Web-Scale Image Annotation

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Annotating images and image objects using a hierarchical dirichlet process model

Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008
Mining the web for visual concepts

Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008
Collaborative editing of micro-tags

CHI '09 Extended Abstracts on Human Factors in Computing Systems
Using Second Order Statistics to Enhance Automated Image Annotation

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Exploiting Visual Concepts to Improve Text-Based Image Retrieval

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Structured correspondence topic models for mining captioned figures in biological literature

Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Deriving semantic terms for images by mining the web

Proceedings of the 11th International Conference on Electronic Commerce
Object boundary detection in images using a semantic ontology

AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
Bayesian word sense induction

EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Evaluating automatically generated user-focused multi-document summaries for geo-referenced images

MMIES '08 Proceedings of the Workshop on Multi-source Multilingual Information Extraction and Summarization
Incorporating temporal and semantic information with eye gaze for automatic word acquisition in multimodal conversational systems

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Situated models of meaning for sports video retrieval

NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
A Generic Approach to Topic Models

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Learning to connect language and perception

AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Towards automatic image region annotation: image region textual coreference resolution

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Semi-automatic dynamic auxiliary-tag-aided image annotation

Pattern Recognition
Pseudo-aligned multilingual corpora

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
A distributed, service-based framework for knowledge applications with multimedia

ACM Transactions on Information Systems (TOIS)
Visualizing textual travelogue with location-relevant images

Proceedings of the 2009 International Workshop on Location Based Social Networks
Image categorization combining neighborhood methods and boosting

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Semantics-preserving bag-of-words models for efficient image annotation

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Tagging and retrieving images with co-occurrence models: from corel to flickr

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
MovieBase: a movie database for event detection and behavioral analysis

WSMC '09 Proceedings of the 1st workshop on Web-scale multimedia corpus
Semi-supervised topic modeling for image annotation

MM '09 Proceedings of the 17th ACM international conference on Multimedia
What is a complete set of keywords for image description & annotation on the web

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Challenges for annotating images for sense disambiguation

LAC '06 Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006
Word sense disambiguation with pictures

Artificial Intelligence - Special volume on connecting language to the world
Connecting language to the world

Artificial Intelligence - Special volume on connecting language to the world
Semiotic schemas: A framework for grounding language in action and perception

Artificial Intelligence - Special volume on connecting language to the world
Protocols from perceptual observations

Artificial Intelligence - Special volume on connecting language to the world
Multilayer pLSA for multimodal image retrieval

Proceedings of the ACM International Conference on Image and Video Retrieval
A visual analysis of the relationship between word concepts and geographical locations

Proceedings of the ACM International Conference on Image and Video Retrieval
NUS-WIDE: a real-world web image database from National University of Singapore

Proceedings of the ACM International Conference on Image and Video Retrieval
Shape reasoning on mis-segmented and mis-labeled objects using approximated Fisher criterion

Computers and Graphics
Image annotation within the context of personal photo collections using hierarchical event and scene models

IEEE Transactions on Multimedia - Special issue on integration of context and content
Using visual context and region semantics for high-level concept detection

IEEE Transactions on Multimedia - Special issue on integration of context and content
Effective annotation and search for video blogs with integration of context and content analysis

IEEE Transactions on Multimedia - Special issue on integration of context and content
Learning color names for real-world applications

IEEE Transactions on Image Processing
Exploiting multi-modal interactions: a unified framework

IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Enrichment and Ranking of the YouTube Tag Space and Integration with the Linked Data Cloud

ISWC '09 Proceedings of the 8th International Semantic Web Conference
Learning image semantics with latent aspect model

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Multimodal pLSA on visual features and tags

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Multimedia multimodal methodologies

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Towards developing a unified multimodal image retrieval framework

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Leveraging social media for training object detectors

DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
The role of interactivity in human-machine conversation for automatic word acquisition

SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Unsupervised image categorization

Image and Vision Computing
Qualitative evaluation of automatic assignment of keywords to images

Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
Learning social tag relevance by neighbor voting

IEEE Transactions on Multimedia
A Multi-Pronged Approach to Improving Semantic Extraction of News Video

Journal of Signal Processing Systems
Incorporating concept ontology into multi-level image indexing

Proceedings of the First International Conference on Internet Multimedia Computing and Service
Improved Resulted Word Counts Optimizer for Automatic Image Annotation Problem

Fundamenta Informaticae - Advances in Artificial Intelligence and Applications
Quest for relevant tags using local interaction networks and visual content

Proceedings of the international conference on Multimedia information retrieval
Topic models for semantics-preserving video compression

Proceedings of the international conference on Multimedia information retrieval
Region-based automatic web image selection

Proceedings of the international conference on Multimedia information retrieval
Combining visual features and text data for medical image retrieval using latent semantic kernels

Proceedings of the international conference on Multimedia information retrieval
Assessment of the utility of tag clouds for faster image retrieval

Proceedings of the international conference on Multimedia information retrieval
Image annotation with tagprop on the MIRFLICKR set

Proceedings of the international conference on Multimedia information retrieval
Statistical modeling and conceptualization of natural images

Pattern Recognition
Combining intra-image and inter-class semantics for consumer image retrieval

Pattern Recognition
OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning

International Journal of Computer Vision
The segmented and annotated IAPR TC-12 benchmark

Computer Vision and Image Understanding
Dynamic Adaboost learning with feature selection based on parallel genetic algorithm for image annotation

Knowledge-Based Systems
Learning natural scene categories by selective multi-scale feature extraction

Image and Vision Computing
A shared-subspace learning framework for multi-label classification

ACM Transactions on Knowledge Discovery from Data (TKDD)
Learning to retrieve images from text queries with a discriminative model

AMR'06 Proceedings of the 4th international conference on Adaptive multimedia retrieval: user, context, and feedback
Semantic feature selection for object discovery in high-resolution remote sensing imagery

PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Combining stochastic block models and mixed membership for statistical network analysis

ICML'06 Proceedings of the 2006 conference on Statistical network analysis
Deriving a priori co-occurrence probability estimates for object recognition from social networks and text processing

ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Multi-modal multi-label semantic indexing of images based on hybrid ensemble learning

PCM'07 Proceedings of the multimedia 8th Pacific Rim conference on Advances in multimedia information processing
Hierarchical long-term learning for automatic image annotation

SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
Document layout substructure discovery

SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
A study of vocabularies for image annotation

SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
Collaterally cued labelling framework underpinning semantic-level visual content descriptor

VISUAL'07 Proceedings of the 9th international conference on Advances in visual information systems
Comparing LDA with pLSI as a dimensionality reduction method in document clustering

LKR'08 Proceedings of the 3rd international conference on Large-scale knowledge resources: construction and application
A system that learns to tag videos by watching youtube

ICVS'08 Proceedings of the 6th international conference on Computer vision systems
Semantic relationships in multi-modal graphs for automatic image annotation

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Finding the best picture: cross-media retrieval of content

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Improving image annotation via representative feature vector selection

Neurocomputing
Modeling, classifying and annotating weakly annotated images using Bayesian network

Journal of Visual Communication and Image Representation
Variational Bayes for generic topic models

KI'09 Proceedings of the 32nd annual German conference on Advances in artificial intelligence
The accuracy and value of machine-generated image tags: design and user evaluation of an end-to-end image tagging system

Proceedings of the ACM International Conference on Image and Video Retrieval
Modeling latent aspects for automatic image annotation

ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Automatic tag expansion using visual similarity for photo sharing websites

Multimedia Tools and Applications
Visual information in semantic representation

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Topic models for image annotation and text illustration

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
How many words is a picture worth? Automatic caption generation for news images

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Generating image descriptions using dependency relational patterns

ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Context-based word acquisition for situated dialogue in a virtual world

Journal of Artificial Intelligence Research
Training a multilingual sportscaster: using perceptual context to learn language

Journal of Artificial Intelligence Research
Semantics-preserving bag-of-words models and applications

IEEE Transactions on Image Processing
Combining CBIR and NLP for multilingual terminology alignment and cross-language image indexing

YIWCALA '10 Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas
Cross-caption coreference resolution for automatic image understanding

CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Fusing semantic aspects for image annotation and retrieval

Journal of Visual Communication and Image Representation
Enriching dictionaries with images from the internet: targeting Wikipedia and a Japanese semantic lexicon: Lexeed

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Multi-label boosting for image annotation by structural grouping sparsity

Proceedings of the international conference on Multimedia
A new approach to cross-modal multimedia retrieval

Proceedings of the international conference on Multimedia
Context dependent SVMs for interconnected image network annotation

Proceedings of the international conference on Multimedia
Image to text translation by multi-label classification

ICIC'10 Proceedings of the Advanced intelligent computing theories and applications, and 6th international conference on Intelligent computing
Automatic attribute discovery and characterization from noisy web data

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Impact of visual information on text and content based image retrieval

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Large-scale text to image retrieval using a Bayesian K-neighborhood model

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Improving automatic image captioning using text summarization techniques

TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
Seeing people in social context: recognizing people and social relationships

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Discovering multipart appearance models from captioned images

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Image annotation by sparse logistic regression

PCM'10 Proceedings of the Advances in multimedia information processing, and 11th Pacific Rim conference on Multimedia: Part II
MOSIR: image and segment-based retrieval for mobile phones

PCM'10 Proceedings of the Advances in multimedia information processing, and 11th Pacific Rim conference on Multimedia: Part II
Attacking image recognition CAPTCHAS: a naive but effective approach

TrustBus'10 Proceedings of the 7th international conference on Trust, privacy and security in digital business
Names and faces in the news

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Multiple Bernoulli relevance models for image and video annotation

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Variational inference with graph regularization for image annotation

ACM Transactions on Intelligent Systems and Technology (TIST)
Video annotation using hierarchical Dirichlet process mixture model

Expert Systems with Applications: An International Journal
Modeling continuous visual features for semantic image annotation and retrieval

Pattern Recognition Letters
Cross-media entity recognition in nearly parallel visual and textual documents

Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Visual topic model for web image annotation

ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
Automatic image semantic interpretation using social action and tagging data

Multimedia Tools and Applications
Multimedia data mining: state of the art and challenges

Multimedia Tools and Applications
Multimodal summarization of complex sentences

Proceedings of the 16th international conference on Intelligent user interfaces
Multiple hypergraph clustering of web images by mining Word2Image correlations

Journal of Computer Science and Technology
Image annotation with concept level feature using PLSA+CCA

MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II
Logistic Stick-Breaking Process

The Journal of Machine Learning Research
Context-based support vector machines for interconnected image annotation

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part I
An HMM-SVM-based automatic image annotation approach

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part IV
An energy-based model for region-labeling

Computer Vision and Image Understanding
Visual content representation using semantically similar visual words

Expert Systems with Applications: An International Journal
Mining software repositories using topic models

Proceedings of the 33rd International Conference on Software Engineering
Probabilistic image tagging with tags expanded by text-based search

DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Semantics extraction from images

Knowledge-driven multimedia information extraction and ontology evolution
Integrating domain knowledge with user eye gaze in automated word acquisition for conversational interfaces

Proceedings of the 2010 workshop on Eye gaze in intelligent human machine interaction
Performance analysis of improved affinity propagation algorithm for image semantic annotation

ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
Composing simple image descriptions using web-scale n-grams

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Mining partially annotated images

Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning from Partial Labels

The Journal of Machine Learning Research
Multiclass classification with potential function rules: Margin distribution and generalization

Pattern Recognition
Typology of mixed-membership models: towards a design method

ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Tagging image by exploring weighted correlation between visual features and tags

WAIM'11 Proceedings of the 12th international conference on Web-age information management
Regionwise classification of building facade images

PIA'11 Proceedings of the 2011 ISPRS conference on Photogrammetric image analysis
Enriching textbooks with images

Proceedings of the 20th ACM international conference on Information and knowledge management
Multimodal representation, indexing, automated annotation and retrieval of image collections via non-negative matrix factorization

Neurocomputing
Fusing object detection and region appearance for image-text alignment

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multi-feature pLSA for combining visual features in image annotation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Low-dimensional and comprehensive color texture description

Computer Vision and Image Understanding
A Multi-Directional Search technique for image annotation propagation

Journal of Visual Communication and Image Representation
Image clustering using multimodal keywords

SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies
Automatic video annotation and retrieval based on bayesian inference

MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Feature selection for automatic image annotation

DAGM'06 Proceedings of the 28th conference on Pattern Recognition
A neural network to retrieve images from text queries

ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part II
Naming of image regions for user-friendly image retrieval

ICIAR'06 Proceedings of the Third international conference on Image Analysis and Recognition - Volume Part I
A discriminative approach for the retrieval of images from text queries

ECML'06 Proceedings of the 17th European conference on Machine Learning
The 2005 PASCAL visual object classes challenge

MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment
Finding uninformative features in binary data

IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Automatic image annotation based on topic-based smoothing

IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Approximation of linear discriminant analysis for word dependent visual features selection

ACIVS'05 Proceedings of the 7th international conference on Advanced Concepts for Intelligent Vision Systems
Automatic annotation of images from the practitioner perspective

CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Semantic annotation of image groups with self-organizing maps

CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Learning shapes for image classification and retrieval

CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
SIERRA – a superimposed application for enhanced image description and retrieval

ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
Incorporating prior knowledge into multi-label boosting for cross-modal image annotation and retrieval

AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
A semantic fusion approach between medical images and reports using UMLS

AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Waterfall segmentation of complex scenes

ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I
Automatic image annotation based on wordnet and hierarchical ensembles

CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Automatic image annotation by mining the web

DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
A correlation approach for automatic image annotation

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Face Recognition from Caption-Based Supervision

International Journal of Computer Vision
Studying aesthetics in photographic images using a computational approach

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part III
Estimation of mixture models using Co-EM

ECML'05 Proceedings of the 16th European conference on Machine Learning
Object recognition via local patch labelling

Proceedings of the First international conference on Deterministic and Statistical Methods in Machine Learning
Automatic image annotation using maximum entropy model

IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Ontology-mediated distributed decision support for breast cancer

AIME'05 Proceedings of the 10th conference on Artificial Intelligence in Medicine
Improving image annotations using wordnet

MIS'05 Proceedings of the 11th international conference on Advances in Multimedia Information Systems
Multimodal indexing based on semantic cohesion for image retrieval

Information Retrieval
Video fingerprinting using Latent Dirichlet Allocation and facial images

Pattern Recognition
Automatic image tagging using community-driven online image databases

AMR'08 Proceedings of the 6th international conference on Adaptive Multimedia Retrieval: identifying, Summarizing, and Recommending Image and Music
A generative model for multi class object recognition and detection

TAINN'05 Proceedings of the 14th Turkish conference on Artificial Intelligence and Neural Networks
Query by semantic example

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Recognizing objects and scenes in news videos

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Semi-supervised learning for image annotation based on conditional random fields

CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Automatic image segmentation by positioning a seed

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
Automatic generation of funny cartoons diary for everyday mobile life

AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Automatic annotation and retrieval for videos

PSIVT'06 Proceedings of the First Pacific Rim conference on Advances in Image and Video Technology
Leveraging social media for scalable object detection

Pattern Recognition
Interactive multimedia system for distance learning of higher education

Edutainment'06 Proceedings of the First international conference on Technologies for E-Learning and Digital Entertainment
Topic discovery and topic-driven clustering for audit method datasets

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Building semantic hierarchies faithful to image semantics

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Combining image-level and segment-level models for automatic annotation

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
A novel multi-modal integration and propagation model for cross-media information retrieval

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Discovering hierarchical object models from captioned images

Computer Vision and Image Understanding
Learning bilingual lexicons using the visual similarity of labeled web images

IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Halfway through the semantic gap: Prosemantic features for image retrieval

Information Sciences: an International Journal
Learning to summarize web image and text mutually

Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
On Taxonomies for Multi-class Image Categorization

International Journal of Computer Vision
Improved Resulted Word Counts Optimizer for Automatic Image Annotation Problem

Fundamenta Informaticae - Advances in Artificial Intelligence and Applications
DRETOM: developer recommendation based on topic models for bug resolution

Proceedings of the 8th International Conference on Predictive Models in Software Engineering
A semantic approach to recommending text advertisements for images

Proceedings of the sixth ACM conference on Recommender systems
Movie keyframe retrieval based on cross-media correlation detection and context model

IEA/AIE'12 Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence
Apples to oranges: evaluating image annotations from natural language processing systems

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Learning the Relative Importance of Objects from Tagged Images for Retrieval and Cross-Modal Search

International Journal of Computer Vision
Multi-view learning from imperfect tagging

Proceedings of the 20th ACM international conference on Multimedia
Use of adaptive still image descriptors for annotation of video frames

ICIAR'07 Proceedings of the 4th international conference on Image Analysis and Recognition
Mining the web for appearance description

DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Distributional semantics with eyes: using image analysis to improve computational representations of word meaning

Proceedings of the 20th ACM international conference on Multimedia
Analyzing social media via event facets

Proceedings of the 20th ACM international conference on Multimedia
Visualizing timelines: evolutionary summarization via iterative reinforcement between text and image streams

Proceedings of the 21st ACM international conference on Information and knowledge management
Hierarchical image annotation using semantic hierarchies

Proceedings of the 21st ACM international conference on Information and knowledge management
A game-theoretic analysis of the ESP game

ACM Transactions on Economics and Computation - Inaugural Issue
Towards concept anchoring for cognitive robots

Intelligent Service Robotics
An efficient two-stage framework for image annotation

Pattern Recognition
Image retrieval with structured object queries using latent ranking SVM

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Discriminative factor alignment across heterogeneous feature space

ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
An interactive semi-supervised approach for automatic image annotation

PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Improving image tags by exploiting web search results

Multimedia Tools and Applications
Bidirectional-isomorphic manifold learning at image semantic understanding & representation

Multimedia Tools and Applications
Automatic image annotation and semantic based image retrieval for medical domain

Neurocomputing
Tagging photos using users' vocabularies

Neurocomputing
A picture is worth a thousand tags: automatic web based image tag expansion

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Unsupervised language learning for discovered visual concepts

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part IV
Annotation propagation in image databases using similarity graphs

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
"Tell me more": how semantic technologies can help refining internet image search

Proceedings of the International Workshop on Video and Image Ground Truth in Computer Vision Applications
Cross-media semantic representation via bi-directional learning to rank

Proceedings of the 21st ACM international conference on Multimedia
Picture tags and world knowledge: learning tag relations from visual semantic sources

Proceedings of the 21st ACM international conference on Multimedia
Scientific articles recommendation

Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
3D Wikipedia: using online text to automatically label and navigate reconstructed geometry

ACM Transactions on Graphics (TOG)
Learning semantic concepts from image database with hybrid generative/discriminative approach

Engineering Applications of Artificial Intelligence
Applying a lightweight iterative merging chinese segmentation in web image annotation

MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
Explicit context-aware kernel map learning for image annotation

ICVS'13 Proceedings of the 9th international conference on Computer Vision Systems
Fisher Linear Discriminant Analysis for text-image combination in multimedia information retrieval

Pattern Recognition
Neighborhood rough sets based multi-label classification for automatic image annotation

International Journal of Approximate Reasoning
A User-friendly Image-Text Fusion CAPTCHA for Secure Web Services

Proceedings of International Conference on Information Integration and Web-based Applications & Services
Manifold alignment preserving global geometry

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Learning group-based dictionaries for discriminative image representation

Pattern Recognition
Support vector description of clusters for content-based image annotation

Pattern Recognition
Cross domain recommendation based on multi-type media fusion

Neurocomputing
Framing image description as a ranking task: data, models and evaluation metrics

Journal of Artificial Intelligence Research
Clustering results of image searches by annotations and visual features

Telematics and Informatics
Probabilistic Joint Image Segmentation and Labeling by Figure-Ground Composition

International Journal of Computer Vision

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a new approach for modeling multi-modal data sets, focusing on the specific case of segmented images with associated text. Learning the joint distribution of image regions and words has many applications. We consider in detail predicting words associated with whole images (auto-annotation) and corresponding to particular image regions (region naming). Auto-annotation might help organize and access large collections of images. Region naming is a model of object recognition as a process of translating image regions to words, much as one might translate from one language to another. Learning the relationships between image regions and semantic correlates (words) is an interesting example of multi-modal data mining, particularly because it is typically hard to apply data mining techniques to collections of images. We develop a number of models for the joint distribution of image regions and words, including several which explicitly learn the correspondence between regions and words. We study multi-modal and correspondence extensions to Hofmann's hierarchical clustering/aspect model, a translation model adapted from statistical machine translation (Brown et al.), and a multi-modal extension to mixture of latent Dirichlet allocation (MoM-LDA). All models are assessed using a large collection of annotated images of real scenes. We study in depth the difficult problem of measuring performance. For the annotation task, we look at prediction performance on held out data. We present three alternative measures, oriented toward different types of task. Measuring the performance of correspondence methods is harder, because one must determine whether a word has been placed on the right region of an image. We can use annotation performance as a proxy measure, but accurate measurement requires hand labeled data, and thus must occur on a smaller scale. We show results using both an annotation proxy, and manually labeled data.