Extracting visual information from text: using captions to label faces in newspaper photographs
Extracting visual information from text: using captions to label faces in newspaper photographs
Visual semantics: extracting visual information from text accompanying pictures
AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Foundations of statistical natural language processing
Foundations of statistical natural language processing
Normalized Cuts and Image Segmentation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
Computer Vision: A Modern Approach
Computer Vision: A Modern Approach
End-User Searching Challenges Indexing Practices inthe Digital Newspaper Photo Archive
Information Retrieval
Browse and Search Patterns in a Digital Image Database
Information Retrieval
Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying
IEEE Transactions on Pattern Analysis and Machine Intelligence
ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume II - Volume II
Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Multiple-Instance Learning for Natural Scene Classification
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Combining Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web
CBAIVL '98 Proceedings of the IEEE Workshop on Content - Based Access of Image and Video Libraries
Name-It: Association of Face and Name in Video
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Pedestrian Detection Using Wavelet Templates
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Statistical Models for Co-occurrence Data
Statistical Models for Co-occurrence Data
WebSeer: An Image Search Engine for the World Wide Web
WebSeer: An Image Search Engine for the World Wide Web
Learning from ambiguity
The mathematics of statistical machine translation: parameter estimation
Computational Linguistics - Special issue on using large corpora: II
Hierarchical browsing and search of large image databases
IEEE Transactions on Image Processing
Automatic image annotation and retrieval using cross-media relevance models
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
On image auto-annotation with latent space models
MULTIMEDIA '03 Proceedings of the eleventh ACM international conference on Multimedia
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic multimedia cross-modal correlation discovery
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
Image Categorization by Learning and Reasoning with Regions
The Journal of Machine Learning Research
The story picturing engine: finding elite images to illustrate a story using mutual reinforcement
Proceedings of the 6th ACM SIGMM international workshop on Multimedia information retrieval
Proceedings of the 12th annual ACM international conference on Multimedia
PLSA-based image auto-annotation: constraining the latent space
Proceedings of the 12th annual ACM international conference on Multimedia
Multi-level annotation of natural scenes using dominant image components and semantic concepts
Proceedings of the 12th annual ACM international conference on Multimedia
Efficient propagation for face annotation in family albums
Proceedings of the 12th annual ACM international conference on Multimedia
Regularizing translation models for better automatic image annotation
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Automatic image annotation and retrieval using subspace clustering algorithm
Proceedings of the 2nd ACM international workshop on Multimedia databases
Retrieving lightly annotated images using image similarities
Proceedings of the 2005 ACM symposium on Applied computing
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Hidden Markov models for automatic annotation and content-based retrieval of images and video
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A database centric view of semantic image annotation and retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Mining images on semantics via statistical learning
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Using dual cascading learning frameworks for image indexing
VIP '05 Proceedings of the Pan-Sydney area workshop on Visual information processing
Exploiting a sensed environment to improve human-agent communication
Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems
Combining Sequence and Time Series Expression Data to Learn Transcriptional Modules
IEEE/ACM Transactions on Computational Biology and Bioinformatics (TCBB)
Image region entropy: a measure of "visualness" of web images associated with one concept
Proceedings of the 13th annual ACM international conference on Multimedia
Region based image annotation through multiple-instance learning
Proceedings of the 13th annual ACM international conference on Multimedia
Learning an image-word embedding for image auto-annotation on the nonlinear latent space
Proceedings of the 13th annual ACM international conference on Multimedia
Two-scale image retrieval with significant meta-information feedback
Proceedings of the 13th annual ACM international conference on Multimedia
Image annotations by combining multiple evidence & wordNet
Proceedings of the 13th annual ACM international conference on Multimedia
Similarity space projection for web image search and annotation
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Probabilistic web image gathering
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Semantic image classification with hierarchical feature subset selection
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
A mutual semantic endorsement approach to image retrieval and context provision
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Hybrid visual and conceptual image representation within active relevance feedback context
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Evaluation strategies for image understanding and retrieval
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Localized content based image retrieval
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Content-based image retrieval: approaches and trends of the new age
Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
Connecting language to the world
Artificial Intelligence - Special volume on connecting language to the world
Word sense disambiguation with pictures
Artificial Intelligence - Special volume on connecting language to the world
Protocols from perceptual observations
Artificial Intelligence - Special volume on connecting language to the world
Semiotic schemas: a framework for grounding language in action and perception
Artificial Intelligence - Special volume on connecting language to the world
Word sense disambiguation with pictures
HLT-NAACL-LWM '04 Proceedings of the HLT-NAACL 2003 workshop on Learning word meaning from non-linguistic data - Volume 6
Nearest-neighbor automatic sound annotation with a WordNet taxonomy
Journal of Intelligent Information Systems - Special issue: Intelligent multimedia applications
Ontological inference for image and video analysis
Machine Vision and Applications
The Story Picturing Engine---a system for automatic text illustration
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Automatic image annotation and retrieval using weighted feature selection
Multimedia Tools and Applications
A latent mixed membership model for relational data
Proceedings of the 3rd international workshop on Link discovery
Finding visual concepts by web image mining
Proceedings of the 15th international conference on World Wide Web
Qualitative evaluation of automatic assignment of keywords to images
Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
Visual ontology construction for digitized art image retrieval
Journal of Computer Science and Technology
HISA: a query system bridging the semantic gap for large image databases
VLDB '06 Proceedings of the 32nd international conference on Very large data bases
CLAIRE: A modular support vector image indexing and classification system
ACM Transactions on Information Systems (TOIS)
An adaptive graph model for automatic image annotation
MIR '06 Proceedings of the 8th ACM international workshop on Multimedia information retrieval
Image annotation by large-scale content-based image retrieval
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Real-time computerized annotation of pictures
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Toward bridging the annotation-retrieval gap in image search by a generative modeling approach
MULTIMEDIA '06 Proceedings of the 14th annual ACM international conference on Multimedia
Semantic Modeling of Natural Scenes for Content-Based Image Retrieval
International Journal of Computer Vision
Incorporating multiple SVMs for automatic image annotation
Pattern Recognition
Segmentation and description of natural outdoor scenes
Image and Vision Computing
Journal of Visual Communication and Image Representation
Enhanced max margin learning on multimodal data mining in a multimedia database
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Model-shared subspace boosting for multi-label classification
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Image retrieval on large-scale image databases
Proceedings of the 6th ACM international conference on Image and video retrieval
Proceedings of the 6th ACM international conference on Image and video retrieval
Semantic facets: an in-depth analysis of a semantic image retrieval system
Proceedings of the 6th ACM international conference on Image and video retrieval
Refining image annotation using contextual relations between words
Proceedings of the 6th ACM international conference on Image and video retrieval
Using multiple segmentations for image auto-annotation
Proceedings of the 6th ACM international conference on Image and video retrieval
How many high-level concepts will fill the semantic gap in news video retrieval?
Proceedings of the 6th ACM international conference on Image and video retrieval
Semantic identification: balancing between complexity and validity
EURASIP Journal on Applied Signal Processing
Discovering recurrent image semantics from class discrimination
EURASIP Journal on Applied Signal Processing
An efficient manual image annotation approach based on tagging and browsing
Workshop on multimedia information retrieval on The many faces of multimedia semantics
Unsupervised content-based indexing of sports video
Proceedings of the international workshop on Workshop on multimedia information retrieval
Learning people annotation from the web via consistency learning
Proceedings of the international workshop on Workshop on multimedia information retrieval
A review of text and image retrieval approaches for broadcast news video
Information Retrieval
Tagging over time: real-world image annotation by lightweight meta-learning
Proceedings of the 15th international conference on Multimedia
SBIA: search-based image annotation by leveraging web-scale images
Proceedings of the 15th international conference on Multimedia
Unsupervised content-based indexing for sports video retrieval
Proceedings of the 15th international conference on Multimedia
Exploiting spatial context constraints for automatic image region annotation
Proceedings of the 15th international conference on Multimedia
Modeling Semantic Aspects for Cross-Media Image Indexing
IEEE Transactions on Pattern Analysis and Machine Intelligence
Translating topics to words for image annotation
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Effect of word density on measuring words association
COMPUTE '08 Proceedings of the 1st Bangalore Annual Compute Conference
Fast image auto-annotation with discretized feature distance measures
Machine Graphics & Vision International Journal
Describing Visual Scenes Using Transformed Objects and Parts
International Journal of Computer Vision
Evaluation of Localized Semantics: Data, Methodology, and Experiments
International Journal of Computer Vision
Image retrieval: Ideas, influences, and trends of the new age
ACM Computing Surveys (CSUR)
Content visualization and management of geo-located image databases
CHI '08 Extended Abstracts on Human Factors in Computing Systems
Unsupervised learning of individuals and categories from images
Neural Computation
Knowledge discovery in multimedia repositories: the role of metadata
MMACTE'05 Proceedings of the 7th WSEAS International Conference on Mathematical Methods and Computational Techniques In Electrical Engineering
Flickr tag recommendation based on collective knowledge
Proceedings of the 17th international conference on World Wide Web
Automatic medical image annotation and retrieval
Neurocomputing
The evolution of visual information retrieval
Journal of Information Science
Inferring generic activities and events from image content and bags of geo-tags
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Non-negative matrix factorisation for object class discovery and image auto-annotation
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Continuous visual vocabulary modelsfor pLSA-based scene recognition
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
A discrete direct retrieval model for image and video retrieval
CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
Learning to sportscast: a test of grounded language acquisition
Proceedings of the 25th international conference on Machine learning
Learning to reduce the semantic gap in web image retrieval and annotation
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A survey of methods for image annotation
Journal of Visual Languages and Computing
Multi-Class Segmentation with Relative Location Prior
International Journal of Computer Vision
Automatic Image Annotation with Relevance Feedback and Latent Semantic Analysis
Adaptive Multimedial Retrieval: Retrieval, User, and Semantics
Comparing Local Feature Descriptors in pLSA-Based Image Models
Proceedings of the 30th DAGM symposium on Pattern Recognition
Learning Visual Compound Models from Parallel Image-Text Datasets
Proceedings of the 30th DAGM symposium on Pattern Recognition
Watch, Listen & Learn: Co-training on Captioned Images and Videos
ECML PKDD '08 Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Part I
Event recognition: viewing the world with a third eye
MM '08 Proceedings of the 16th ACM international conference on Multimedia
Learning tag relevance by neighbor voting for social image retrieval
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Semantic lattices for multiple annotation of images
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Representing and playing user selected video narrative domains
SRMC '08 Proceedings of the 2nd ACM international workshop on Story representation, mechanism and context
Beyond Nouns: Exploiting Prepositions and Comparative Adjectives for Learning Visual Classifiers
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Learning Spatial Context: Using Stuff to Find Things
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Morphological segmentation on learned boundaries
Image and Vision Computing
Language Label Learning for Visual Concepts Discovered from Video Sequences
Attention in Cognitive Systems. Theories and Systems from an Interdisciplinary Viewpoint
Classification and Automatic Annotation Extension of Images Using Bayesian Network
SSPR & SPR '08 Proceedings of the 2008 Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Visual word proximity and linguistics for semantic video indexing and near-duplicate retrieval
Computer Vision and Image Understanding
Crossing textual and visual content in different application scenarios
Multimedia Tools and Applications
PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Annotating images and image objects using a hierarchical dirichlet process model
Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008
Mining the web for visual concepts
Proceedings of the 9th International Workshop on Multimedia Data Mining: held in conjunction with the ACM SIGKDD 2008
Collaborative editing of micro-tags
CHI '09 Extended Abstracts on Human Factors in Computing Systems
Using Second Order Statistics to Enhance Automated Image Annotation
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Exploiting Visual Concepts to Improve Text-Based Image Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Structured correspondence topic models for mining captioned figures in biological literature
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Deriving semantic terms for images by mining the web
Proceedings of the 11th International Conference on Electronic Commerce
Object boundary detection in images using a semantic ontology
AAAI'06 Proceedings of the 21st national conference on Artificial intelligence - Volume 1
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
Evaluating automatically generated user-focused multi-document summaries for geo-referenced images
MMIES '08 Proceedings of the Workshop on Multi-source Multilingual Information Extraction and Summarization
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Situated models of meaning for sports video retrieval
NAACL-Short '07 Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers
A Generic Approach to Topic Models
ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Learning to connect language and perception
AAAI'08 Proceedings of the 23rd national conference on Artificial intelligence - Volume 3
Towards automatic image region annotation: image region textual coreference resolution
NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Semi-automatic dynamic auxiliary-tag-aided image annotation
Pattern Recognition
Pseudo-aligned multilingual corpora
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
A distributed, service-based framework for knowledge applications with multimedia
ACM Transactions on Information Systems (TOIS)
Visualizing textual travelogue with location-relevant images
Proceedings of the 2009 International Workshop on Location Based Social Networks
Image categorization combining neighborhood methods and boosting
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Semantics-preserving bag-of-words models for efficient image annotation
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Tagging and retrieving images with co-occurrence models: from corel to flickr
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
MovieBase: a movie database for event detection and behavioral analysis
WSMC '09 Proceedings of the 1st workshop on Web-scale multimedia corpus
Semi-supervised topic modeling for image annotation
MM '09 Proceedings of the 17th ACM international conference on Multimedia
What is a complete set of keywords for image description & annotation on the web
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Challenges for annotating images for sense disambiguation
LAC '06 Proceedings of the Workshop on Frontiers in Linguistically Annotated Corpora 2006
Word sense disambiguation with pictures
Artificial Intelligence - Special volume on connecting language to the world
Connecting language to the world
Artificial Intelligence - Special volume on connecting language to the world
Semiotic schemas: A framework for grounding language in action and perception
Artificial Intelligence - Special volume on connecting language to the world
Protocols from perceptual observations
Artificial Intelligence - Special volume on connecting language to the world
Multilayer pLSA for multimodal image retrieval
Proceedings of the ACM International Conference on Image and Video Retrieval
A visual analysis of the relationship between word concepts and geographical locations
Proceedings of the ACM International Conference on Image and Video Retrieval
NUS-WIDE: a real-world web image database from National University of Singapore
Proceedings of the ACM International Conference on Image and Video Retrieval
Shape reasoning on mis-segmented and mis-labeled objects using approximated Fisher criterion
Computers and Graphics
IEEE Transactions on Multimedia - Special issue on integration of context and content
Using visual context and region semantics for high-level concept detection
IEEE Transactions on Multimedia - Special issue on integration of context and content
Effective annotation and search for video blogs with integration of context and content analysis
IEEE Transactions on Multimedia - Special issue on integration of context and content
Learning color names for real-world applications
IEEE Transactions on Image Processing
Exploiting multi-modal interactions: a unified framework
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Enrichment and Ranking of the YouTube Tag Space and Integration with the Linked Data Cloud
ISWC '09 Proceedings of the 8th International Semantic Web Conference
Learning image semantics with latent aspect model
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Multimodal pLSA on visual features and tags
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Multimedia multimodal methodologies
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Towards developing a unified multimodal image retrieval framework
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Leveraging social media for training object detectors
DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
The role of interactivity in human-machine conversation for automatic word acquisition
SIGDIAL '09 Proceedings of the SIGDIAL 2009 Conference: The 10th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Unsupervised image categorization
Image and Vision Computing
Qualitative evaluation of automatic assignment of keywords to images
Information Processing and Management: an International Journal - Special issue: Formal methods for information retrieval
Learning social tag relevance by neighbor voting
IEEE Transactions on Multimedia
A Multi-Pronged Approach to Improving Semantic Extraction of News Video
Journal of Signal Processing Systems
Incorporating concept ontology into multi-level image indexing
Proceedings of the First International Conference on Internet Multimedia Computing and Service
Improved Resulted Word Counts Optimizer for Automatic Image Annotation Problem
Fundamenta Informaticae - Advances in Artificial Intelligence and Applications
Quest for relevant tags using local interaction networks and visual content
Proceedings of the international conference on Multimedia information retrieval
Topic models for semantics-preserving video compression
Proceedings of the international conference on Multimedia information retrieval
Region-based automatic web image selection
Proceedings of the international conference on Multimedia information retrieval
Combining visual features and text data for medical image retrieval using latent semantic kernels
Proceedings of the international conference on Multimedia information retrieval
Assessment of the utility of tag clouds for faster image retrieval
Proceedings of the international conference on Multimedia information retrieval
Image annotation with tagprop on the MIRFLICKR set
Proceedings of the international conference on Multimedia information retrieval
Statistical modeling and conceptualization of natural images
Pattern Recognition
Combining intra-image and inter-class semantics for consumer image retrieval
Pattern Recognition
OPTIMOL: Automatic Online Picture Collection via Incremental Model Learning
International Journal of Computer Vision
The segmented and annotated IAPR TC-12 benchmark
Computer Vision and Image Understanding
Learning natural scene categories by selective multi-scale feature extraction
Image and Vision Computing
A shared-subspace learning framework for multi-label classification
ACM Transactions on Knowledge Discovery from Data (TKDD)
Learning to retrieve images from text queries with a discriminative model
AMR'06 Proceedings of the 4th international conference on Adaptive multimedia retrieval: user, context, and feedback
Semantic feature selection for object discovery in high-resolution remote sensing imagery
PAKDD'07 Proceedings of the 11th Pacific-Asia conference on Advances in knowledge discovery and data mining
Combining stochastic block models and mixed membership for statistical network analysis
ICML'06 Proceedings of the 2006 conference on Statistical network analysis
ISVC'07 Proceedings of the 3rd international conference on Advances in visual computing - Volume Part II
Multi-modal multi-label semantic indexing of images based on hybrid ensemble learning
PCM'07 Proceedings of the multimedia 8th Pacific Rim conference on Advances in multimedia information processing
Hierarchical long-term learning for automatic image annotation
SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
Document layout substructure discovery
SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
A study of vocabularies for image annotation
SAMT'07 Proceedings of the semantic and digital media technologies 2nd international conference on Semantic Multimedia
Collaterally cued labelling framework underpinning semantic-level visual content descriptor
VISUAL'07 Proceedings of the 9th international conference on Advances in visual information systems
Comparing LDA with pLSI as a dimensionality reduction method in document clustering
LKR'08 Proceedings of the 3rd international conference on Large-scale knowledge resources: construction and application
A system that learns to tag videos by watching youtube
ICVS'08 Proceedings of the 6th international conference on Computer vision systems
Semantic relationships in multi-modal graphs for automatic image annotation
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Finding the best picture: cross-media retrieval of content
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Modeling, classifying and annotating weakly annotated images using Bayesian network
Journal of Visual Communication and Image Representation
Variational Bayes for generic topic models
KI'09 Proceedings of the 32nd annual German conference on Advances in artificial intelligence
Proceedings of the ACM International Conference on Image and Video Retrieval
Modeling latent aspects for automatic image annotation
ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
Automatic tag expansion using visual similarity for photo sharing websites
Multimedia Tools and Applications
Visual information in semantic representation
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Topic models for image annotation and text illustration
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
How many words is a picture worth? Automatic caption generation for news images
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Generating image descriptions using dependency relational patterns
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Context-based word acquisition for situated dialogue in a virtual world
Journal of Artificial Intelligence Research
Training a multilingual sportscaster: using perceptual context to learn language
Journal of Artificial Intelligence Research
Semantics-preserving bag-of-words models and applications
IEEE Transactions on Image Processing
Combining CBIR and NLP for multilingual terminology alignment and cross-language image indexing
YIWCALA '10 Proceedings of the NAACL HLT 2010 Young Investigators Workshop on Computational Approaches to Languages of the Americas
Cross-caption coreference resolution for automatic image understanding
CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Fusing semantic aspects for image annotation and retrieval
Journal of Visual Communication and Image Representation
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Multi-label boosting for image annotation by structural grouping sparsity
Proceedings of the international conference on Multimedia
A new approach to cross-modal multimedia retrieval
Proceedings of the international conference on Multimedia
Context dependent SVMs for interconnected image network annotation
Proceedings of the international conference on Multimedia
Image to text translation by multi-label classification
ICIC'10 Proceedings of the Advanced intelligent computing theories and applications, and 6th international conference on Intelligent computing
Automatic attribute discovery and characterization from noisy web data
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Impact of visual information on text and content based image retrieval
SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Large-scale text to image retrieval using a Bayesian K-neighborhood model
SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Improving automatic image captioning using text summarization techniques
TSD'10 Proceedings of the 13th international conference on Text, speech and dialogue
Seeing people in social context: recognizing people and social relationships
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Discovering multipart appearance models from captioned images
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Image annotation by sparse logistic regression
PCM'10 Proceedings of the Advances in multimedia information processing, and 11th Pacific Rim conference on Multimedia: Part II
MOSIR: image and segment-based retrieval for mobile phones
PCM'10 Proceedings of the Advances in multimedia information processing, and 11th Pacific Rim conference on Multimedia: Part II
Attacking image recognition CAPTCHAS: a naive but effective approach
TrustBus'10 Proceedings of the 7th international conference on Trust, privacy and security in digital business
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Multiple Bernoulli relevance models for image and video annotation
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Variational inference with graph regularization for image annotation
ACM Transactions on Intelligent Systems and Technology (TIST)
Video annotation using hierarchical Dirichlet process mixture model
Expert Systems with Applications: An International Journal
Modeling continuous visual features for semantic image annotation and retrieval
Pattern Recognition Letters
Cross-media entity recognition in nearly parallel visual and textual documents
Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Visual topic model for web image annotation
ICIMCS '10 Proceedings of the Second International Conference on Internet Multimedia Computing and Service
Automatic image semantic interpretation using social action and tagging data
Multimedia Tools and Applications
Multimedia data mining: state of the art and challenges
Multimedia Tools and Applications
Multimodal summarization of complex sentences
Proceedings of the 16th international conference on Intelligent user interfaces
Multiple hypergraph clustering of web images by mining Word2Image correlations
Journal of Computer Science and Technology
Image annotation with concept level feature using PLSA+CCA
MMM'11 Proceedings of the 17th international conference on Advances in multimedia modeling - Volume Part II
Logistic Stick-Breaking Process
The Journal of Machine Learning Research
Context-based support vector machines for interconnected image annotation
ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part I
An HMM-SVM-based automatic image annotation approach
ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part IV
An energy-based model for region-labeling
Computer Vision and Image Understanding
Visual content representation using semantically similar visual words
Expert Systems with Applications: An International Journal
Mining software repositories using topic models
Proceedings of the 33rd International Conference on Software Engineering
Probabilistic image tagging with tags expanded by text-based search
DASFAA'11 Proceedings of the 16th international conference on Database systems for advanced applications - Volume Part I
Semantics extraction from images
Knowledge-driven multimedia information extraction and ontology evolution
Proceedings of the 2010 workshop on Eye gaze in intelligent human machine interaction
Performance analysis of improved affinity propagation algorithm for image semantic annotation
ISNN'11 Proceedings of the 8th international conference on Advances in neural networks - Volume Part II
Composing simple image descriptions using web-scale n-grams
CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Mining partially annotated images
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
The Journal of Machine Learning Research
Typology of mixed-membership models: towards a design method
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
Tagging image by exploring weighted correlation between visual features and tags
WAIM'11 Proceedings of the 12th international conference on Web-age information management
Regionwise classification of building facade images
PIA'11 Proceedings of the 2011 ISPRS conference on Photogrammetric image analysis
Enriching textbooks with images
Proceedings of the 20th ACM international conference on Information and knowledge management
Fusing object detection and region appearance for image-text alignment
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multi-feature pLSA for combining visual features in image annotation
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Low-dimensional and comprehensive color texture description
Computer Vision and Image Understanding
A Multi-Directional Search technique for image annotation propagation
Journal of Visual Communication and Image Representation
Image clustering using multimodal keywords
SAMT'06 Proceedings of the First international conference on Semantic and Digital Media Technologies
Automatic video annotation and retrieval based on bayesian inference
MMM'07 Proceedings of the 13th international conference on Multimedia Modeling - Volume Part I
Feature selection for automatic image annotation
DAGM'06 Proceedings of the 28th conference on Pattern Recognition
A neural network to retrieve images from text queries
ICANN'06 Proceedings of the 16th international conference on Artificial Neural Networks - Volume Part II
Naming of image regions for user-friendly image retrieval
ICIAR'06 Proceedings of the Third international conference on Image Analysis and Recognition - Volume Part I
A discriminative approach for the retrieval of images from text queries
ECML'06 Proceedings of the 17th European conference on Machine Learning
The 2005 PASCAL visual object classes challenge
MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment
Finding uninformative features in binary data
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Automatic image annotation based on topic-based smoothing
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Approximation of linear discriminant analysis for word dependent visual features selection
ACIVS'05 Proceedings of the 7th international conference on Advanced Concepts for Intelligent Vision Systems
Automatic annotation of images from the practitioner perspective
CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Semantic annotation of image groups with self-organizing maps
CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
Learning shapes for image classification and retrieval
CIVR'05 Proceedings of the 4th international conference on Image and Video Retrieval
SIERRA – a superimposed application for enhanced image description and retrieval
ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
A semantic fusion approach between medical images and reports using UMLS
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Waterfall segmentation of complex scenes
ACCV'06 Proceedings of the 7th Asian conference on Computer Vision - Volume Part I
Automatic image annotation based on wordnet and hierarchical ensembles
CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
Automatic image annotation by mining the web
DaWaK'06 Proceedings of the 8th international conference on Data Warehousing and Knowledge Discovery
A correlation approach for automatic image annotation
ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
Face Recognition from Caption-Based Supervision
International Journal of Computer Vision
Studying aesthetics in photographic images using a computational approach
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part III
Estimation of mixture models using Co-EM
ECML'05 Proceedings of the 16th European conference on Machine Learning
Object recognition via local patch labelling
Proceedings of the First international conference on Deterministic and Statistical Methods in Machine Learning
Automatic image annotation using maximum entropy model
IJCNLP'05 Proceedings of the Second international joint conference on Natural Language Processing
Ontology-mediated distributed decision support for breast cancer
AIME'05 Proceedings of the 10th conference on Artificial Intelligence in Medicine
Improving image annotations using wordnet
MIS'05 Proceedings of the 11th international conference on Advances in Multimedia Information Systems
Multimodal indexing based on semantic cohesion for image retrieval
Information Retrieval
Video fingerprinting using Latent Dirichlet Allocation and facial images
Pattern Recognition
Automatic image tagging using community-driven online image databases
AMR'08 Proceedings of the 6th international conference on Adaptive Multimedia Retrieval: identifying, Summarizing, and Recommending Image and Music
A generative model for multi class object recognition and detection
TAINN'05 Proceedings of the 14th Turkish conference on Artificial Intelligence and Neural Networks
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Recognizing objects and scenes in news videos
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Semi-supervised learning for image annotation based on conditional random fields
CIVR'06 Proceedings of the 5th international conference on Image and Video Retrieval
Automatic image segmentation by positioning a seed
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
Automatic generation of funny cartoons diary for everyday mobile life
AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Automatic annotation and retrieval for videos
PSIVT'06 Proceedings of the First Pacific Rim conference on Advances in Image and Video Technology
Leveraging social media for scalable object detection
Pattern Recognition
Interactive multimedia system for distance learning of higher education
Edutainment'06 Proceedings of the First international conference on Technologies for E-Learning and Digital Entertainment
Topic discovery and topic-driven clustering for audit method datasets
ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II
Building semantic hierarchies faithful to image semantics
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Combining image-level and segment-level models for automatic annotation
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
A novel multi-modal integration and propagation model for cross-media information retrieval
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Discovering hierarchical object models from captioned images
Computer Vision and Image Understanding
Learning bilingual lexicons using the visual similarity of labeled web images
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
Halfway through the semantic gap: Prosemantic features for image retrieval
Information Sciences: an International Journal
Learning to summarize web image and text mutually
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
On Taxonomies for Multi-class Image Categorization
International Journal of Computer Vision
Improved Resulted Word Counts Optimizer for Automatic Image Annotation Problem
Fundamenta Informaticae - Advances in Artificial Intelligence and Applications
DRETOM: developer recommendation based on topic models for bug resolution
Proceedings of the 8th International Conference on Predictive Models in Software Engineering
A semantic approach to recommending text advertisements for images
Proceedings of the sixth ACM conference on Recommender systems
Movie keyframe retrieval based on cross-media correlation detection and context model
IEA/AIE'12 Proceedings of the 25th international conference on Industrial Engineering and Other Applications of Applied Intelligent Systems: advanced research in applied artificial intelligence
Apples to oranges: evaluating image annotations from natural language processing systems
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Learning the Relative Importance of Objects from Tagged Images for Retrieval and Cross-Modal Search
International Journal of Computer Vision
Multi-view learning from imperfect tagging
Proceedings of the 20th ACM international conference on Multimedia
Use of adaptive still image descriptors for annotation of video frames
ICIAR'07 Proceedings of the 4th international conference on Image Analysis and Recognition
Mining the web for appearance description
DEXA'07 Proceedings of the 18th international conference on Database and Expert Systems Applications
Proceedings of the 20th ACM international conference on Multimedia
Analyzing social media via event facets
Proceedings of the 20th ACM international conference on Multimedia
Proceedings of the 21st ACM international conference on Information and knowledge management
Hierarchical image annotation using semantic hierarchies
Proceedings of the 21st ACM international conference on Information and knowledge management
A game-theoretic analysis of the ESP game
ACM Transactions on Economics and Computation - Inaugural Issue
Towards concept anchoring for cognitive robots
Intelligent Service Robotics
An efficient two-stage framework for image annotation
Pattern Recognition
Image retrieval with structured object queries using latent ranking SVM
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Discriminative factor alignment across heterogeneous feature space
ECML PKDD'12 Proceedings of the 2012 European conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II
An interactive semi-supervised approach for automatic image annotation
PCM'12 Proceedings of the 13th Pacific-Rim conference on Advances in Multimedia Information Processing
Improving image tags by exploiting web search results
Multimedia Tools and Applications
Bidirectional-isomorphic manifold learning at image semantic understanding & representation
Multimedia Tools and Applications
Tagging photos using users' vocabularies
Neurocomputing
A picture is worth a thousand tags: automatic web based image tag expansion
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Unsupervised language learning for discovered visual concepts
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part IV
Annotation propagation in image databases using similarity graphs
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
"Tell me more": how semantic technologies can help refining internet image search
Proceedings of the International Workshop on Video and Image Ground Truth in Computer Vision Applications
Cross-media semantic representation via bi-directional learning to rank
Proceedings of the 21st ACM international conference on Multimedia
Picture tags and world knowledge: learning tag relations from visual semantic sources
Proceedings of the 21st ACM international conference on Multimedia
Scientific articles recommendation
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
3D Wikipedia: using online text to automatically label and navigate reconstructed geometry
ACM Transactions on Graphics (TOG)
Learning semantic concepts from image database with hybrid generative/discriminative approach
Engineering Applications of Artificial Intelligence
Applying a lightweight iterative merging chinese segmentation in web image annotation
MLDM'13 Proceedings of the 9th international conference on Machine Learning and Data Mining in Pattern Recognition
Explicit context-aware kernel map learning for image annotation
ICVS'13 Proceedings of the 9th international conference on Computer Vision Systems
Neighborhood rough sets based multi-label classification for automatic image annotation
International Journal of Approximate Reasoning
A User-friendly Image-Text Fusion CAPTCHA for Secure Web Services
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Manifold alignment preserving global geometry
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Learning group-based dictionaries for discriminative image representation
Pattern Recognition
Support vector description of clusters for content-based image annotation
Pattern Recognition
Cross domain recommendation based on multi-type media fusion
Neurocomputing
Framing image description as a ranking task: data, models and evaluation metrics
Journal of Artificial Intelligence Research
Clustering results of image searches by annotations and visual features
Telematics and Informatics
Probabilistic Joint Image Segmentation and Labeling by Figure-Ground Composition
International Journal of Computer Vision
Hi-index | 0.00 |
We present a new approach for modeling multi-modal data sets, focusing on the specific case of segmented images with associated text. Learning the joint distribution of image regions and words has many applications. We consider in detail predicting words associated with whole images (auto-annotation) and corresponding to particular image regions (region naming). Auto-annotation might help organize and access large collections of images. Region naming is a model of object recognition as a process of translating image regions to words, much as one might translate from one language to another. Learning the relationships between image regions and semantic correlates (words) is an interesting example of multi-modal data mining, particularly because it is typically hard to apply data mining techniques to collections of images. We develop a number of models for the joint distribution of image regions and words, including several which explicitly learn the correspondence between regions and words. We study multi-modal and correspondence extensions to Hofmann's hierarchical clustering/aspect model, a translation model adapted from statistical machine translation (Brown et al.), and a multi-modal extension to mixture of latent Dirichlet allocation (MoM-LDA). All models are assessed using a large collection of annotated images of real scenes. We study in depth the difficult problem of measuring performance. For the annotation task, we look at prediction performance on held out data. We present three alternative measures, oriented toward different types of task. Measuring the performance of correspondence methods is harder, because one must determine whether a word has been placed on the right region of an image. We can use annotation performance as a proxy measure, but accurate measurement requires hand labeled data, and thus must occur on a smaller scale. We show results using both an annotation proxy, and manually labeled data.