LabelMe: A Database and Web-Based Tool for Image Annotation

Authors:
Bryan C. Russell;Antonio Torralba;Kevin P. Murphy;William T. Freeman
Affiliations:
Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, USA 02139;Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, USA 02139;Departments of computer science and statistics, University of British Columbia, Vancouver, Canada V6T 1Z4;Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, USA 02139
Venue:
International Journal of Computer Vision
Year:
2008

Citing 26
Cited 214

Color indexing

International Journal of Computer Vision
The nature of statistical learning theory

The nature of statistical learning theory
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

International Journal of Computer Vision
Contextual Priming for Object Detection

International Journal of Computer Vision
A bootstrapping algorithm for learning linear models of object classes

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Consistent Line Clusters for Building Recognition in CBIR

ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 3 - Volume 3
A Bayesian Approach to Unsupervised One-Shot Learning of Object Categories

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Introduction to the special issue on word sense disambiguation: the state of the art

Computational Linguistics - Special issue on word sense disambiguation
Labeling images with a computer game

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Learning to Detect Objects in Images via a Sparse, Part-Based Representation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Learning Generative Visual Models from Few Training Examples: An Incremental Bayesian Approach Tested on 101 Object Categories

CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
Discovering Objects and their Localization in Images

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Modeling Scenes with Local Descriptors and Latent Aspects

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Learning Hierarchical Models of Scenes, Objects, and Parts

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Learning Object Categories from Google"s Image Search

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Object Categorization by Learned Universal Visual Dictionary

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Generic Object Recognition with Boosting

IEEE Transactions on Pattern Analysis and Machine Intelligence
One-Shot Learning of Object Categories

IEEE Transactions on Pattern Analysis and Machine Intelligence
Peekaboom: a game for locating objects in images

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Photo tourism: exploring photo collections in 3D

ACM SIGGRAPH 2006 Papers
Putting Objects in Perspective

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Animals on the Web

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Using Multiple Segmentations to Discover Objects and their Extent in Image Collections

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Learning methods for generic object recognition with invariance to pose and lighting

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Sharing features: efficient boosting procedures for multiclass object detection

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
A boundary-fragment-model for object detection

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II

Editorial: From sensors to human spatial concepts

Robotics and Autonomous Systems
Locating key views for image indexing of spaces

MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Towards Scalable Dataset Construction: An Active Learning Approach

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Concept detection and keyframe extraction using a visual thesaurus

Multimedia Tools and Applications
Web-Scale Image Annotation

PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Large Scale Concept Detection in Video Using a Region Thesaurus

MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
A human computer integrated approach for content based image retrieval

ICCOMP'08 Proceedings of the 12th WSEAS international conference on Computers
A benchmark for 3D mesh segmentation

ACM SIGGRAPH 2009 papers
Segmentation of Natural and Man-Made Structures by Independent Component Analysis

ICA '09 Proceedings of the 8th International Conference on Independent Component Analysis and Signal Separation
A 3D hypermedia with biomedical stereoscopic images: from creation to exploration in virtual reality

Proceedings of the 20th ACM conference on Hypertext and hypermedia
Key views for visualizing large spaces

Journal of Visual Communication and Image Representation
A sketch-based interface for photo pop-up

Proceedings of the 6th Eurographics Symposium on Sketch-Based Interfaces and Modeling
Games for sketch data collection

Proceedings of the 6th Eurographics Symposium on Sketch-Based Interfaces and Modeling
Concept-Based Video Retrieval

Foundations and Trends in Information Retrieval
Regional category parsing in undirected graphical models

Pattern Recognition Letters
Fusion of Multiple Expert Annotations and Overall Score Selection for Medical Image Diagnosis

SCIA '09 Proceedings of the 16th Scandinavian Conference on Image Analysis
Towards automatic image region annotation: image region textual coreference resolution

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Collaborative text-annotation resource for disease-centered relation extraction from biomedical text

Journal of Biomedical Informatics
A new gaussian mixture conditional random field model for indoor image labeling

IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Scene classification using pLSA with visterm spatial location

IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Semantics-preserving bag-of-words models for efficient image annotation

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Leveraging large-scale weakly-tagged images to train inter-related classifiers for multi-label annotation

LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Distance metric learning from uncertain side information with application to automated photo tagging

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Learning to generate novel views of objects for class recognition

Computer Vision and Image Understanding
Shape-from-recognition: Recognition enables meta-data transfer

Computer Vision and Image Understanding
Image annotation within the context of personal photo collections using hierarchical event and scene models

IEEE Transactions on Multimedia - Special issue on integration of context and content
An edge-weighted centroidal Voronoi tessellation model for image segmentation

IEEE Transactions on Image Processing
Semi-automatically labeling objects in images

IEEE Transactions on Image Processing
Patch Growing: Object segmentation using spatial coherence of local patches

Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
A first glimpse of cryptography's Holy Grail

Communications of the ACM
Using the forest to see the trees: exploiting context for visual object detection and localization

Communications of the ACM
Patch Growing: Object segmentation using spatial coherence of local patches

Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
A new localized superpixel Markov random field for image segmentation

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Active tagging for image indexing

ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
GAT: a Graphical Annotation Tool for semantic regions

Multimedia Tools and Applications
Gathering and ranking photos of named entities with high precision, high recall, and diversity

Proceedings of the third ACM international conference on Web search and data mining
Cognitive vision for efficient scene processing and object categorization in highly cluttered environments

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Building a distributed robot garden

IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
High-level concept annotation using ontology and probabilistic inference

Proceedings of the First International Conference on Internet Multimedia Computing and Service
Automatic online labeling images via co-active-learning

Proceedings of the First International Conference on Internet Multimedia Computing and Service
The participation payoff: challenges and opportunities for multimedia access in networked communities

Proceedings of the international conference on Multimedia information retrieval
Object-based tag propagation for semi-automatic annotation of images

Proceedings of the international conference on Multimedia information retrieval
The Pascal Visual Object Classes (VOC) Challenge

International Journal of Computer Vision
The segmented and annotated IAPR TC-12 benchmark

Computer Vision and Image Understanding
Structured max-margin learning for multi-label image annotation

Proceedings of the ACM International Conference on Image and Video Retrieval
Object Recognition in 3D Point Clouds Using Web Data and Domain Adaptation

International Journal of Robotics Research
Image search by concept map

Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Nonnegative shared subspace learning and its application to social media retrieval

Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Horizon estimation: perceptual and computational experiments

Proceedings of the 7th Symposium on Applied Perception in Graphics and Visualization
Semantics-preserving bag-of-words models and applications

IEEE Transactions on Image Processing
TurKit: human computation algorithms on mechanical turk

UIST '10 Proceedings of the 23nd annual ACM symposium on User interface software and technology
Context-based search for 3D models

ACM SIGGRAPH Asia 2010 papers
A probabilistic topic-connection model for automatic image annotation

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Indoor robot gardening: design and implementation

Intelligent Service Robotics
Increasing interactivity in street view web navigation systems

Proceedings of the international conference on Multimedia
Two-stage localization for image labeling

PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Extracting structures in image collections for object recognition

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Semantic label sharing for learning with many categories

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
From region based image representation to object discovery and recognition

SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Detecting ground shadows in outdoor consumer photographs

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Learning what and how of contextual models for scene labeling

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Localizing objects while learning their appearance

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Efficiently scaling up video annotation with crowdsourced marketplaces

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Semantic segmentation of urban scenes using dense depth maps

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Non-local characterization of scenery images: statistics, 3D reasoning, and a generative model

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Superparsing: scalable nonparametric image parsing with superpixels

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
A coarse-to-fine taxonomy of constellations for fast multi-class object detection

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Thinking inside the box: using appearance models and context based on room geometry

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Bayesian belief network learning algorithms for modeling contextual relationships in natural imagery: a comparative study

Artificial Intelligence Review
Human posture recognition for intelligent vehicles

Journal of Real-Time Image Processing
Relevance of a feed-forward model of visual attention for goal-oriented and free-viewing tasks

IEEE Transactions on Image Processing
Distance metric learning from uncertain side information for automated photo tagging

ACM Transactions on Intelligent Systems and Technology (TIST)
Learning an interactive segmentation system

Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing
Random Fourier approximations for skewed multiplicative histogram kernels

Proceedings of the 32nd DAGM conference on Pattern recognition
Large scale visual classification via learned dictionaries and sparse representation

AICI'10 Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I
Mining social images with distance metric learning for automated image tagging

Proceedings of the fourth ACM international conference on Web search and data mining
Automatic image semantic interpretation using social action and tagging data

Multimedia Tools and Applications
Context modeling in computer vision: techniques, implications, and applications

Multimedia Tools and Applications
Characterizing structural relationships in scenes using graph kernels

ACM SIGGRAPH 2011 papers
PhotoCity: training experts at large-scale image acquisition through a competitive game

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Region-based annotation of digital photographs

CCIW'11 Proceedings of the Third international conference on Computational color imaging
Fast object detection using steiner tree

Machine Graphics & Vision International Journal
Social negative bootstrapping for visual categorization

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Consumer video understanding: a benchmark database and an evaluation of human and machine performance

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Active learning through notes data in Flickr: an effortless training data acquisition approach for object localization

Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Multipedia: enriching DBpedia with multimedia information

Proceedings of the sixth international conference on Knowledge capture
A survey of semantic image and video annotation tools

Knowledge-driven multimedia information extraction and ontology evolution
Mining weakly labeled web facial images for search-based face annotation

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Improving the usability of hierarchical representations for interactively labeling large image data sets

HCII'11 Proceedings of the 14th international conference on Human-computer interaction: design and development approaches - Volume Part I
Not far away from home: a relational distance-based approach to understanding images of houses

ILP'10 Proceedings of the 20th international conference on Inductive logic programming
A semantic web annotation tool for a web-based audio sequencer

ICWE'11 Proceedings of the 11th international conference on Web engineering
Toward coherent object detection and scene layout understanding

Image and Vision Computing
Combining visual and textual modalities for multimedia ontology matching

SAMT'10 Proceedings of the 5th international conference on Semantic and digital media technologies
A cartography of spatial relationships in a symbolic image database

CAIP'11 Proceedings of the 14th international conference on Computer analysis of images and patterns - Volume Part I
Recursive Compositional Models for Vision: Description and Review of Recent Work

Journal of Mathematical Imaging and Vision
Two-probabilistic latent semantic model for image annotation and retrieval

ACCV'10 Proceedings of the 2010 international conference on Computer vision - Volume Part I
Image retrieval with semantic sketches

INTERACT'11 Proceedings of the 13th IFIP TC 13 international conference on Human-computer interaction - Volume Part I
Multiple region categorization for scenery images

ICIAP'11 Proceedings of the 16th international conference on Image analysis and processing: Part I
Generating resource profiles by exploiting the context of social annotations

ISWC'11 Proceedings of the 10th international conference on The semantic web - Volume Part I
Where do people draw lines?

Communications of the ACM
Finding images of difficult entities in the long tail

Proceedings of the 20th ACM international conference on Information and knowledge management
Purposive hidden-object-game: embedding human computation in popular game

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multiclass object detection by combining local appearances and context

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multi-feature pLSA for combining visual features in image annotation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Exploiting the entire feature space with sparsity for automatic image annotation

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Retrieval-based face annotation by weak label regularized local coordinate coding

MM '11 Proceedings of the 19th ACM international conference on Multimedia
Scene reconstruction and visualization from internet photo collections

Scene reconstruction and visualization from internet photo collections
Towards a multimodality ontology image retrieval

IVIC'11 Proceedings of the Second international conference on Visual informatics: sustaining research and innovations - Volume Part II
IAIR-CarPed: A psychophysically annotated dataset with fine-grained and layered semantic labels for object recognition

Pattern Recognition Letters
Sub-sampling: Real-time vision for micro air vehicles

Robotics and Autonomous Systems
The impact of multifaceted tagging on learning tag relations and search

ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
Mid-Level concept learning with visual contextual ontologies and probabilistic inference for image annotation

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Generating visual concept network from large-scale weakly-tagged images

MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Cut and paste: ambient interaction using annotated cut-outs

AmI'11 Proceedings of the Second international conference on Ambient Intelligence
How do humans sketch objects?

ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
Understanding and improving the realism of image composites

ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
Multi-instance methods for partially supervised image segmentation

PSL'11 Proceedings of the First IAPR TC3 conference on Partially Supervised Learning
Identifying objects in images from analyzing the users' gaze movements for provided tags

MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Estimating the Natural Illumination Conditions from a Single Outdoor Image

International Journal of Computer Vision
Motion chain: a webcam game for crowdsourcing gesture collection

CHI '12 Extended Abstracts on Human Factors in Computing Systems
Micro perceptual human computation for visual tasks

ACM Transactions on Graphics (TOG)
Human action recognition and localization in video at contextual level

ICDEM'10 Proceedings of the Second international conference on Data Engineering and Management
NEOCR: a configurable dataset for natural image text recognition

CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
First International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications

Proceedings of the International Working Conference on Advanced Visual Interfaces
Multiscale annotation of still images with GAT

Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications
Efficient annotation of image data sets for computer vision applications

Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications
CoVidA: pen-based collaborative video annotation

Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications
Assistive tagging: A survey of multimedia tagging with human-computer joint exploration

ACM Computing Surveys (CSUR)
Fast Structured Prediction Using Large Margin Sigmoid Belief Networks

International Journal of Computer Vision
Enhancing image retrieval by an exploration-exploitation approach

MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
On the consistency and features of image similarity

Proceedings of the 4th Information Interaction in Context Symposium
A hybrid semi-supervised topic model

IScIDE'11 Proceedings of the Second Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering
An analytical model for generalized ESP games

Knowledge-Based Systems
Orientation-aware scene understanding for mobile cameras

Proceedings of the 2012 ACM Conference on Ubiquitous Computing
Performance evaluation procedure for vision based object feature extraction algorithms

Proceedings of the 10th Performance Metrics for Intelligent Systems Workshop
Sketch-editing games: human-machine communication, game theory and applications

Proceedings of the 25th annual ACM symposium on User interface software and technology
Object Detection using Geometrical Context Feedback

International Journal of Computer Vision
Weakly Supervised Localization and Learning with Generic Knowledge

International Journal of Computer Vision
User-Centric Learning and Evaluation of Interactive Segmentation Systems

International Journal of Computer Vision
Unsupervised object discovery via self-organisation

Pattern Recognition Letters
Pushing the limits of mechanical turk: qualifying the crowd for video geo-location

Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia
On shape and the computability of emotions

Proceedings of the 20th ACM international conference on Multimedia
Search web images using objects, backgrounds and conditions

Proceedings of the 20th ACM international conference on Multimedia
Towards indexing representative images on the web

Proceedings of the 20th ACM international conference on Multimedia
Making use of eye tracking information in image collection creation and region annotation

Proceedings of the 20th ACM international conference on Multimedia
Interactive tool for image annotation using a semi-supervised and hierarchical approach

Computer Standards & Interfaces
A unified learning framework for auto face annotation by mining web facial images

Proceedings of the 21st ACM international conference on Information and knowledge management
Undoing the damage of dataset bias

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Annotation propagation in large image databases via dense image correspondence

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Attributes for classifier feedback

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Constrained semi-supervised learning using attributes and comparative attributes

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Abnormal object detection by canonical scene-based contextual model

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Co-inference for multi-modal scene analysis

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Simultaneous image classification and annotation via biased random walk on tri-relational graph

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
On performance analysis of optical flow algorithms

Proceedings of the 15th international conference on Theoretical Foundations of Computer Vision: outdoor and large-scale real-world scene analysis
Technical Section: Automatic color realism enhancement for computer generated images

Computers and Graphics
Visual saliency detection with center shift

Neurocomputing
A novel image annotation feedback model based on internet-search

WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Image memorability and visual inception

SIGGRAPH Asia 2012 Technical Briefs
A multiple object geometric deformable model for image segmentation

Computer Vision and Image Understanding
Web-enhanced object category learning for domestic robots

Intelligent Service Robotics
Multimedia ontology matching by using visual and textual modalities

Multimedia Tools and Applications
A spike-timing-based integrated model for pattern recognition

Neural Computation
Superparsing

International Journal of Computer Vision
Efficiently Scaling up Crowdsourced Video Annotation

International Journal of Computer Vision
Improving image tags by exploiting web search results

Multimedia Tools and Applications
Learning saliency-based visual attention: A review

Signal Processing
Adaptive object detection by implicit sub-class sharing features

Signal Processing
Popular research topics in multimedia

Scientometrics
Multifeature analysis and semantic context learning for image classification

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Learning realistic facial expressions from web images

Pattern Recognition
Sketch2Scene: sketch-based co-retrieval and co-placement of 3D models

ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings
OpenSurfaces: a richly annotated catalog of surface appearance

ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings
Efficient ad-hoc search for personalized PageRank

Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Generating annotations for how-to videos using crowdsourcing

CHI '13 Extended Abstracts on Human Factors in Computing Systems
Combining crowdsourcing and google street view to identify street-level accessibility problems

Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Online web-data-driven segmentation of selected moving objects in videos

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Finding happiest moments in a social context

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Hierarchical space tiling for scene modeling

ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Efficient development of user-defined image recognition systems

ACCV'12 Proceedings of the 11th international conference on Computer Vision - Volume Part I
Learning to name faces: a multimodal learning scheme for search-based face annotation

Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Annotation propagation in image databases using similarity graphs

ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Variational learning of a Dirichlet process of generalized Dirichlet distributions for simultaneous clustering and feature selection

Pattern Recognition
Do you need experts in the crowd?: a case study in image annotation for marine biology

Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Fish4label: accomplishing an expert task without expert knowledge

Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Approximate nearest neighbor search to support manual image annotation of large domain-specific datasets

Proceedings of the International Workshop on Video and Image Ground Truth in Computer Vision Applications
A crowdsourcing approach to support video annotation

Proceedings of the International Workshop on Video and Image Ground Truth in Computer Vision Applications
Order preserving hashing for approximate nearest neighbor search

Proceedings of the 21st ACM international conference on Multimedia
Human vs machine: establishing a human baseline for multimodal location estimation

Proceedings of the 21st ACM international conference on Multimedia
Static saliency vs. dynamic saliency: a comparative study

Proceedings of the 21st ACM international conference on Multimedia
πLDA: document clustering with selective structural constraints

Proceedings of the 21st ACM international conference on Multimedia
MedLDA: maximum margin supervised topic models

The Journal of Machine Learning Research
Unsupervised feature construction for improving data representation and semantics

Journal of Intelligent Information Systems
Crowdsourced object segmentation with a game

Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia
Indoor scene recognition by a mobile robot through adaptive object detection

Robotics and Autonomous Systems
A boosting approach for the simultaneous detection and segmentation of generic objects

Pattern Recognition Letters
3D Wikipedia: using online text to automatically label and navigate reconstructed geometry

ACM Transactions on Graphics (TOG)
SLEDGE: Sequential Labeling of Image Edges for Boundary Detection

International Journal of Computer Vision
Object class detection: A survey

ACM Computing Surveys (CSUR)
Feasibility of identifying eating moments from first-person images leveraging human computation

Proceedings of the 4th International SenseCam & Pervasive Imaging Conference
Using objective ground-truth labels created by multiple annotators for improved video classification: A comparative study

Computer Vision and Image Understanding
A lossy counting based approach for learning on streams of graphs on a budget

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Multi-view embedding learning for incompletely labeled data

IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Tagging-by-search: automatic image region labeling using gaze information obtained from image search

Proceedings of the 19th international conference on Intelligent User Interfaces
Constraining image object search by multi-scale spectral residue analysis

Pattern Recognition Letters
Special Section on CAD/Graphics 2013: Detecting soft shadows in a single outdoor image: From local edge-based models to global constraints

Computers and Graphics
Top-Down Saliency Detection via Contextual Pooling

Journal of Signal Processing Systems
Efficient semantic image segmentation with multi-class ranking prior

Computer Vision and Image Understanding
Learning semantic representations of objects and their parts

Machine Learning
From machine learning to machine reasoning

Machine Learning
C2TAM: A Cloud framework for cooperative tracking and mapping

Robotics and Autonomous Systems
Non-Gaussian Data Clustering via Expectation Propagation Learning of Finite Dirichlet Mixture Models and Applications

Neural Processing Letters
The Shape Boltzmann Machine: A Strong Model of Object Shape

International Journal of Computer Vision
A jointly distributed semi-supervised topic model

Neurocomputing

Quantified Score

Hi-index	0.05

Visualization

Abstract

We seek to build a large collection of images with ground truth labels to be used for object detection and recognition research. Such data is useful for supervised learning and quantitative evaluation. To achieve this, we developed a web-based tool that allows easy image annotation and instant sharing of such annotations. Using this annotation tool, we have collected a large dataset that spans many object categories, often containing multiple instances over a wide variety of images. We quantify the contents of the dataset and compare against existing state of the art datasets used for object recognition and detection. Also, we show how to extend the dataset to automatically enhance object labels with WordNet, discover object parts, recover a depth ordering of objects in a scene, and increase the number of labels using minimal user supervision and images from the web.