International Journal of Computer Vision
The nature of statistical learning theory
The nature of statistical learning theory
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope
International Journal of Computer Vision
Contextual Priming for Object Detection
International Journal of Computer Vision
A bootstrapping algorithm for learning linear models of object classes
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Consistent Line Clusters for Building Recognition in CBIR
ICPR '02 Proceedings of the 16 th International Conference on Pattern Recognition (ICPR'02) Volume 3 - Volume 3
A Bayesian Approach to Unsupervised One-Shot Learning of Object Categories
ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Introduction to the special issue on word sense disambiguation: the state of the art
Computational Linguistics - Special issue on word sense disambiguation
Labeling images with a computer game
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Learning to Detect Objects in Images via a Sparse, Part-Based Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
CVPRW '04 Proceedings of the 2004 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'04) Volume 12 - Volume 12
Discovering Objects and their Localization in Images
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Modeling Scenes with Local Descriptors and Latent Aspects
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
Learning Hierarchical Models of Scenes, Objects, and Parts
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Learning Object Categories from Google"s Image Search
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Object Categorization by Learned Universal Visual Dictionary
ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision - Volume 2
Generic Object Recognition with Boosting
IEEE Transactions on Pattern Analysis and Machine Intelligence
One-Shot Learning of Object Categories
IEEE Transactions on Pattern Analysis and Machine Intelligence
Peekaboom: a game for locating objects in images
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Photo tourism: exploring photo collections in 3D
ACM SIGGRAPH 2006 Papers
Putting Objects in Perspective
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Using Multiple Segmentations to Discover Objects and their Extent in Image Collections
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Learning methods for generic object recognition with invariance to pose and lighting
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Sharing features: efficient boosting procedures for multiclass object detection
CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
A boundary-fragment-model for object detection
ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part II
Editorial: From sensors to human spatial concepts
Robotics and Autonomous Systems
Locating key views for image indexing of spaces
MIR '08 Proceedings of the 1st ACM international conference on Multimedia information retrieval
Towards Scalable Dataset Construction: An Active Learning Approach
ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Concept detection and keyframe extraction using a visual thesaurus
Multimedia Tools and Applications
PCM '08 Proceedings of the 9th Pacific Rim Conference on Multimedia: Advances in Multimedia Information Processing
Large Scale Concept Detection in Video Using a Region Thesaurus
MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
A human computer integrated approach for content based image retrieval
ICCOMP'08 Proceedings of the 12th WSEAS international conference on Computers
A benchmark for 3D mesh segmentation
ACM SIGGRAPH 2009 papers
Segmentation of Natural and Man-Made Structures by Independent Component Analysis
ICA '09 Proceedings of the 8th International Conference on Independent Component Analysis and Signal Separation
A 3D hypermedia with biomedical stereoscopic images: from creation to exploration in virtual reality
Proceedings of the 20th ACM conference on Hypertext and hypermedia
Key views for visualizing large spaces
Journal of Visual Communication and Image Representation
A sketch-based interface for photo pop-up
Proceedings of the 6th Eurographics Symposium on Sketch-Based Interfaces and Modeling
Games for sketch data collection
Proceedings of the 6th Eurographics Symposium on Sketch-Based Interfaces and Modeling
Foundations and Trends in Information Retrieval
Regional category parsing in undirected graphical models
Pattern Recognition Letters
Fusion of Multiple Expert Annotations and Overall Score Selection for Medical Image Diagnosis
SCIA '09 Proceedings of the 16th Scandinavian Conference on Image Analysis
Towards automatic image region annotation: image region textual coreference resolution
NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Collaborative text-annotation resource for disease-centered relation extraction from biomedical text
Journal of Biomedical Informatics
A new gaussian mixture conditional random field model for indoor image labeling
IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Scene classification using pLSA with visterm spatial location
IMCE '09 Proceedings of the 1st international workshop on Interactive multimedia for consumer electronics
Semantics-preserving bag-of-words models for efficient image annotation
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
LS-MMRM '09 Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining
Distance metric learning from uncertain side information with application to automated photo tagging
MM '09 Proceedings of the 17th ACM international conference on Multimedia
Learning to generate novel views of objects for class recognition
Computer Vision and Image Understanding
Shape-from-recognition: Recognition enables meta-data transfer
Computer Vision and Image Understanding
IEEE Transactions on Multimedia - Special issue on integration of context and content
An edge-weighted centroidal Voronoi tessellation model for image segmentation
IEEE Transactions on Image Processing
Semi-automatically labeling objects in images
IEEE Transactions on Image Processing
Patch Growing: Object segmentation using spatial coherence of local patches
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
A first glimpse of cryptography's Holy Grail
Communications of the ACM
Using the forest to see the trees: exploiting context for visual object detection and localization
Communications of the ACM
Patch Growing: Object segmentation using spatial coherence of local patches
Proceedings of the 2009 conference on Artificial Intelligence Research and Development: Proceedings of the 12th International Conference of the Catalan Association for Artificial Intelligence
A new localized superpixel Markov random field for image segmentation
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
Active tagging for image indexing
ICME'09 Proceedings of the 2009 IEEE international conference on Multimedia and Expo
GAT: a Graphical Annotation Tool for semantic regions
Multimedia Tools and Applications
Gathering and ranking photos of named entities with high precision, high recall, and diversity
Proceedings of the third ACM international conference on Web search and data mining
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
Building a distributed robot garden
IROS'09 Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems
High-level concept annotation using ontology and probabilistic inference
Proceedings of the First International Conference on Internet Multimedia Computing and Service
Automatic online labeling images via co-active-learning
Proceedings of the First International Conference on Internet Multimedia Computing and Service
Proceedings of the international conference on Multimedia information retrieval
Object-based tag propagation for semi-automatic annotation of images
Proceedings of the international conference on Multimedia information retrieval
The Pascal Visual Object Classes (VOC) Challenge
International Journal of Computer Vision
The segmented and annotated IAPR TC-12 benchmark
Computer Vision and Image Understanding
Structured max-margin learning for multi-label image annotation
Proceedings of the ACM International Conference on Image and Video Retrieval
Object Recognition in 3D Point Clouds Using Web Data and Domain Adaptation
International Journal of Robotics Research
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Nonnegative shared subspace learning and its application to social media retrieval
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Horizon estimation: perceptual and computational experiments
Proceedings of the 7th Symposium on Applied Perception in Graphics and Visualization
Semantics-preserving bag-of-words models and applications
IEEE Transactions on Image Processing
TurKit: human computation algorithms on mechanical turk
UIST '10 Proceedings of the 23nd annual ACM symposium on User interface software and technology
Context-based search for 3D models
ACM SIGGRAPH Asia 2010 papers
A probabilistic topic-connection model for automatic image annotation
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Indoor robot gardening: design and implementation
Intelligent Service Robotics
Increasing interactivity in street view web navigation systems
Proceedings of the international conference on Multimedia
Two-stage localization for image labeling
PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
Extracting structures in image collections for object recognition
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Semantic label sharing for learning with many categories
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
From region based image representation to object discovery and recognition
SSPR&SPR'10 Proceedings of the 2010 joint IAPR international conference on Structural, syntactic, and statistical pattern recognition
Detecting ground shadows in outdoor consumer photographs
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part II
Learning what and how of contextual models for scene labeling
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Localizing objects while learning their appearance
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Efficiently scaling up video annotation with crowdsourced marketplaces
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Semantic segmentation of urban scenes using dense depth maps
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Non-local characterization of scenery images: statistics, 3D reasoning, and a generative model
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Superparsing: scalable nonparametric image parsing with superpixels
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
A coarse-to-fine taxonomy of constellations for fast multi-class object detection
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Thinking inside the box: using appearance models and context based on room geometry
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part VI
Artificial Intelligence Review
Human posture recognition for intelligent vehicles
Journal of Real-Time Image Processing
Relevance of a feed-forward model of visual attention for goal-oriented and free-viewing tasks
IEEE Transactions on Image Processing
Distance metric learning from uncertain side information for automated photo tagging
ACM Transactions on Intelligent Systems and Technology (TIST)
Learning an interactive segmentation system
Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing
Random Fourier approximations for skewed multiplicative histogram kernels
Proceedings of the 32nd DAGM conference on Pattern recognition
Large scale visual classification via learned dictionaries and sparse representation
AICI'10 Proceedings of the 2010 international conference on Artificial intelligence and computational intelligence: Part I
Mining social images with distance metric learning for automated image tagging
Proceedings of the fourth ACM international conference on Web search and data mining
Automatic image semantic interpretation using social action and tagging data
Multimedia Tools and Applications
Context modeling in computer vision: techniques, implications, and applications
Multimedia Tools and Applications
Characterizing structural relationships in scenes using graph kernels
ACM SIGGRAPH 2011 papers
PhotoCity: training experts at large-scale image acquisition through a competitive game
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Region-based annotation of digital photographs
CCIW'11 Proceedings of the Third international conference on Computational color imaging
Fast object detection using steiner tree
Machine Graphics & Vision International Journal
Social negative bootstrapping for visual categorization
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Proceedings of the 1st ACM International Conference on Multimedia Retrieval
Multipedia: enriching DBpedia with multimedia information
Proceedings of the sixth international conference on Knowledge capture
A survey of semantic image and video annotation tools
Knowledge-driven multimedia information extraction and ontology evolution
Mining weakly labeled web facial images for search-based face annotation
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
HCII'11 Proceedings of the 14th international conference on Human-computer interaction: design and development approaches - Volume Part I
Not far away from home: a relational distance-based approach to understanding images of houses
ILP'10 Proceedings of the 20th international conference on Inductive logic programming
A semantic web annotation tool for a web-based audio sequencer
ICWE'11 Proceedings of the 11th international conference on Web engineering
Toward coherent object detection and scene layout understanding
Image and Vision Computing
Combining visual and textual modalities for multimedia ontology matching
SAMT'10 Proceedings of the 5th international conference on Semantic and digital media technologies
A cartography of spatial relationships in a symbolic image database
CAIP'11 Proceedings of the 14th international conference on Computer analysis of images and patterns - Volume Part I
Recursive Compositional Models for Vision: Description and Review of Recent Work
Journal of Mathematical Imaging and Vision
Two-probabilistic latent semantic model for image annotation and retrieval
ACCV'10 Proceedings of the 2010 international conference on Computer vision - Volume Part I
Image retrieval with semantic sketches
INTERACT'11 Proceedings of the 13th IFIP TC 13 international conference on Human-computer interaction - Volume Part I
Multiple region categorization for scenery images
ICIAP'11 Proceedings of the 16th international conference on Image analysis and processing: Part I
Generating resource profiles by exploiting the context of social annotations
ISWC'11 Proceedings of the 10th international conference on The semantic web - Volume Part I
Communications of the ACM
Finding images of difficult entities in the long tail
Proceedings of the 20th ACM international conference on Information and knowledge management
Purposive hidden-object-game: embedding human computation in popular game
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multiclass object detection by combining local appearances and context
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Multi-feature pLSA for combining visual features in image annotation
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Exploiting the entire feature space with sparsity for automatic image annotation
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Retrieval-based face annotation by weak label regularized local coordinate coding
MM '11 Proceedings of the 19th ACM international conference on Multimedia
Scene reconstruction and visualization from internet photo collections
Scene reconstruction and visualization from internet photo collections
Towards a multimodality ontology image retrieval
IVIC'11 Proceedings of the Second international conference on Visual informatics: sustaining research and innovations - Volume Part II
Sub-sampling: Real-time vision for micro air vehicles
Robotics and Autonomous Systems
The impact of multifaceted tagging on learning tag relations and search
ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Generating visual concept network from large-scale weakly-tagged images
MMM'10 Proceedings of the 16th international conference on Advances in Multimedia Modeling
Cut and paste: ambient interaction using annotated cut-outs
AmI'11 Proceedings of the Second international conference on Ambient Intelligence
ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
Understanding and improving the realism of image composites
ACM Transactions on Graphics (TOG) - SIGGRAPH 2012 Conference Proceedings
Multi-instance methods for partially supervised image segmentation
PSL'11 Proceedings of the First IAPR TC3 conference on Partially Supervised Learning
Identifying objects in images from analyzing the users' gaze movements for provided tags
MMM'12 Proceedings of the 18th international conference on Advances in Multimedia Modeling
Estimating the Natural Illumination Conditions from a Single Outdoor Image
International Journal of Computer Vision
Motion chain: a webcam game for crowdsourcing gesture collection
CHI '12 Extended Abstracts on Human Factors in Computing Systems
Micro perceptual human computation for visual tasks
ACM Transactions on Graphics (TOG)
Human action recognition and localization in video at contextual level
ICDEM'10 Proceedings of the Second international conference on Data Engineering and Management
NEOCR: a configurable dataset for natural image text recognition
CBDAR'11 Proceedings of the 4th international conference on Camera-Based Document Analysis and Recognition
Proceedings of the International Working Conference on Advanced Visual Interfaces
Multiscale annotation of still images with GAT
Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications
Efficient annotation of image data sets for computer vision applications
Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications
CoVidA: pen-based collaborative video annotation
Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications
Assistive tagging: A survey of multimedia tagging with human-computer joint exploration
ACM Computing Surveys (CSUR)
Fast Structured Prediction Using Large Margin Sigmoid Belief Networks
International Journal of Computer Vision
Enhancing image retrieval by an exploration-exploitation approach
MLDM'12 Proceedings of the 8th international conference on Machine Learning and Data Mining in Pattern Recognition
On the consistency and features of image similarity
Proceedings of the 4th Information Interaction in Context Symposium
A hybrid semi-supervised topic model
IScIDE'11 Proceedings of the Second Sino-foreign-interchange conference on Intelligent Science and Intelligent Data Engineering
An analytical model for generalized ESP games
Knowledge-Based Systems
Orientation-aware scene understanding for mobile cameras
Proceedings of the 2012 ACM Conference on Ubiquitous Computing
Performance evaluation procedure for vision based object feature extraction algorithms
Proceedings of the 10th Performance Metrics for Intelligent Systems Workshop
Sketch-editing games: human-machine communication, game theory and applications
Proceedings of the 25th annual ACM symposium on User interface software and technology
Object Detection using Geometrical Context Feedback
International Journal of Computer Vision
Weakly Supervised Localization and Learning with Generic Knowledge
International Journal of Computer Vision
User-Centric Learning and Evaluation of Interactive Segmentation Systems
International Journal of Computer Vision
Unsupervised object discovery via self-organisation
Pattern Recognition Letters
Pushing the limits of mechanical turk: qualifying the crowd for video geo-location
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia
On shape and the computability of emotions
Proceedings of the 20th ACM international conference on Multimedia
Search web images using objects, backgrounds and conditions
Proceedings of the 20th ACM international conference on Multimedia
Towards indexing representative images on the web
Proceedings of the 20th ACM international conference on Multimedia
Making use of eye tracking information in image collection creation and region annotation
Proceedings of the 20th ACM international conference on Multimedia
Interactive tool for image annotation using a semi-supervised and hierarchical approach
Computer Standards & Interfaces
A unified learning framework for auto face annotation by mining web facial images
Proceedings of the 21st ACM international conference on Information and knowledge management
Undoing the damage of dataset bias
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part I
Annotation propagation in large image databases via dense image correspondence
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Attributes for classifier feedback
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Constrained semi-supervised learning using attributes and comparative attributes
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Abnormal object detection by canonical scene-based contextual model
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part III
Co-inference for multi-modal scene analysis
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
Simultaneous image classification and annotation via biased random walk on tri-relational graph
ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part VI
On performance analysis of optical flow algorithms
Proceedings of the 15th international conference on Theoretical Foundations of Computer Vision: outdoor and large-scale real-world scene analysis
Technical Section: Automatic color realism enhancement for computer generated images
Computers and Graphics
Visual saliency detection with center shift
Neurocomputing
A novel image annotation feedback model based on internet-search
WISM'12 Proceedings of the 2012 international conference on Web Information Systems and Mining
Image memorability and visual inception
SIGGRAPH Asia 2012 Technical Briefs
A multiple object geometric deformable model for image segmentation
Computer Vision and Image Understanding
Web-enhanced object category learning for domestic robots
Intelligent Service Robotics
Multimedia ontology matching by using visual and textual modalities
Multimedia Tools and Applications
A spike-timing-based integrated model for pattern recognition
Neural Computation
International Journal of Computer Vision
Efficiently Scaling up Crowdsourced Video Annotation
International Journal of Computer Vision
Improving image tags by exploiting web search results
Multimedia Tools and Applications
Learning saliency-based visual attention: A review
Signal Processing
Adaptive object detection by implicit sub-class sharing features
Signal Processing
Popular research topics in multimedia
Scientometrics
Multifeature analysis and semantic context learning for image classification
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Learning realistic facial expressions from web images
Pattern Recognition
Sketch2Scene: sketch-based co-retrieval and co-placement of 3D models
ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings
OpenSurfaces: a richly annotated catalog of surface appearance
ACM Transactions on Graphics (TOG) - SIGGRAPH 2013 Conference Proceedings
Efficient ad-hoc search for personalized PageRank
Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data
Generating annotations for how-to videos using crowdsourcing
CHI '13 Extended Abstracts on Human Factors in Computing Systems
Combining crowdsourcing and google street view to identify street-level accessibility problems
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Online web-data-driven segmentation of selected moving objects in videos
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Finding happiest moments in a social context
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Hierarchical space tiling for scene modeling
ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part II
Efficient development of user-defined image recognition systems
ACCV'12 Proceedings of the 11th international conference on Computer Vision - Volume Part I
Learning to name faces: a multimodal learning scheme for search-based face annotation
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Annotation propagation in image databases using similarity graphs
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Do you need experts in the crowd?: a case study in image annotation for marine biology
Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Fish4label: accomplishing an expert task without expert knowledge
Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Proceedings of the International Workshop on Video and Image Ground Truth in Computer Vision Applications
A crowdsourcing approach to support video annotation
Proceedings of the International Workshop on Video and Image Ground Truth in Computer Vision Applications
Order preserving hashing for approximate nearest neighbor search
Proceedings of the 21st ACM international conference on Multimedia
Human vs machine: establishing a human baseline for multimodal location estimation
Proceedings of the 21st ACM international conference on Multimedia
Static saliency vs. dynamic saliency: a comparative study
Proceedings of the 21st ACM international conference on Multimedia
πLDA: document clustering with selective structural constraints
Proceedings of the 21st ACM international conference on Multimedia
MedLDA: maximum margin supervised topic models
The Journal of Machine Learning Research
Unsupervised feature construction for improving data representation and semantics
Journal of Intelligent Information Systems
Crowdsourced object segmentation with a game
Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia
Indoor scene recognition by a mobile robot through adaptive object detection
Robotics and Autonomous Systems
A boosting approach for the simultaneous detection and segmentation of generic objects
Pattern Recognition Letters
3D Wikipedia: using online text to automatically label and navigate reconstructed geometry
ACM Transactions on Graphics (TOG)
SLEDGE: Sequential Labeling of Image Edges for Boundary Detection
International Journal of Computer Vision
Object class detection: A survey
ACM Computing Surveys (CSUR)
Feasibility of identifying eating moments from first-person images leveraging human computation
Proceedings of the 4th International SenseCam & Pervasive Imaging Conference
Computer Vision and Image Understanding
A lossy counting based approach for learning on streams of graphs on a budget
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Multi-view embedding learning for incompletely labeled data
IJCAI'13 Proceedings of the Twenty-Third international joint conference on Artificial Intelligence
Tagging-by-search: automatic image region labeling using gaze information obtained from image search
Proceedings of the 19th international conference on Intelligent User Interfaces
Constraining image object search by multi-scale spectral residue analysis
Pattern Recognition Letters
Top-Down Saliency Detection via Contextual Pooling
Journal of Signal Processing Systems
Efficient semantic image segmentation with multi-class ranking prior
Computer Vision and Image Understanding
Learning semantic representations of objects and their parts
Machine Learning
From machine learning to machine reasoning
Machine Learning
C2TAM: A Cloud framework for cooperative tracking and mapping
Robotics and Autonomous Systems
The Shape Boltzmann Machine: A Strong Model of Object Shape
International Journal of Computer Vision
A jointly distributed semi-supervised topic model
Neurocomputing
Hi-index | 0.05 |
We seek to build a large collection of images with ground truth labels to be used for object detection and recognition research. Such data is useful for supervised learning and quantitative evaluation. To achieve this, we developed a web-based tool that allows easy image annotation and instant sharing of such annotations. Using this annotation tool, we have collected a large dataset that spans many object categories, often containing multiple instances over a wide variety of images. We quantify the contents of the dataset and compare against existing state of the art datasets used for object recognition and detection. Also, we show how to extend the dataset to automatically enhance object labels with WordNet, discover object parts, recover a depth ordering of objects in a scene, and increase the number of labels using minimal user supervision and images from the web.