Visual recognition with humans in the loop

Authors:
Steve Branson;Catherine Wah;Florian Schroff;Boris Babenko;Peter Welinder;Pietro Perona;Serge Belongie
Affiliations:
University of California, San Diego;University of California, San Diego;University of California, San Diego;University of California, San Diego;California Institute of Technology;California Institute of Technology;University of California, San Diego
Venue:
ECCV'10 Proceedings of the 11th European conference on Computer vision: Part IV
Year:
2010

Abstract

We present an interactive, hybrid human-computer method for object classification. The method applies to classes of objects that are recognizable by people with appropriate expertise (e.g., animal species or airplane model), but not (in general) by people without such expertise. It can be seen as a visual version of the 20 questions game, where questions based on simple visual attributes are posed interactively. The goal is to identify the true class while minimizing the number of questions asked, using the visual content of the image. We introduce a general framework for incorporating almost any off-the-shelf multi-class object recognition algorithm into the visual 20 questions game, and provide methodologies to account for imperfect user responses and unreliable computer vision algorithms. We evaluate our methods on Birds-200, a difficult dataset of 200 tightly-related bird species, and on the Animals With Attributes dataset. Our results demonstrate that incorporating user input drives up recognition accuracy to levels that are good enough for practical applications, while at the same time, computer vision reduces the amount of human interaction required.
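
The question-asking loop described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes that questions are binary, that the next question is chosen by expected information gain (the abstract does not name a selection criterion), and that user answers are conditionally independent given the class. The names `posterior`, `next_question`, `prior`, and `likelihoods` are hypothetical. Here `prior` stands in for the class probabilities produced by any multi-class recognition algorithm, and `likelihoods[c][q]` is the assumed probability that a user answers "yes" to question `q` for class `c`; keeping these values strictly between 0 and 1 is one simple way to model imperfect user responses.

```python
import math


def posterior(prior, likelihoods, answers):
    """Bayes update: p(class | image, answers) is proportional to
    p(class | image) times the product of p(answer_q | class)."""
    scores = []
    for c, p in enumerate(prior):
        for q, answered_yes in answers.items():
            p_yes = likelihoods[c][q]
            p *= p_yes if answered_yes else (1.0 - p_yes)
        scores.append(p)
    total = sum(scores)  # assumed non-zero: likelihoods lie strictly in (0, 1)
    return [s / total for s in scores]


def entropy(dist):
    """Shannon entropy in bits."""
    return -sum(p * math.log2(p) for p in dist if p > 0)


def next_question(prior, likelihoods, answers):
    """Return the unasked question with the largest expected entropy
    reduction, or None if every question has been asked."""
    current = posterior(prior, likelihoods, answers)
    best_q, best_gain = None, -1.0
    for q in range(len(likelihoods[0])):
        if q in answers:
            continue
        # Probability of a "yes" answer under the current class posterior.
        p_yes = sum(pc * likelihoods[c][q] for c, pc in enumerate(current))
        expected = 0.0
        for answer, p_answer in ((True, p_yes), (False, 1.0 - p_yes)):
            if p_answer > 0:
                updated = posterior(prior, likelihoods, {**answers, q: answer})
                expected += p_answer * entropy(updated)
        gain = entropy(current) - expected
        if gain > best_gain:
            best_q, best_gain = q, gain
    return best_q


# Toy example (made-up numbers): 3 classes, 2 questions.
prior = [0.5, 0.3, 0.2]            # stand-in for classifier output on the image
likelihoods = [[0.9, 0.5],         # question 0 separates class 0 from the rest
               [0.1, 0.5],         # question 1 is uninformative for all classes
               [0.1, 0.5]]

first = next_question(prior, likelihoods, {})
after_yes = posterior(prior, likelihoods, {0: True})
```

In this toy setup the informative question is selected first, and a "yes" answer sharpens the classifier's initial estimate rather than replacing it. That mirrors the division of labor the abstract describes: the vision output reduces how many questions are needed, and the answers correct the vision output.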