Using Multi-view Recognition and Meta-data Annotation to Guide a Robot's Attention

Authors:
Alexander Thomas;Vittorio Ferrari;Bastian Leibe;Tinne Tuytelaars;Luc Van Gool
Affiliations:
ESAT-PSI/VISICS, KU Leuven, Heverlee 3001, Belgium,;Computer Vision Laboratory, ETH Zürich, Zürich 8092,Switzerland;UMIC Research Centre RWTH Aachen, 52056 Aachen Germany;ESAT-PSI/VISICS, KU Leuven, Heverlee 3001, Belgium;Computer Vision Laboratory, ETH Zürich, Zürich 8092,Switzerland, ESAT-PSI/VISICS, KU Leuven, Heverlee 3001, Belgium
Venue:
International Journal of Robotics Research
Year:
2009

Citing 20
Cited 6

Image analogies

Proceedings of the 28th annual conference on Computer graphics and interactive techniques
Mean Shift, Mode Seeking, and Clustering

IEEE Transactions on Pattern Analysis and Machine Intelligence
Bayesian Reconstruction of 3D Shapes and Scenes From A Single Image

HLK '03 Proceedings of the First IEEE International Workshop on Higher-Level Knowledge in 3D Modeling and Motion Analysis
Scale & Affine Invariant Interest Point Detectors

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Pedestrian Detection in Crowded Scenes

CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Geometric Context from a Single Image

ICCV '05 Proceedings of the Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1 - Volume 01
3D Object Modeling and Recognition Using Local Affine-Invariant Image Descriptors and Multi-View Spatial Constraints

International Journal of Computer Vision
Simultaneous Object Recognition and Segmentation from Single or Multiple Model Views

International Journal of Computer Vision
Putting Objects in Perspective

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Towards Multi-View Object Class Detection

CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Example Based 3D Reconstruction from Single 2D Images

CVPRW '06 Proceedings of the 2006 Conference on Computer Vision and Pattern Recognition Workshop
3D City Modeling Using Cognitive Loops

3DPVT '06 Proceedings of the Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06)
Omnidirectional Vision Based Topological Navigation

International Journal of Computer Vision
Robust Object Detection with Interleaved Categorization and Segmentation

International Journal of Computer Vision
Curious George: An attentive semantic robot

Robotics and Autonomous Systems
SIFT Flow: Dense Correspondence across Different Scenes

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part III
Shape-from-recognition: Recognition enables meta-data transfer

Computer Vision and Image Understanding
SURF: speeded up robust features

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part I
The 2005 PASCAL visual object classes challenge

MLCW'05 Proceedings of the First international conference on Machine Learning Challenges: evaluating Predictive Uncertainty Visual Object Classification, and Recognizing Textual Entailment

Backprojection revisited: scalable multi-view object detection and similarity metrics for detections

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part I
Depth-encoded hough voting for joint object detection and shape recovery

ECCV'10 Proceedings of the 11th European conference on Computer vision: Part V
Multiple viewpoint recognition and localization

ACCV'10 Proceedings of the 10th Asian conference on Computer vision - Volume Part I
A multi-view annotation tool for people detection evaluation

Proceedings of the 1st International Workshop on Visual Interfaces for Ground Truth Collection in Computer Vision Applications
Object detection, shape recovery, and 3D modelling by depth-encoded hough voting

Computer Vision and Image Understanding
The Shape Boltzmann Machine: A Strong Model of Object Shape

International Journal of Computer Vision

Quantified Score

Hi-index	0.01

Visualization

Abstract

In the transition from industrial to service robotics, robots will have to deal with increasingly unpredictable and variable environments. We present a system that is able to recognize objects of a certain class in an image and to identify their parts for potential interactions. The method can recognize objects from arbitrary viewpoints and generalizes to instances that have never been observed during training, even if they are partially occluded and appear against cluttered backgrounds. Our approach builds on the implicit shape model of Leibe et al. We extend it to couple recognition to the provision of meta-data useful for a task and to the case of multiple viewpoints by integrating it with the dense multi-view correspondence finder of Ferrari et al. Meta-data can be part labels but also depth estimates, information on material types, or any other pixelwise annotation. We present experimental results on wheelchairs, cars, and motorbikes.