Contextual object category recognition for RGB-D scene labeling

Authors:
Haider Ali;Faisal Shafait;Eirini Giannakidou;Athena Vakali;Nadia Figueroa;Theodoros Varvadoukas;Nikolaos Mavridis
Affiliations:
-;-;-;-;-;-;-
Venue:
Robotics and Autonomous Systems
Year:
2014

Citing 26
Cited 0

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Communications of the ACM
Representing and Recognizing the Visual Appearance of Materials using Three-dimensional Textons

International Journal of Computer Vision
Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope

International Journal of Computer Vision
Distinctive Image Features from Scale-Invariant Keypoints

International Journal of Computer Vision
Usage patterns of collaborative tagging systems

Journal of Information Science
HT06, tagging paper, taxonomy, Flickr, academic article, to read

Proceedings of the seventeenth conference on Hypertext and hypermedia
Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive Study

International Journal of Computer Vision
The complex dynamics of collaborative tagging

Proceedings of the 16th international conference on World Wide Web
Towards automatic extraction of event and place semantics from flickr tags

SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
How flickr helps us make sense of the world: context and content in community-contributed media collections

Proceedings of the 15th international conference on Multimedia
Speeded-Up Robust Features (SURF)

Computer Vision and Image Understanding
World-scale mining of objects and events from community photo collections

CIVR '08 Proceedings of the 2008 international conference on Content-based image and video retrieval
LIBLINEAR: A Library for Large Linear Classification

The Journal of Machine Learning Research
Co-Clustering Tags and Social Data Sources

WAIM '08 Proceedings of the 2008 The Ninth International Conference on Web-Age Information Management
Towards semantic maps for mobile robots

Robotics and Autonomous Systems
Boosting image retrieval through aggregating search results based on visual annotations

MM '08 Proceedings of the 16th ACM international conference on Multimedia
Mapping the world's photos

Proceedings of the 18th international conference on World wide web
Semantic context transfer across heterogeneous sources for domain adaptive video search

MM '09 Proceedings of the 17th ACM international conference on Multimedia
Leveraging social media for training object detectors

DSP'09 Proceedings of the 16th international conference on Digital Signal Processing
Utilizing object-object and object-scene context when planning to find things

ICRA'09 Proceedings of the 2009 IEEE international conference on Robotics and Automation
Probabilistic Graphical Models: Principles and Techniques - Adaptive Computation and Machine Learning

Probabilistic Graphical Models: Principles and Techniques - Adaptive Computation and Machine Learning
LIBSVM: A library for support vector machines

ACM Transactions on Intelligent Systems and Technology (TIST)
RGB-D mapping: Using Kinect-style depth cameras for dense 3D modeling of indoor environments

International Journal of Robotics Research
3D Registration for Verification of Humanoid Justin's Upper Body Kinematics

CRV '12 Proceedings of the 2012 Ninth Conference on Computer and Robot Vision
RGB-(D) scene labeling: Features and algorithms

CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Indoor segmentation and support inference from RGBD images

ECCV'12 Proceedings of the 12th European conference on Computer Vision - Volume Part V

Quantified Score

Hi-index	0.00

Visualization

Abstract

Recent advances in computer vision on the one hand, and imaging technologies on the other hand, have opened up a number of interesting possibilities for robust 3D scene labeling. This paper presents contributions in several directions to improve the state-of-the-art in RGB-D scene labeling. First, we present a novel combination of depth and color features to recognize different object categories in isolation. Then, we use a context model that exploits detection results of other objects in the scene to jointly optimize labels of co-occurring objects in the scene. Finally, we investigate the use of social media mining to develop the context model, and provide an investigation of its convergence. We perform thorough experimentation on both the publicly available RGB-D Dataset from the University of Washington as well as on the NYU scene dataset. An analysis of the results shows interesting insights about contextual object category recognition, and its benefits.