Recent advances in computer vision and in imaging technologies have opened up new possibilities for robust 3D scene labeling. This paper makes three contributions to the state of the art in RGB-D scene labeling. First, we present a novel combination of depth and color features for recognizing object categories in isolation. Second, we use a context model that exploits the detection results of other objects in the scene to jointly optimize the labels of co-occurring objects. Third, we investigate the use of social media mining to build the context model and study its convergence. We perform thorough experiments on both the publicly available RGB-D Dataset from the University of Washington and the NYU scene dataset. An analysis of the results provides insights into contextual object category recognition and its benefits.
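To make the idea of a co-occurrence context model concrete, the following is a minimal sketch and not the method of the paper, whose features, model, and data are not given here. It assumes that social media tags arrive as plain lists of strings, that pairwise compatibility is a smoothed pointwise mutual information, and that joint labeling is done by brute-force search over a handful of segments. The function names, the smoothing scheme, the `weight` parameter, and the toy data are all hypothetical.

```python
import math
from collections import Counter
from itertools import combinations, product

def cooccurrence_scores(tag_sets, labels, smoothing=1.0):
    """Estimate a log-compatibility score for every label pair from tag lists.

    The score is a smoothed pointwise mutual information (an assumption of
    this sketch); the additive smoothing only serves to avoid log(0).
    """
    n = len(tag_sets)
    single, pair = Counter(), Counter()
    for tags in tag_sets:
        present = sorted(set(tags) & set(labels))
        single.update(present)
        pair.update(combinations(present, 2))
    scores = {(a, a): 0.0 for a in labels}  # same label twice: neutral
    for a, b in combinations(sorted(labels), 2):
        p_ab = (pair[(a, b)] + smoothing) / (n + smoothing)
        p_a = (single[a] + smoothing) / (n + smoothing)
        p_b = (single[b] + smoothing) / (n + smoothing)
        scores[(a, b)] = scores[(b, a)] = math.log(p_ab / (p_a * p_b))
    return scores

def joint_labels(unaries, labels, scores, weight=1.0):
    """Pick the label tuple maximizing detector evidence plus pairwise context.

    `unaries` holds one dict per segment mapping label -> detector probability.
    Exhaustive search is only feasible for a few segments and labels.
    """
    def objective(assignment):
        unary = sum(math.log(max(u.get(l, 0.0), 1e-9))
                    for u, l in zip(unaries, assignment))
        context = sum(scores[(assignment[i], assignment[j])]
                      for i, j in combinations(range(len(assignment)), 2))
        return unary + weight * context
    return max(product(labels, repeat=len(unaries)), key=objective)

# Toy data, invented for illustration only.
LABELS = ["bowl", "keyboard", "monitor"]
TAG_SETS = [
    ["monitor", "keyboard"],
    ["monitor", "keyboard", "mouse"],
    ["keyboard", "monitor", "desk"],
    ["bowl", "kitchen"],
    ["bowl", "spoon"],
    ["monitor"],
]
UNARIES = [
    {"monitor": 0.90, "keyboard": 0.05, "bowl": 0.05},  # confident segment
    {"bowl": 0.50, "keyboard": 0.45, "monitor": 0.05},  # ambiguous segment
]

if __name__ == "__main__":
    s = cooccurrence_scores(TAG_SETS, LABELS)
    print(joint_labels(UNARIES, LABELS, s, weight=0.0))
    print(joint_labels(UNARIES, LABELS, s, weight=1.0))
```

With `weight=0.0` each segment keeps its most probable label in isolation. With `weight=1.0` the confident monitor detection pulls the ambiguous second segment toward "keyboard", because those two tags co-occur often in the toy tag lists. A practical system would replace the exhaustive search with approximate inference in a graphical model.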