SURFing the point clouds: Selective 3D spatial pyramids for category-level object recognition

Authors:
Roberto J. Lopez-Sastre
Affiliations:
GRAM, Dept. of Signal Theory and Communications, University of Alcalá, Alcalá de Henares, Spain
Venue:
CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Year:
2012

Citing 0
Cited 4

3D object detection with multiple kinects

ECCV'12 Proceedings of the 12th international conference on Computer Vision - Volume 2
Special Section on 3D Object Retrieval: Evaluating 3D spatial pyramids for classifying 3D shapes

Computers and Graphics
Multi-modal descriptors for multi-class hand pose recognition in human computer interaction systems

Proceedings of the 15th ACM on International conference on multimodal interaction
Modeling and correction of multipath interference in time of flight cameras

Image and Vision Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper proposes a novel approach to recognize object categories in point clouds. By quantizing 3D SURF local descriptors, computed on partial 3D shapes extracted from the point clouds, a vocabulary of 3D visual words is generated. Using this codebook, we build a Bag-of-Words representation in 3D, which is used in conjunction with a SVM classification machinery. We also introduce the 3D Spatial Pyramid Matching Kernel, which works by partitioning a working volume into fine sub-volumes, and computing a hierarchical weighted sum of histogram intersections at each level of the pyramid structure. With the aim of increasing both the classification accuracy and the computational efficiency of the kernel, we propose selective hierarchical volume decomposition strategies, based on representative and discriminative (sub-)volume selection processes, which drastically reduce the pyramid to consider. Results on the challenging large-scale RGB-D object dataset show that our kernels significantly outperform the state-of-the-art results by using a single 3D shape feature type extracted from individual depth images.