Joint 2D-3D temporally consistent semantic segmentation of street scenes

Authors:
Georgios Floros
Affiliations:
UMIC Research Centre RWTH Aachen University, Germany
Venue:
CVPR '12 Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)
Year:
2012

Abstract

In this paper, we propose a novel Conditional Random Field (CRF) formulation for the semantic scene labeling problem that enforces temporal consistency between consecutive video frames and exploits the 3D scene geometry to improve segmentation quality. The main contribution of this work is the use of a 3D scene reconstruction to temporally couple the individual image segmentations, which allows information to flow from the 3D geometry to the 2D image space. As our results show, the proposed framework outperforms state-of-the-art methods and opens a new perspective on a tighter interplay of 2D and 3D information in scene understanding.