Augmented Reality: Enhanced personal autostereoscopic telepresence system using commodity depth cameras

  • Authors:
  • Andrew Maimone; Jonathan Bidwell; Kun Peng; Henry Fuchs

  • Affiliations:
  • Department of Computer Science, University of North Carolina at Chapel Hill, CB# 3175, Brooks Computer Science Building, 201 S Columbia St., Chapel Hill, NC 27599-3175, United States (all authors)

  • Venue:
  • Computers and Graphics
  • Year:
  • 2012

Abstract

This paper describes an enhanced telepresence system that offers fully dynamic, real-time 3D scene capture and continuous-viewpoint, head-tracked stereo 3D display without requiring the user to wear any tracking or viewing apparatus. We present a complete software and hardware framework for implementing the system, which is based on an array of commodity Microsoft Kinect™ color-plus-depth cameras. Contributions include an algorithm for merging data between multiple depth cameras and techniques for automatic color calibration and preserving stereo quality even with low rendering rates. Also presented is a solution to the problem of interference that occurs between Kinect cameras with overlapping views. Emphasis is placed on a fully GPU-accelerated data processing and rendering pipeline that can apply hole filling, smoothing, data merger, surface generation, and color correction at rates of up to 200 million triangles/s on a single PC and graphics board. Also presented is a Kinect-based markerless tracking system that combines 2D eye recognition with depth information to allow head-tracked stereo views to be rendered for a parallax barrier autostereoscopic display. Enhancements in calibration, filtering, and data merger were made to improve image quality over a previous version of the system.
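The markerless head-tracking approach the abstract describes — combining a 2D eye detection with per-pixel depth — can be illustrated by back-projecting the detected pixel through a pinhole camera model into a 3D camera-space position. The sketch below is not the authors' implementation; the intrinsic parameter values are rough, commonly cited Kinect v1 defaults used purely for demonstration.

```python
def backproject(u, v, depth_m, fx=594.2, fy=591.0, cx=339.3, cy=242.7):
    """Map a pixel (u, v) with metric depth to a 3D point in camera space.

    Uses the standard pinhole model: X = (u - cx) * Z / fx, and likewise
    for Y. The intrinsics above are illustrative Kinect v1 estimates, not
    values from the paper.
    """
    x = (u - cx) * depth_m / fx
    y = (v - cy) * depth_m / fy
    return (x, y, depth_m)

# Example: an eye detector fires at pixel (320, 240), and the depth map
# reports 1.2 m at that pixel; the result is a 3D eye position suitable
# for driving a head-tracked stereo renderer.
eye_3d = backproject(320, 240, 1.2)
```

In a real pipeline the 2D detection would come from an eye/face detector on the color image, and the depth sample would need mapping from the color frame into the depth frame (the two Kinect sensors are offset), plus temporal filtering to stabilize the rendered viewpoint.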