Real-time upper body detection and 3d pose estimation in monoscopic images

  • Authors:
  • Antonio S. Micilotta;Eng-Jon Ong;Richard Bowden

  • Affiliations:
  • Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, Surrey, United Kingdom;Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, Surrey, United Kingdom;Centre for Vision, Speech and Signal Processing, University of Surrey, Guildford, Surrey, United Kingdom

  • Venue:
  • ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part III
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a novel solution to the difficult task of both detecting and estimating the 3D pose of humans in monoscopic images. The approach consists of two parts. Firstly the location of a human is identified by a probabalistic assembly of detected body parts. Detectors for the face, torso and hands are learnt using adaBoost. A pose likliehood is then obtained using an a priori mixture model on body configuration and possible configurations assembled from available evidence using RANSAC. Once a human has been detected, the location is used to initialise a matching algorithm which matches the silhouette and edge map of a subject with a 3D model. This is done efficiently using chamfer matching, integral images and pose estimation from the initial detection stage. We demonstrate the application of the approach to large, cluttered natural images and at near framerate operation (16fps) on lower resolution video streams.