Non-sequential multi-view detection, localization and identification of people using multi-modal feature maps

  • Authors:
  • Rok Mandeljc;Stanislav Kovačič;Matej Kristan;Janez Perš

  • Affiliations:
  • Faculty of Electrical Engineering, University of Ljubljana, Slovenia;Faculty of Electrical Engineering, University of Ljubljana, Slovenia;Faculty of Electrical Engineering, University of Ljubljana, Slovenia;Faculty of Electrical Engineering, University of Ljubljana, Slovenia

  • Venue:
  • ACCV'12 Proceedings of the 11th Asian conference on Computer Vision - Volume Part III
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present a novel multi-modal fusion framework for non-sequential person detection, localization and identification from multiple views. Our goal is independent processing of randomly-accessed sections of video, either individual frames or small batches thereof. This way, we aim to limit the error propagation that makes the existing approaches unsuitable for fully-autonomous tracking of multiple people in long video sequences. Our framework uses one or more trained classifiers to fuse multiple weak feature maps. We perform experimental validation on a challenging dataset, demonstrating how the framework can, depending on the provided feature maps, be used either only to improve generic person detection, or enable simultaneous detection and recognition of individuals. Finally, we show that tracking-by-identification using the output of the proposed framework outperforms the state-of-the-art identification-by-tracking approach in terms of preserved track identities.