Learning neighborhood cooccurrence statistics of sparse features for human activity recognition

  • Authors:
  • P. Banerjee;R. Nevatia

  • Affiliations:
  • Inst. for Robot. & Intell. Syst., Univ. of Southern California, Los Angeles, CA, USA;Inst. for Robot. & Intell. Syst., Univ. of Southern California, Los Angeles, CA, USA

  • Venue:
  • AVSS '11 Proceedings of the 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

A common approach to activity recognition has been the use of histogram of codewords computed from Spatio Temporal Interest Points (STIPs). Recent methods have focused on leveraging the spatio-temporal neighborhood structure of the features, but they are generally restricted to aggregate statistics over the entire video volume, and ignore local pairwise relationships. Our goal is to capture these relations in terms of pairwise cooccurrence statistics of codewords. We show a reduction of such cooccurrence relations to the edges connecting the latent variables of a Conditional Random Field (CRF) classifier. As a consequence, we also learn the codeword dictionary as a part of the maximum likelihood learning process, with each interest point assigned a probability distribution over the codewords. We show results on two widely used activity recognition datasets.