Human action recognition using pyramid vocabulary tree

  • Authors:
  • Chunfeng Yuan;Xi Li;Weiming Hu;Hanzi Wang

  • Affiliations:
  • National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China;National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China;National Laboratory of Pattern Recognition, Institute of Automation, CAS, Beijing, China;School of Computer Science, University of Adelaide, SA, Australia

  • Venue:
  • ACCV'09 Proceedings of the 9th Asian conference on Computer Vision - Volume Part III
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The bag-of-visual-words (BOVW) approaches are widely used in human action recognition Usually, large vocabulary size of the BOVW is more discriminative for inter-class action classification while small one is more robust to noise and thus tolerant to the intra-class invariance In this pape, we propose a pyramid vocabulary tree to model local spatio-temporal features, which can characterize the inter-class difference and also allow intra-class variance Moreover, since BOVW is geometrically unconstrained, we further consider the spatio-temporal information of local features and propose a sparse spatio-temporal pyramid matching kernel (termed as SST-PMK) to compute the similarity measures between video sequences SST-PMK satisfies the Mercer's condition and therefore is readily integrated into SVM to perform action recognition Experimental results on the Weizmann datasets show that both the pyramid vocabulary tree and the SST-PMK lead to a significant improvement in human action recognition.