Recognizing human actions by fusing spatio-temporal appearance and motion descriptors

  • Authors:
  • Lamberto Ballan;Marco Bertini;Alberto Del Bimbo;Lorenzo Seidenari;Giuseppe Serra

  • Affiliations:
  • Media Integration and Communication Center, University of Florence, Italy;Media Integration and Communication Center, University of Florence, Italy;Media Integration and Communication Center, University of Florence, Italy;Media Integration and Communication Center, University of Florence, Italy;Media Integration and Communication Center, University of Florence, Italy

  • Venue:
  • ICIP'09 Proceedings of the 16th IEEE international conference on Image processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we propose a new method for human action categorization by using an effective combination of a new 3D gradient descriptor with an optic flow descriptor, to represent spatio-temporal interest points. These points are used to represent video sequences using a bag of spatio-temporal visual words, following the successful results achieved in object and scene classification. We extensively test our approach on the standard KTH and Weizmann actions datasets, showing its validity and good performance. Experimental results outperform state-of-the-art methods, without requiring fine parameter tuning.