On parsing visual sequences with the hidden Markov model

  • Authors:
  • Naomi Harte;Daire Lennon;Anil Kokaram

  • Affiliations:
  • School of Engineering, Trinity College Dublin, Dublin 2, Ireland;School of Engineering, Trinity College Dublin, Dublin 2, Ireland;School of Engineering, Trinity College Dublin, Dublin 2, Ireland

  • Venue:
  • Journal on Image and Video Processing
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Hidden Markov Models have been employed in many vision applications to model and identify events of interest. Their use is common in applications where HMMs are used to classify previously divided segments of video as one of a set of events being modelled. HMMs can also simultaneously segment and classify events within a continuous video, without the need for a separate first step to identify the start and end of the events. This is significantly less common. This paper is an exploration of the development of HMM frameworks for such complete event recognition. A review of how HMMs have been applied to both event classification and recognition is presented. The discussion evolves in parallel with an example of a real application in psychology for illustration. The complete videos depict sessions where candidates perform a number of different exercises under the instruction of a psychologist. The goal is to isolate portions of video containing just one of these exercises. The exercise involves rotating the head of a kneeling subject to the left, back to centre, to the right, to the centre, and repeating a number of times. By designing a HMM system to automatically isolate portions of video containing this exercise, issues such as the strategy of choice of event to be modelled, feature design and selection, as well as training and testing are reviewed. Thus this paper shows how HMMs can be more extensively applied in the domain of event recognition in video.