Action recognition based on learnt motion semantic vocabulary
PCM'10 Proceedings of the 11th Pacific Rim conference on Advances in multimedia information processing: Part I
In this paper, we propose a Hierarchical Spatial Markov Model (HSMM) for image categorization. We adopt the Bag-of-Words (BoW) model to represent image features as visual words, thus avoiding the heavy manual annotation required by most Markov-model-based approaches. Our HSMM describes the spatial relations of these visual words by modeling the distribution of transitions between adjacent words for each image category. We introduce a novel semantic hierarchy into the model to represent the composition relationships among visual words at the semantic level. Experiments demonstrate that our approach outperforms a Bayesian hierarchical model based categorization approach by 12.5%, and that it also performs better than the previous Markov-model-based approach by 11.8% on average.
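The idea of modeling transitions between adjacent visual words per category can be illustrated with a minimal sketch. This is not the HSMM from the paper: it is a flat, first-order spatial Markov model with no semantic hierarchy, and it assumes each image has already been quantized into a 2-D grid of visual-word indices. The function names, the add-one smoothing parameter `alpha`, and the toy "stripes" and "checker" categories are all hypothetical choices made for illustration.

```python
import numpy as np

def transition_counts(grid, vocab_size):
    """Count transitions between horizontally and vertically adjacent words."""
    counts = np.zeros((vocab_size, vocab_size))
    # Left-to-right neighbours.
    np.add.at(counts, (grid[:, :-1].ravel(), grid[:, 1:].ravel()), 1)
    # Top-to-bottom neighbours.
    np.add.at(counts, (grid[:-1, :].ravel(), grid[1:, :].ravel()), 1)
    return counts

def fit_category(grids, vocab_size, alpha=1.0):
    """Estimate a row-normalized transition matrix for one category.

    alpha is an additive smoothing constant (an assumption of this sketch).
    """
    counts = sum(transition_counts(g, vocab_size) for g in grids) + alpha
    return counts / counts.sum(axis=1, keepdims=True)

def log_likelihood(grid, trans):
    """Log-likelihood of a grid's adjacent-word transitions under a model."""
    counts = transition_counts(grid, trans.shape[0])
    return float((counts * np.log(trans)).sum())

def classify(grid, models):
    """Return the category whose transition model best explains the grid."""
    return max(models, key=lambda c: log_likelihood(grid, models[c]))

# Toy data: a vocabulary of two visual words on 3x3 grids.
stripes = np.array([[0, 0, 0], [1, 1, 1], [0, 0, 0]])
checker = np.array([[0, 1, 0], [1, 0, 1], [0, 1, 0]])

models = {
    "stripes": fit_category([stripes], vocab_size=2),
    "checker": fit_category([checker], vocab_size=2),
}
predicted = classify(checker, models)
```

An unseen image would be categorized by the same `classify` call, which picks the category whose transition distribution assigns the highest likelihood to the image's adjacent word pairs. A hierarchical variant would additionally group visual words into higher-level semantic units, which this sketch does not attempt.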