Human action recognition in video by 'meaningful' poses

Authors:
Snehasis Mukherjee;Sujoy Kumar Biswas;Dipti Prasad Mukherjee
Affiliations:
Indian Statistical Institute, Kolkata, India;Indian Statistical Institute, Kolkata, India;Indian Statistical Institute, Kolkata, India
Venue:
Proceedings of the Seventh Indian Conference on Computer Vision, Graphics and Image Processing
Year:
2010

Citing 16
Cited 0

Introduction to Algorithms

Introduction to Algorithms
Estimating Human Body Configurations Using Shape Context Matching

ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part III
X-means: Extending K-means with Efficient Estimation of the Number of Clusters

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
A Grouping Principle and Four Applications

IEEE Transactions on Pattern Analysis and Machine Intelligence
Recognizing Action at a Distance

ICCV '03 Proceedings of the Ninth IEEE International Conference on Computer Vision - Volume 2
Recognizing Human Actions: A Local SVM Approach

ICPR '04 Proceedings of the Pattern Recognition, 17th International Conference on (ICPR'04) Volume 3 - Volume 03
Maxdiff kd-trees for data condensation

Pattern Recognition Letters
Pattern Recognition and Machine Learning (Information Science and Statistics)

Pattern Recognition and Machine Learning (Information Science and Statistics)
Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words

International Journal of Computer Vision
Tracking and recognizing actions of multiple hockey players using the boosted particle filter

Image and Vision Computing
From Gestalt Theory to Image Analysis: A Probabilistic Approach

From Gestalt Theory to Image Analysis: A Probabilistic Approach
Human Action Recognition by Semilatent Topic Models

IEEE Transactions on Pattern Analysis and Machine Intelligence
An iterative image registration technique with an application to stereo vision

IJCAI'81 Proceedings of the 7th international joint conference on Artificial intelligence - Volume 2
An Experimental Study of Graph Connectivity for Unsupervised Word Sense Disambiguation

IEEE Transactions on Pattern Analysis and Machine Intelligence
Recovering human body configurations: combining segmentation and recognition

CVPR'04 Proceedings of the 2004 IEEE computer society conference on Computer vision and pattern recognition
Shape-from-silhouette of articulated objects and its use for human body kinematics estimation and motion capture

CVPR'03 Proceedings of the 2003 IEEE computer society conference on Computer vision and pattern recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

We propose a graph theoretic technique for recognizing actions at a distance by modeling the visual senses associated with human poses. Identifying the intended meaning of poses is a challenging task because of their variability and such variations in poses lead to visual sense ambiguity. Our methodology follows a bag-of-words approach. Here "word" refers to the pose descriptor of the human figure corresponding to a single video frame and a "document" corresponds to the entire video of a particular action. From a large vocabulary of poses we prune out ambiguous poses and extract 'meaningful' [6] poses - for each action type in a supervised fashion - using centrality measure of graph connectivity [16]. The number of 'meaningful' poses per action is determined by setting a bound on the centrality measure. We evaluate our methodology on four standard activity recognition datasets and the results clearly demonstrate the superiority of our approach over the present state-of-the-art.