Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes

Authors:
Jesse Hoey;James J. Little
Affiliations:
-;IEEE
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
2007

Abstract

This paper presents a method for learning decision-theoretic models of human behaviors from video data. Our system learns relationships between the movements of a person, the context in which they are acting, and a utility function. This learning makes explicit that the meaning of a behavior to an observer is contained in its relationship to actions and outcomes. An agent wishing to capitalize on these relationships must learn to distinguish the behaviors according to how they help the agent to maximize utility. The model we use is a partially observable Markov decision process, or POMDP. The video observations are integrated into the POMDP using a dynamic Bayesian network that creates spatial and temporal abstractions amenable to decision making at the high level. The parameters of the model are learned from training data using an a posteriori constrained optimization technique based on the expectation-maximization algorithm. The system automatically discovers classes of behaviors and determines which are important for choosing actions that optimize over the utility of possible outcomes. This type of learning obviates the need for labeled data from expert knowledge about which behaviors are significant and removes bias about what behaviors may be useful to recognize in a particular situation. We show results in three interactions: a single-player imitation game, a gestural robotic control problem, and a card game played by two people.
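
To make the POMDP idea in the abstract concrete, the following is a minimal sketch of a discrete belief update followed by a one-step (myopic) action choice. It is not the authors' model or learning procedure: the two hidden behavior states, the transition table `T`, the observation table `O`, the reward table `R`, and the function names are all hypothetical values chosen for illustration, and the paper's dynamic Bayesian network over video features is replaced here by a single discrete observation.

```python
# Hypothetical toy POMDP: 2 hidden behavior states, 2 actions, 2 observations.
# All numbers below are illustrative assumptions, not values from the paper.

def belief_update(belief, action, observation, T, O):
    """Bayes filter: b'(s') is proportional to O[a][s'][o] * sum_s T[a][s][s'] * b(s)."""
    n = len(belief)
    unnormalized = []
    for s_next in range(n):
        # Predict: probability of landing in s_next before seeing the observation.
        predicted = sum(T[action][s][s_next] * belief[s] for s in range(n))
        # Correct: weight by the likelihood of the observation in s_next.
        unnormalized.append(O[action][s_next][observation] * predicted)
    total = sum(unnormalized)
    if total == 0.0:
        raise ValueError("observation has zero probability under the model")
    return [p / total for p in unnormalized]


def myopic_action(belief, R):
    """Pick the action with the highest expected immediate reward under the belief.

    This is a one-step approximation, not a full POMDP policy.
    """
    expected = [sum(b * r for b, r in zip(belief, R[a])) for a in range(len(R))]
    return max(range(len(R)), key=lambda a: expected[a])


# T[a][s][s']: the hidden behavior tends to persist, whatever the action.
T = [[[0.9, 0.1], [0.1, 0.9]],
     [[0.9, 0.1], [0.1, 0.9]]]
# O[a][s'][o]: a noisy observation of the hidden behavior.
O = [[[0.8, 0.2], [0.3, 0.7]],
     [[0.8, 0.2], [0.3, 0.7]]]
# R[a][s]: each action pays off when it matches the hidden behavior.
R = [[1.0, -1.0],
     [-1.0, 1.0]]

prior = [0.5, 0.5]
posterior = belief_update(prior, action=0, observation=0, T=T, O=O)
chosen = myopic_action(posterior, R)
```

In this sketch, observation 0 shifts the belief toward state 0, so the action that pays off in state 0 is chosen. A full POMDP solution would instead plan over future beliefs rather than only the immediate reward.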