Robust Visual Tracking by Integrating Multiple Cues Based on Co-Inference Learning

Authors:
Ying Wu;Thomas S. Huang
Affiliations:
Department of Electrical & Computer Engineering, Northwestern University, 2145 Sheridan Road, Evanston, IL 60208, USA. yingwu@ece.northwestern.edu;Beckman Institute, University of Illinois at Urbana-Champaign, 405 N. Mathews, Urbana, IL 61801, USA. huang@ifp.uiuc.edu
Venue:
International Journal of Computer Vision - Special Issue on Computer Vision Research at the Beckman Institute of Advanced Science and Technology
Year:
2004

Citing 19
Cited 26

Color indexing

International Journal of Computer Vision
Pfinder: Real-Time Tracking of the Human Body

IEEE Transactions on Pattern Analysis and Machine Intelligence
Visual Interpretation of Hand Gestures for Human-Computer Interaction: A Review

IEEE Transactions on Pattern Analysis and Machine Intelligence
Factorial Hidden Markov Models

Machine Learning - Special issue on learning with probabilistic representations
Combining labeled and unlabeled data with co-training

COLT' 98 Proceedings of the eleventh annual conference on Computational learning theory
CONDENSATION—Conditional Density Propagation forVisual Tracking

International Journal of Computer Vision
The visual analysis of human movement: a survey

Computer Vision and Image Understanding
An Introduction to Variational Methods for Graphical Models

Machine Learning
On sequential Monte Carlo sampling methods for Bayesian filtering

Statistics and Computing
Contour Tracking by Stochastic Propagation of Conditional Density

ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume I - Volume I
EigenTracking: Robust Matching and Tracking of Articulated Objects Using a View-Based Representation

ECCV '96 Proceedings of the 4th European Conference on Computer Vision-Volume I - Volume I
ICONDENSATION: Unifying Low-Level and High-Level Tracking in a Stochastic Framework

ECCV '98 Proceedings of the 5th European Conference on Computer Vision-Volume I - Volume I
Colour Model Selection and Adaption in Dynamic Scenes

ECCV '98 Proceedings of the 5th European Conference on Computer Vision-Volume I - Volume I
Learning and Recognizing Human Dynamics in Video Sequences

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Real-time tracking of image regions with changes in geometry and illumination

CVPR '96 Proceedings of the 1996 Conference on Computer Vision and Pattern Recognition (CVPR '96)
Reliable Tracking of Human Arm Dynamics by Multiple Cue Integration and Constraint Fusion

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Elliptical Head Tracking Using Intensity Gradients and Color Histograms

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Integrated Person Tracking Using Stereo, Color, and Pattern Detection

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Joint Probabilistic Techniques for Tracking Multi-Part Objects

CVPR '98 Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

An approach of visual motion analysis

Pattern Recognition Letters - Special issue: In memoriam Azriel Rosenfeld
A General Framework for Combining Visual Trackers --- The "Black Boxes" Approach

International Journal of Computer Vision
Partial Linear Gaussian Models for Tracking in Image Sequences Using Sequential Monte Carlo Methods

International Journal of Computer Vision
Robust Face Tracking with Suppressed False Positives in Smart Home Environment

ICOST '08 Proceedings of the 6th international conference on Smart Homes and Health Telematics
Tracking and recognizing actions of multiple hockey players using the boosted particle filter

Image and Vision Computing
Interactive Tracking of 2D Generic Objects with Spacetime Optimization

ECCV '08 Proceedings of the 10th European Conference on Computer Vision: Part I
Person localization using a wearable camera towards enhancing social interactions for individuals with visual impairment

MSIADU '09 Proceedings of the 1st ACM SIGMM international workshop on Media studies and implementations that help improving access to disabled users
On the optimality of motion-based particle filtering

IEEE Transactions on Circuits and Systems for Video Technology
Visual tracking algorithm based on CAMSHIFT and multi-cue Fusion for human motion analysis

SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
Object tracking with particle filter using color information

MIRAGE'07 Proceedings of the 3rd international conference on Computer vision/computer graphics collaboration techniques
Adaptive multiple object tracking using colour and segmentation cues

ACCV'07 Proceedings of the 8th Asian conference on Computer vision - Volume Part I
Tracking objects with generic calibrated sensors: An algorithm based on color and 3D shape features

Robotics and Autonomous Systems
Learning an intrinsic-variable preserving manifold for dynamic visual tracking

IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics - Special issue on game theory
Dynamic multi-cue tracking with detection responses association

Proceedings of the international conference on Multimedia
Incremental Tensor Subspace Learning and Its Applications to Foreground Segmentation and Tracking

International Journal of Computer Vision
Visual tracking using the Earth Mover's Distance between Gaussian mixtures and Kalman filtering

Image and Vision Computing
A hierarchical feature fusion framework for adaptive visual tracking

Image and Vision Computing
Effective appearance model and similarity measure for particle filtering and visual tracking

ECCV'06 Proceedings of the 9th European conference on Computer Vision - Volume Part III
On pedestrian detection and tracking in infrared videos

Pattern Recognition Letters
A large margin framework for single camera offline tracking with hybrid cues

Computer Vision and Image Understanding
Explaining Activities as Consistent Groups of Events

International Journal of Computer Vision
Dynamic appearance model for particle filter based visual tracking

Pattern Recognition
Robust Visual Tracking via Structured Multi-Task Sparse Learning

International Journal of Computer Vision
Disagreement-Based multi-system tracking

ACCV'12 Proceedings of the 11th international conference on Computer Vision - Volume 2
Adaptive multi-cue based particle swarm optimization guided particle filter tracking in infrared videos

Neurocomputing
Visual tracking via weakly supervised learning from multiple imperfect oracles

Pattern Recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Visual tracking can be treated as a parameter estimation problem that infers target states based on image observations from video sequences. A richer target representation may incur better chances of successful tracking in cluttered and dynamic environments, and thus enhance the robustness. Richer representations can be constructed by either specifying a detailed model of a single cue or combining a set of rough models of multiple cues. Both approaches increase the dimensionality of the state space, which results in a dramatic increase of computation. To investigate the integration of rough models from multiple cues and to explore computationally efficient algorithms, this paper formulates the problem of multiple cue integration and tracking in a probabilistic framework based on a factorized graphical model. Structured variational analysis of such a graphical model factorizes different modalities and suggests a co-inference process among these modalities. Based on the importance sampling technique, a sequential Monte Carlo algorithm is proposed to provide an efficient simulation and approximation of the co-inferencing of multiple cues. This algorithm runs in real-time at around 30 Hz. Our extensive experiments show that the proposed algorithm performs robustly in a large variety of tracking scenarios. The approach presented in this paper has the potential to solve other problems including sensor fusion problems.