Learning appearance and transparency manifolds of occluded objects in layers

  • Authors:
  • Brendan J. Frey, Nebojsa Jojic, Anitha Kannan

  • Affiliations:
  • University of Toronto, Toronto, ON, Canada; Microsoft Research, One Microsoft Way, Redmond, WA; University of Toronto, Toronto, ON, Canada

  • Venue:
  • CVPR '03: Proceedings of the 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition
  • Year:
  • 2003


Abstract

By mapping a set of input images to points in a low-dimensional manifold or subspace, it is possible to efficiently account for a small number of degrees of freedom. For example, images of a person walking can be mapped to a 1-dimensional manifold that measures the phase of the person's gait. However, when the object is moving around the frame and being occluded by other objects, standard manifold modeling techniques (e.g., principal components analysis, factor analysis, locally linear embedding) try to account for global motion and occlusion. We show how factor analysis can be incorporated into a generative model of layered, 2.5-dimensional vision, to jointly locate objects, resolve occlusion ambiguities, and learn models of the appearance manifolds of objects. We demonstrate the algorithm on a video consisting of four occluding objects, two of which are people who are walking, and occlude each other for most of the duration of the video. Whereas standard manifold modeling techniques fail to extract information about the gaits, the layered model successfully extracts a periodic representation of the gait of each person.
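To make the building block concrete, the sketch below fits a plain (non-layered) factor analyzer by EM on synthetic data: a 1-dimensional latent variable (standing in for gait phase) generates high-dimensional observations as x = Λz + μ + noise, and EM recovers the latent coordinate up to sign and scale. This is an illustrative sketch of standard factor analysis only, not the paper's layered model; all variable names and the synthetic data are assumptions for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic data: a 1-D latent "phase" z drives d observed dimensions
# via the factor-analysis generative model x = lam * z + mu + noise.
d, n = 5, 500
true_lam = rng.normal(size=(d, 1))
true_z = rng.normal(size=(1, n))
true_mu = rng.normal(size=(d, 1))
X = true_lam @ true_z + true_mu + 0.1 * rng.normal(size=(d, n))

def fit_factor_analysis(X, k=1, iters=100):
    """EM for a factor analyzer with k latent dimensions."""
    d, n = X.shape
    mu = X.mean(axis=1, keepdims=True)
    Xc = X - mu
    lam = rng.normal(scale=0.1, size=(d, k))   # factor loadings
    psi = np.ones(d)                           # diagonal noise variances
    for _ in range(iters):
        # E-step: posterior over z given each observation,
        # E[z|x] = G lam^T psi^-1 (x - mu), G = (I + lam^T psi^-1 lam)^-1
        inv_psi = 1.0 / psi
        G = np.linalg.inv(np.eye(k) + (lam.T * inv_psi) @ lam)
        Ez = G @ (lam.T * inv_psi) @ Xc        # k x n posterior means
        Ezz = n * G + Ez @ Ez.T                # summed E[z z^T]
        # M-step: re-estimate loadings and noise variances
        lam = (Xc @ Ez.T) @ np.linalg.inv(Ezz)
        psi = np.maximum(
            np.mean(Xc**2, axis=1) - np.mean((lam @ Ez) * Xc, axis=1),
            1e-6,
        )
    return mu, lam, psi, Ez

mu_hat, lam_hat, psi_hat, Ez = fit_factor_analysis(X)

# The recovered factor should match the true phase up to sign/scale.
corr = np.corrcoef(Ez.ravel(), true_z.ravel())[0, 1]
print("recovered-factor correlation magnitude:", round(abs(corr), 3))
```

The paper's contribution is to embed exactly this kind of appearance model inside each layer of a generative occlusion model, so that the latent coordinate is inferred jointly with each object's position and transparency mask rather than from raw frames.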