Learning Montages of Transformed Latent Images as Representations of Objects That Change in Appearance

Authors:
Chris Pal;Brendan J. Frey;Nebojsa Jojic
Affiliations:
-;-;-
Venue:
ECCV '02 Proceedings of the 7th European Conference on Computer Vision-Part IV
Year:
2002

Citing 11
Cited 0

Learning in graphical models

Learning in graphical models
Reconstruction of articulated objects from point correspondences in a single uncalibrated image

Computer Vision and Image Understanding
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference

Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
Photomosaics

Photomosaics
Active Appearance Models

ECCV '98 Proceedings of the 5th European Conference on Computer Vision-Volume II - Volume II
Image Mosaics

EP '98/RIDT '98 Proceedings of the 7th International Conference on Electronic Publishing, Held Jointly with the 4th International Conference on Raster Imaging and Digital Typography: Electronic Publishing, Artistic Imaging, and Digital Typography
Learning and Recognizing Human Dynamics in Video Sequences

CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Cardboard People: A Parameterized Model of Articulated Image Motion

FG '96 Proceedings of the 2nd International Conference on Automatic Face and Gesture Recognition (FG '96)
Tracking and recognizing rigid and non-rigid facial motions using local parametric models of image motion

ICCV '95 Proceedings of the Fifth International Conference on Computer Vision
Estimating 3D Body Pose using Uncalibrated Cameras

Estimating 3D Body Pose using Uncalibrated Cameras
The Bayesian structural EM algorithm

UAI'98 Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper introduces a novel probabilistic model for representing objects that change in appearance as a result of changes in pose, due to small deformations of their sub-parts and the relative spatial transformation of sub-parts of the object. We call the model a probabilistic montage. The model is based upon the idea that an image can be represented as a montage using many, small transformed and cropped patches from a collection of latent images. The approach is similar to that which might be employed by a police artist who might represent an image of a criminal suspect's face using a montage of face parts cut out of a "library" of face parts. In contrast, for our model, we learn the library of small latent images from a set of examples of objects that are changing in shape. In our approach, first the image is divided into a grid of sub-images. Each sub-image in the grid acts as window that crops a piece out of one of a collection of slightly larger images possible for that location in the image. We illustrate various probability models that can be used to encode the appropriate relationships for latent images and cropping transformations among the different patches. In this paper we present the complete algorithm for a tree-structured model. We show how the approach and model are able to find representations of the appearance of full body images of people in motion. We show how our approach can be used to learn representations of objects in an "unsupervised" manner and present results using our model for recognition and tracking purposes in a "supervised" manner.