Learning in graphical models
Reconstruction of articulated objects from point correspondences in a single uncalibrated image
Computer Vision and Image Understanding
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
Photomosaics
ECCV '98 Proceedings of the 5th European Conference on Computer Vision-Volume II - Volume II
EP '98/RIDT '98 Proceedings of the 7th International Conference on Electronic Publishing, Held Jointly with the 4th International Conference on Raster Imaging and Digital Typography: Electronic Publishing, Artistic Imaging, and Digital Typography
Learning and Recognizing Human Dynamics in Video Sequences
CVPR '97 Proceedings of the 1997 Conference on Computer Vision and Pattern Recognition (CVPR '97)
Cardboard People: A Parameterized Model of Articulated Image Motion
FG '96 Proceedings of the 2nd International Conference on Automatic Face and Gesture Recognition (FG '96)
ICCV '95 Proceedings of the Fifth International Conference on Computer Vision
Estimating 3D Body Pose using Uncalibrated Cameras
Estimating 3D Body Pose using Uncalibrated Cameras
The Bayesian structural EM algorithm
UAI'98 Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence
Hi-index | 0.00 |
This paper introduces a novel probabilistic model for representing objects that change in appearance as a result of changes in pose, due to small deformations of their sub-parts and the relative spatial transformation of sub-parts of the object. We call the model a probabilistic montage. The model is based upon the idea that an image can be represented as a montage using many, small transformed and cropped patches from a collection of latent images. The approach is similar to that which might be employed by a police artist who might represent an image of a criminal suspect's face using a montage of face parts cut out of a "library" of face parts. In contrast, for our model, we learn the library of small latent images from a set of examples of objects that are changing in shape. In our approach, first the image is divided into a grid of sub-images. Each sub-image in the grid acts as window that crops a piece out of one of a collection of slightly larger images possible for that location in the image. We illustrate various probability models that can be used to encode the appropriate relationships for latent images and cropping transformations among the different patches. In this paper we present the complete algorithm for a tree-structured model. We show how the approach and model are able to find representations of the appearance of full body images of people in motion. We show how our approach can be used to learn representations of objects in an "unsupervised" manner and present results using our model for recognition and tracking purposes in a "supervised" manner.