Learning graphical models of images, videos and their spatial transformations

Authors:
Brendan J. Frey;Nebojsa Jojic
Affiliations:
Computer Science, University of Waterloo;Electrical and Computer Engineering, University of Illinois at Urbana
Venue:
UAI'00 Proceedings of the Sixteenth conference on Uncertainty in artificial intelligence
Year:
2000

Citing 5
Cited 1

GTM: the generative topographic mapping

Neural Computation
A Database for Handwritten Text Recognition Research

IEEE Transactions on Pattern Analysis and Machine Intelligence
Efficient Pattern Recognition Using a New Transformation Distance

Advances in Neural Information Processing Systems 5, [NIPS Conference]
Transformed Component Analysis: Joint Estimation of Spatial Transformations and Image Components

ICCV '99 Proceedings of the International Conference on Computer Vision-Volume 2 - Volume 2
Modeling the manifolds of images of handwritten digits

IEEE Transactions on Neural Networks

Intelligent multi-camera video surveillance: A review

Pattern Recognition Letters

Quantified Score

Hi-index	0.00

Visualization

Abstract

Mixtures of Gaussians, factor analyzers (probabilistic PCA) and hidden Markov models are staples of static and dynamic data modeling and image and video modeling in particular. We show how topographic transformations in the input, such as translation and shearing in images, can be accounted for in these models by including a discrete transformation variable. The resulting models perform clustering, dimensionality reduction and time-series analysis in a way that is invariant to transformations in the input. Using the EM algorithm, these transformation invariant models can be fit to static data and time series. We give results on filtering microscopy images, face and facial pose clustering, handwritten digit modeling and recognition, video clustering, object tracking, and removal of distractions from video sequences.