Lie Group Transformations of Objects in Video Images

Authors:
Mike Alder
Affiliations:
School of Mathematics and Statistics, The University of Western Australia, Crawley, Australia 6009
Venue:
Journal of Mathematical Imaging and Vision
Year:
2006

Citing 2
Cited 0

A hierarchical approach to line extraction based on the Hough transform

Computer Vision, Graphics, and Image Processing
Statistical trajectory models for phonetic recognition

Statistical trajectory models for phonetic recognition

Quantified Score

Hi-index	0.00

Visualization

Abstract

Suppose we take a set of images obtained from videophotography of a moving object in three dimensions, such as an aeroplane, and that we compute moments or fourier descriptors, or some other set of smooth features of the resulting image, to get a vector in a feature space F n which describes the image. Then the different orientations and positions of the object in space are generated by a local Lie group, a neighbourhood of the identity in the group SE(3), and provided we compute enough moments or other descriptors, i.e. provided n is big enough, and provided the object is not symmetric, the result is to give a smooth injection of this group of transformations into the feature space. The value of n which is `big enough' is almost always 2d + 1, where d is the dimension of the group, which for rigid objects under translation and rotation is six. This result has applications to object recognition in video images and to the problem of interpolating between different views of an object, as naive interpolations in the Feature space give erroneous results. It extends to moving objects with more degrees of freedom such as robots. In this paper we state and prove the above result formally and illustrate it with synthetic images of a `robot'.