When Pyramids Learned Walking

  • Authors: Walter G. Kropatsch
  • Affiliation: PRIP, Vienna University of Technology, Austria
  • Venue: CIARP '09, Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
  • Year: 2009


Abstract

A temporal image sequence increases the dimension of the data simply by stacking images above each other, which further raises the computational complexity of the processing. The typical content of a pixel or a voxel is its grey or color value; with some processing, features and fitted model parameters are added. In a pyramid these values are repeatedly summarized in a stack of images or image descriptions with a constant factor of reduction. This is the source of their efficiency: global information can be transmitted in $\log(\mbox{diameter})$ steps. Content propagates bottom-up by reduction functions such as inheritance or filters, and top-down by expansion functions such as interpolation or projection. Moving objects occlude different parts of the image background. Computing one pyramid per frame requires a large amount of bottom-up computation and very complex, time-consuming updating. In the new concept we propose one pyramid per object and one pyramid for the background. The connection between the two is established by coordinates coded in the pyramidal cells, much as in a Laplacian pyramid or a wavelet decomposition. We envision that this code will be stored in each cell and will be invariant to the basic movements of the object. All the information about the position and orientation of the object is concentrated in the apex. New positions are calculated for the apex and can be accurately reconstructed for every cell in a top-down process. At the new pixel locations the expected content can be verified by comparing it with the actual image frame.
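The abstract's core mechanics can be illustrated with a generic sketch (not the author's actual method): a bottom-up reduction with a constant factor of 2 yields a pyramid whose depth grows as $\log(\mbox{diameter})$, with the apex summarizing the whole image, and a top-down expansion projecting apex-level content back toward the base. The function names and the choice of block averaging are illustrative assumptions.

```python
import numpy as np

def reduce_level(img):
    # One bottom-up reduction step: summarize 2x2 blocks by their mean
    # (a constant reduction factor of 2 in each dimension).
    h, w = img.shape[0] - img.shape[0] % 2, img.shape[1] - img.shape[1] % 2
    return img[:h, :w].reshape(h // 2, 2, w // 2, 2).mean(axis=(1, 3))

def build_pyramid(img):
    # Repeat the reduction until a single cell (the apex) remains.
    levels = [img]
    while min(levels[-1].shape) > 1:
        levels.append(reduce_level(levels[-1]))
    return levels

def expand_level(img, shape):
    # One top-down expansion step: project each cell onto its children
    # (nearest-neighbour; interpolation would be a smoother choice).
    return np.repeat(np.repeat(img, 2, axis=0), 2, axis=1)[:shape[0], :shape[1]]

img = np.arange(64, dtype=float).reshape(8, 8)
pyr = build_pyramid(img)
# An 8x8 image yields log2(8) + 1 = 4 levels: 8x8, 4x4, 2x2, 1x1 (the apex).
print(len(pyr), pyr[-1].shape)
```

In this toy version the apex holds only the global mean; in the proposed concept it would instead concentrate the object's position and orientation, from which every cell's new location is reconstructed top-down.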