Parallel volume rendering on a single-chip SIMD architecture

Authors:
M. Meißner;S. Grimm;W. Straßer;J. Packer;D. Latimer
Affiliations:
University of Tübingen, Auf der Morgenstelle 10/C9, D-72076 Tübingen, Germany;University of Tübingen, Auf der Morgenstelle 10/C9, D-72076 Tübingen, Germany;University of Tübingen, Auf der Morgenstelle 10/C9, D-72076 Tübingen, Germany;ClearSpeed Technology Ltd., Wallscourt Farm, Filton Road, Bristol, UK;ClearSpeed Technology Ltd., Wallscourt Farm, Filton Road, Bristol, UK
Venue:
PVG '01 Proceedings of the IEEE 2001 symposium on parallel and large-data visualization and graphics
Year:
2001

Citing 21
Cited 8

PixelFlow: high-speed rendering using image composition

SIGGRAPH '92 Proceedings of the 19th annual conference on Computer graphics and interactive techniques
Volume rendering on the MasPar MP-1

VVS '92 Proceedings of the 1992 workshop on Volume visualization
Volume rendering on scalable shared-memory MIMD architectures

VVS '92 Proceedings of the 1992 workshop on Volume visualization
Segmented ray casting for data parallel volume rendering

PRS '93 Proceedings of the 1993 symposium on Parallel rendering
A data distributed, parallel algorithm for ray-traced volume rendering

PRS '93 Proceedings of the 1993 symposium on Parallel rendering
Parallel volume rendering and data coherence

PRS '93 Proceedings of the 1993 symposium on Parallel rendering
Permutation warping for data parallel volume rendering

PRS '93 Proceedings of the 1993 symposium on Parallel rendering
An efficient parallel ray tracing scheme for distributed memory parallel computers

PRS '93 Proceedings of the 1993 symposium on Parallel rendering
Parallel volume-rendering algorithm performance on mesh-connected multicomputers

PRS '93 Proceedings of the 1993 symposium on Parallel rendering
Fast volume rendering using a shear-warp factorization of the viewing transformation

SIGGRAPH '94 Proceedings of the 21st annual conference on Computer graphics and interactive techniques
Fast volume rendering using an efficient, scalable parallel formulation of the shear-warp algorithm

PRS '95 Proceedings of the IEEE symposium on Parallel rendering
Real-time volume rendering on shared memory multiprocessors using the shear-warp factorization

PRS '95 Proceedings of the IEEE symposium on Parallel rendering
PixelFlow: the realization

HWWS '97 Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware
ParVox: a parallel splatting volume rendering system for distributed visualization

PRS '97 Proceedings of the IEEE symposium on Parallel rendering
Exploiting deep parallel memory hierarchies for ray casting volume rendering

PRS '97 Proceedings of the IEEE symposium on Parallel rendering
Parallel processing of the shear-warp factorization with the binary-swap method on a distributed-memory multiprocessor system

PRS '97 Proceedings of the IEEE symposium on Parallel rendering
PAVLOV: a programmable architecture for volume processing

HWWS '98 Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware
Jim Blinn's corner: dirty pixels

Jim Blinn's corner: dirty pixels
The VolumePro real-time ray-casting system

Proceedings of the 26th annual conference on Computer graphics and interactive techniques
The ULTRAVIS system

VVS '00 Proceedings of the 2000 IEEE symposium on Volume visualization
Design of an on-chip reflectance map

EGGH'95 Proceedings of the Tenth Eurographics conference on Graphics Hardware

Massively Parallel Solutions for Molecular Sequence Analysis

IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
A Novel Predication Scheme for a SIMD System-on-Chip

Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Efficient mapping of hierarchical trees on coarse-grain reconfigurable architectures

Proceedings of the 2nd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
Low Energy Data Management for Different On-Chip Memory Levels in Multi-Context Reconfigurable Architectures

DATE '03 Proceedings of the conference on Design, Automation and Test in Europe - Volume 1
Low Power Data Prefetch for 3D Image Applications on Coarse-Grain Reconfigurable Architectures

IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 3 - Volume 04
A modular integer GCD algorithm

Journal of Algorithms
A modular integer GCD algorithm

Journal of Algorithms
Survey of parallel and distributed volume rendering: revisited

ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part III

Quantified Score

Hi-index	0.00

Visualization

Abstract

Volume rendering has great potential for parallelization due to the tremendous number of computations necessary. Besides the enormous computational power needed, the memory interface is usually of crucial importance and frequently the bottleneck.This paper presents an implementation of a parallel ray casting algorithm for orthogonal projections on a new single-chip SIMD architecture. Concurrent processing of rays is scheduled such that redundant memory accesses of the individual processing elements can be detected by the channel controller. Hence, data can be read efficiently in block-wise manner. For improved image quality, a permutation of the Shear-Warp algorithm with trilinear interpolation is used. The steps of the ray casting algorithm are carefully mapped onto the architecture avoiding expensive floating point operation, giving superior performance over previously reported results. A detailed analysis illustrates the timing of the individual computations and memory accesses, identifying the costliest parts of the implementation.