Analysis of a Parallel Volume Rendering System Based on the Shear-Warp Factorization

Author: Philippe Lacroute
Venue: IEEE Transactions on Visualization and Computer Graphics
Year: 1996



Abstract

This paper presents a parallel volume rendering algorithm that can render a 256 × 256 × 225 voxel medical data set at over 15 Hz and a 512 × 512 × 334 voxel data set at over 7 Hz on a 32-processor Silicon Graphics Challenge. The algorithm achieves these results by minimizing each of the three components of execution time: computation time, synchronization time, and data communication time. Computation time is low because the parallel algorithm is based on the recently reported shear-warp serial volume rendering algorithm, which is over five times faster than previous serial algorithms. The algorithm uses run-length encoding to exploit coherence and an efficient volume traversal to reduce overhead. Synchronization time is minimized by using dynamic load balancing and a task partition that minimizes synchronization events. Data communication costs are low because the algorithm is implemented for shared-memory multiprocessors, a class of machines with hardware support for low-latency fine-grain communication and hardware caching to hide latency.

We draw two conclusions from our implementation. First, we find that on shared-memory architectures data redistribution and communication costs do not dominate rendering time. Second, we find that cache locality requirements impose a limit on parallelism in volume rendering algorithms. Specifically, our results indicate that shared-memory machines with hundreds of processors would be useful only for rendering very large data sets.
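As a rough illustration of how run-length encoding can exploit coherence, the sketch below encodes one voxel scanline into runs of transparent and non-transparent voxels, so that a renderer can skip transparent runs without touching each voxel. It is a minimal sketch under stated assumptions, not the paper's data structure: the function names `encode_scanline` and `visible_indices`, the opacity threshold, and the sample scanline are all hypothetical.

```python
from typing import List, Tuple

# Hypothetical helper: a run is (is_visible, run_length).
Run = Tuple[bool, int]


def encode_scanline(opacities: List[float], threshold: float = 0.0) -> List[Run]:
    """Run-length encode a scanline of voxel opacities.

    A voxel counts as visible when its opacity exceeds `threshold`
    (an assumed classification rule, chosen for illustration).
    """
    runs: List[Run] = []
    for alpha in opacities:
        visible = alpha > threshold
        if runs and runs[-1][0] == visible:
            # Extend the current run.
            runs[-1] = (visible, runs[-1][1] + 1)
        else:
            # Start a new run.
            runs.append((visible, 1))
    return runs


def visible_indices(runs: List[Run]) -> List[int]:
    """Return the voxel indices a renderer would actually process,
    stepping over each transparent run in a single jump."""
    indices: List[int] = []
    pos = 0
    for visible, length in runs:
        if visible:
            indices.extend(range(pos, pos + length))
        pos += length
    return indices


if __name__ == "__main__":
    scanline = [0.0, 0.0, 0.3, 0.8, 0.0, 0.5]
    runs = encode_scanline(scanline)
    print(runs)
    print(visible_indices(runs))
```

In this toy scanline only three of the six voxels are visited. Real volume data sets typically contain long transparent runs, which is where the savings from this kind of encoding would come from.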