PixelFlow: high-speed rendering using image composition
SIGGRAPH '92 Proceedings of the 19th annual conference on Computer graphics and interactive techniques
Volume rendering on the MasPar MP-1
VVS '92 Proceedings of the 1992 workshop on Volume visualization
Volume rendering on scalable shared-memory MIMD architectures
VVS '92 Proceedings of the 1992 workshop on Volume visualization
Segmented ray casting for data parallel volume rendering
PRS '93 Proceedings of the 1993 symposium on Parallel rendering
A data distributed, parallel algorithm for ray-traced volume rendering
PRS '93 Proceedings of the 1993 symposium on Parallel rendering
Parallel volume rendering and data coherence
PRS '93 Proceedings of the 1993 symposium on Parallel rendering
Permutation warping for data parallel volume rendering
PRS '93 Proceedings of the 1993 symposium on Parallel rendering
An efficient parallel ray tracing scheme for distributed memory parallel computers
PRS '93 Proceedings of the 1993 symposium on Parallel rendering
Parallel volume-rendering algorithm performance on mesh-connected multicomputers
PRS '93 Proceedings of the 1993 symposium on Parallel rendering
Fast volume rendering using a shear-warp factorization of the viewing transformation
SIGGRAPH '94 Proceedings of the 21st annual conference on Computer graphics and interactive techniques
Fast volume rendering using an efficient, scalable parallel formulation of the shear-warp algorithm
PRS '95 Proceedings of the IEEE symposium on Parallel rendering
Real-time volume rendering on shared memory multiprocessors using the shear-warp factorization
PRS '95 Proceedings of the IEEE symposium on Parallel rendering
HWWS '97 Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware
ParVox: a parallel splatting volume rendering system for distributed visualization
PRS '97 Proceedings of the IEEE symposium on Parallel rendering
Exploiting deep parallel memory hierarchies for ray casting volume rendering
PRS '97 Proceedings of the IEEE symposium on Parallel rendering
PRS '97 Proceedings of the IEEE symposium on Parallel rendering
PAVLOV: a programmable architecture for volume processing
HWWS '98 Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware
Jim Blinn's corner: dirty pixels
Jim Blinn's corner: dirty pixels
The VolumePro real-time ray-casting system
Proceedings of the 26th annual conference on Computer graphics and interactive techniques
VVS '00 Proceedings of the 2000 IEEE symposium on Volume visualization
Design of an on-chip reflectance map
EGGH'95 Proceedings of the Tenth Eurographics conference on Graphics Hardware
Massively Parallel Solutions for Molecular Sequence Analysis
IPDPS '02 Proceedings of the 16th International Parallel and Distributed Processing Symposium
A Novel Predication Scheme for a SIMD System-on-Chip
Euro-Par '02 Proceedings of the 8th International Euro-Par Conference on Parallel Processing
Efficient mapping of hierarchical trees on coarse-grain reconfigurable architectures
Proceedings of the 2nd IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis
DATE '03 Proceedings of the conference on Design, Automation and Test in Europe - Volume 1
Low Power Data Prefetch for 3D Image Applications on Coarse-Grain Reconfigurable Architectures
IPDPS '05 Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 3 - Volume 04
A modular integer GCD algorithm
Journal of Algorithms
A modular integer GCD algorithm
Journal of Algorithms
Survey of parallel and distributed volume rendering: revisited
ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part III
Hi-index | 0.00 |
Volume rendering has great potential for parallelization due to the tremendous number of computations necessary. Besides the enormous computational power needed, the memory interface is usually of crucial importance and frequently the bottleneck.This paper presents an implementation of a parallel ray casting algorithm for orthogonal projections on a new single-chip SIMD architecture. Concurrent processing of rays is scheduled such that redundant memory accesses of the individual processing elements can be detected by the channel controller. Hence, data can be read efficiently in block-wise manner. For improved image quality, a permutation of the Shear-Warp algorithm with trilinear interpolation is used. The steps of the ray casting algorithm are carefully mapped onto the architecture avoiding expensive floating point operation, giving superior performance over previously reported results. A detailed analysis illustrates the timing of the individual computations and memory accesses, identifying the costliest parts of the implementation.