Efficient hardware implementation of Ray Tracing based on an embedded software for intersection computation

Authors:
Alexandre S. Nery;Nadia Nedjah;Felipe M. G. FrançA
Affiliations:
LAM - Computer Architecture and Microelectronics Laboratory Systems Engineering and Computer Science Program, COPPE Universidade Federal do Rio de Janeiro, Brazil;Department of Electronics Engineering and Telecommunications, Faculty of Engineering Universidade do Estado do Rio de Janeiro, Brazil;LAM - Computer Architecture and Microelectronics Laboratory Systems Engineering and Computer Science Program, COPPE Universidade Federal do Rio de Janeiro, Brazil
Venue:
Journal of Systems Architecture: the EUROMICRO Journal
Year:
2013

Citing 22
Cited 1

An introduction to ray tracing

An introduction to ray tracing
ARTS: accelerated ray-tracing system

Tutorial: computer graphics; image synthesis
An improved illumination model for shaded display

Communications of the ACM
Illumination for computer generated pictures

Communications of the ACM
The ray engine

Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
On the efficiency of ray-shooting acceleration schemes

SCCG '02 Proceedings of the 18th spring conference on Computer graphics
Realtime ray tracing of dynamic scenes on an FPGA chip

Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
Reconfigurable Designs for Ray Tracing

FCCM '01 Proceedings of the the 9th Annual IEEE Symposium on Field-Programmable Custom Computing Machines
RPU: a programmable ray processing unit for realtime ray tracing

ACM SIGGRAPH 2005 Papers
Ray tracing animated scenes using coherent grid traversal

ACM SIGGRAPH 2006 Papers
Exploring the use of ray tracing for future games

Proceedings of the 2006 ACM SIGGRAPH symposium on Videogames
Custom hardware support for realtime ray tracing

SIGGRAPH '05 ACM SIGGRAPH 2005 Courses
Ray tracing on programmable graphics hardware

SIGGRAPH '05 ACM SIGGRAPH 2005 Courses
Ray Tracing from the Ground Up

Ray Tracing from the Ground Up
Using FPGAs to Supplement Ray-Tracing Computations on the Cray XD-1

HPCMP-UGC '07 Proceedings of the 2007 DoD High Performance Computing Modernization Program Users Group Conference
Toward a multicore architecture for real-time ray-tracing

Proceedings of the 41st annual IEEE/ACM International Symposium on Microarchitecture
A parallel algorithm for construction of uniform grids

Proceedings of the Conference on High Performance Graphics 2009
Ray tracing on a cell cluster for virtual environments

Proceedings of the 2009 Computer Graphics International Conference
GridRT: A Massively Parallel Architecture for Ray-Tracing Using Uniform Grids

DSD '09 Proceedings of the 2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools
A massively parallel hardware architecture for ray-tracing

International Journal of High Performance Systems Architecture
Programming Massively Parallel Processors: A Hands-on Approach

Programming Massively Parallel Processors: A Hands-on Approach
Interactive distributed ray tracing of highly complex models

EGWR'01 Proceedings of the 12th Eurographics conference on Rendering

Parallel processing of intersections for ray-tracing in application-specific processors and GPGPUs

Microprocessors & Microsystems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Parallel implementations of Ray Tracing have been enabling real time performance, as the algorithm is embarrassingly parallel. However, in order to achieve both interactivity and real time performance, the algorithm should run at a high frame rates, i.e. at least 60 frames per second. Thus, a custom parallel design in hardware is likely to achieve high rendering performance. In this paper, we improve the GridRT architecture presented in previous work. GridRT is capable of dealing with the main desirable features of Ray Tracing, such as shadows and reflection effects, imposing low area cost and a promising rendering performance. As to this work, an application-specific instruction has been added and the underlaying computation embedded into the processor's microprogram in order to calculate the ray-triangle intersection computations. These computations are performed in pipeline, whenever possible, yielding to a considerable reduction in terms of cycles per intersection test. The presented architecture is based on the uniform grid acceleration structure. It allows for a massive twofold parallelism: parallel ray-triangle intersection tests as well as parallel processing of many rays. A hardware implementation of the improved architecture is presented, together with the corresponding performance results and resources requirements. The rendering time is reduced by 80% using a grid configuration of eight processing elements and each intersection computation time is reduced by 50% with respect to the original GridRT implementation.