Predicting reflectance functions from complex surfaces
SIGGRAPH '92 Proceedings of the 19th annual conference on Computer graphics and interactive techniques
Interactive Distributed Ray Tracing of Highly Complex Models
Proceedings of the 12th Eurographics Workshop on Rendering Techniques
High quality rendering using ray tracing and photon mapping
ACM SIGGRAPH 2007 courses
Real-time KD-tree construction on graphics hardware
ACM SIGGRAPH Asia 2008 papers
Understanding the efficiency of ray traversal on GPUs
Proceedings of the Conference on High Performance Graphics 2009
Stream compaction for deferred shading
Proceedings of the Conference on High Performance Graphics 2009
Accelerating large graph algorithms on the GPU using CUDA
HiPC'07 Proceedings of the 14th international conference on High performance computing
Cache-oblivious ray reordering
ACM Transactions on Graphics (TOG)
Improving SIMD efficiency for parallel Monte Carlo light transport on the GPU
Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics
Active thread compaction for GPU path tracing
Proceedings of the ACM SIGGRAPH Symposium on High Performance Graphics
Fast, effective BVH updates for animated scenes
I3D '12 Proceedings of the ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games
Megakernels considered harmful: wavefront path tracing on GPUs
Proceedings of the 5th High-Performance Graphics Conference
Hi-index | 0.00 |
As one of the essential global illumination algorithms, Monte Carlo path tracing has long been considered as a typical irregular problem that is less friendly for graphics hardware. To improve the efficiency of Monte Carlo path tracing, such techniques as ray reordering, ray compaction and wavefront formulation have been proposed to exploit the inherent coherence in processing different paths and materials for better SIMD efficiency on GPUs. In this paper, we develop a novel technique to extract extra parallelism in Monte Carlo path tracing applications by identifying hidden coherence. The basic idea is to perform a partial traversal in the fast on-chip memory of GPU and then identify coherent paths by analyzing the traversal results as well as other features of rays. Such coherence enables a higher level of parallelism that not only compensates the overhead of traversal, but also leads to improved performance. Experiments prove that our techniques deliver a traversal throughput higher than leading-edge results by up to 15%.