Data locality enhancement by memory reduction
ICS '01 Proceedings of the 15th international conference on Supercomputing
Skeletons for parallel image processing: an overview of the SKIPPER project
Parallel Computing - Special issue: Advanced environments for parallel and distributed computing
Brook for GPUs: stream computing on graphics hardware
ACM SIGGRAPH 2004 Papers
Concurrency and Computation: Practice & Experience
Program optimization space pruning for a multithreaded gpu
Proceedings of the 6th annual IEEE/ACM international symposium on Code generation and optimization
Scalable Parallel Programming with CUDA
Queue - GPU Computing
A compiler framework for optimization of affine loop nests for gpgpus
Proceedings of the 22nd annual international conference on Supercomputing
Generating GPU code from a high-level representation for image processing kernels
Euro-Par'11 Proceedings of the 2011 international conference on Parallel Processing
Proceedings of the 34th ACM SIGPLAN conference on Programming language design and implementation
Hi-index | 0.00 |
SIMT (Single-Instruction Multiple-Thread) is an emerging programming paradigm for high-performance computational accelerators, pioneered in current and next generation GPUs and hybrid CPUs. We present a domain-specific active-library supported approach to SIMT code generation and optimisation in the field of visual effects. Our approach uses high-level metadata and runtime context to guide and to ensure the correctness of optimisation-driven code transformations and to implement runtime-context-sensitive optimisations. Our advanced optimisations require no analysis of the original C++ kernel code and deliver 1.3x to 6.6x speedups over syntax-directed translation on GeForce 8800 GTX and GTX 260 GPUs with two commercial visual effects.