Existing GPU programming interfaces require applications to adopt a graphics-centric programming model exported by a device driver tuned for real-time graphics and games. This programming model, however, hinders the development and performance of non-graphics applications by imposing a graphics policy for program execution and hiding hardware resources. We present a new virtual machine abstraction for GPUs that provides policy-free, low-level access to the hardware and is designed for high-performance, data-parallel applications.