Parallel processing on NVIDIA graphics processing units using CUDA

Authors:
Erik Wynters
Affiliations:
Bloomsburg University of Pennsylvania, Bloomsburg, PA
Venue:
Journal of Computing Sciences in Colleges
Year:
2011

Citing 4
Cited 0

An updated set of basic linear algebra subprograms (BLAS)

ACM Transactions on Mathematical Software (TOMS)
GPU Gems: Programming Techniques, Tips and Tricks for Real-Time Graphics

GPU Gems: Programming Techniques, Tips and Tricks for Real-Time Graphics
Scalable Parallel Programming with CUDA

Queue - GPU Computing
Accelerating large graph algorithms on the GPU using CUDA

HiPC'07 Proceedings of the 14th international conference on High performance computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper is an introduction to general-purpose computing on graphics processing units. This involves taking advantage of the parallel processing power of modern graphics cards to do general purpose computation. The CUDA architecture used for general purpose computations on NVIDIA graphics cards is described, and important features affecting the run times of CUDA programs are discussed. Experimental results showing the potential for obtaining speedups by two or three orders of magnitude will be presented, showing that CUDA is a cost-effective way to make high-performance computing widely available to programmers and consumers. In colleges, graphics cards can be used to make hands-on experience in massively parallel processing an easily obtained component of courses or research programs.