A refactoring tool to extract GPU kernels

Authors:
Kostadin Damevski;Madhan Muralimanohar
Affiliations:
Virginia State University, Petersburg, VA, USA;Virginia State University, Petersburg, VA, USA
Venue:
Proceedings of the 4th Workshop on Refactoring Tools
Year:
2011

Citing 5
Cited 2

SUIF Explorer: an interactive and interprocedural parallelizer

Proceedings of the seventh ACM SIGPLAN symposium on Principles and practice of parallel programming
Software Development Environments for Scientific and Engineering Software: A Series of Case Studies

ICSE '07 Proceedings of the 29th international conference on Software Engineering
Relooper: refactoring for loop parallelism in Java

Proceedings of the 24th ACM SIGPLAN conference companion on Object oriented programming systems languages and applications
Towards a framework for abstracting accelerators in parallel applications: experience with cell

Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis
Scientific Computing's Productivity Gridlock: How Software Engineering Can Help

IEEE Design & Test

CUDACL+: a framework for GPU programs

Proceedings of the ACM international conference companion on Object oriented programming systems languages and applications companion
Reconciling manual and automatic refactoring

Proceedings of the 34th International Conference on Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

Significant performance gains can be achieved by using hardware architectures that integrate GPUs with conventional CPUs to form a hybrid and highly parallel computational engine. However, programming these novel architectures is tedious and error prone, reducing their ease of acceptance in an even wider range of computationally intensive applications. In this paper we discuss a refactoring technique, called Extract Kernel that transforms a loop written in C into a parallel function that uses NVIDIA's CUDA framework to execute on a GPU. The selected approach and the challenges encountered are described, as well as some early results that demonstrate the potential of this refactoring.