BSGP: bulk-synchronous GPU programming

Authors:
Qiming Hou;Kun Zhou;Baining Guo
Affiliations:
Tsinghua University;Microsoft Research Asia;Tsinghua University and Microsoft Research Asia
Venue:
ACM SIGGRAPH 2008 papers
Year:
2008

Citing 21
Cited 19

A bridging model for parallel computation

Communications of the ACM
Constant propagation with conditional branches

ACM Transactions on Programming Languages and Systems (TOPLAS)
Efficiently computing static single assignment form and the control dependence graph

ACM Transactions on Programming Languages and Systems (TOPLAS)
CHARM++: a portable concurrent object oriented system based on C++

OOPSLA '93 Proceedings of the eighth annual conference on Object-oriented programming systems, languages, and applications
A shading language on graphics hardware: the pixelflow shading system

Proceedings of the 25th annual conference on Computer graphics and interactive techniques
BSPlib: The BSP programming library

Parallel Computing
Interactive multi-pass programmable shading

Proceedings of the 27th annual conference on Computer graphics and interactive techniques
A real-time procedural shading system for programmable graphics hardware

Proceedings of the 28th annual conference on Computer graphics and interactive techniques
Watertight tessellation using forward differencing

Proceedings of the ACM SIGGRAPH/EUROGRAPHICS workshop on Graphics hardware
Shader metaprogramming

Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
Parallelism in random access machines

STOC '78 Proceedings of the tenth annual ACM symposium on Theory of computing
Cg: a system for programming graphics hardware in a C-like language

ACM SIGGRAPH 2003 Papers
Flux: lightweight, standards-based Web graphics in XML

ACM SIGGRAPH 2003 Web Graphics
Brook for GPUs: stream computing on graphics hardware

ACM SIGGRAPH 2004 Papers
Shader algebra

ACM SIGGRAPH 2004 Papers
Metaprogramming GPUs with Sh

Metaprogramming GPUs with Sh
KD-tree acceleration structures for a GPU raytracer

Proceedings of the ACM SIGGRAPH/EUROGRAPHICS conference on Graphics hardware
Accelerator: using data parallelism to program GPUs for general-purpose uses

Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
Interactive k-d tree GPU raytracing

Proceedings of the 2007 symposium on Interactive 3D graphics and games
Scan primitives for GPU computing

Proceedings of the 22nd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware
GPU-ABiSort: optimal parallel sorting on stream architectures

IPDPS'06 Proceedings of the 20th international conference on Parallel and distributed processing

An efficient GPU-based approach for interactive global illumination

ACM SIGGRAPH 2009 papers
Fast motion deblurring

ACM SIGGRAPH Asia 2009 papers
Ray casting of multiple volumetric datasets with polyhedral boundaries on manycore GPUs

ACM SIGGRAPH Asia 2009 papers
Debugging GPU stream programs through automatic dataflow recording and visualization

ACM SIGGRAPH Asia 2009 papers
RenderAnts: interactive Reyes rendering on GPUs

ACM SIGGRAPH Asia 2009 papers
The virtual marathon: parallel computing supports crowd simulations

IEEE Computer Graphics and Applications - Special issue on non-photorealistic rendering a virtual environment for teaching social skills
Micropolygon ray tracing with defocus and motion blur

ACM SIGGRAPH 2010 papers
memCUDA: map device memory to host memory on GPGPU platform

NPC'10 Proceedings of the 2010 IFIP international conference on Network and parallel computing
A bridging model for multi-core computing

Journal of Computer and System Sciences
Algorithm engineering: bridging the gap between algorithm theory and practice

Algorithm engineering: bridging the gap between algorithm theory and practice
FPGA vs. multi-core CPUs vs. GPUs: hands-on experience with a sorting application

Facing the multicore-challenge
FPGA vs. multi-core CPUs vs. GPUs: hands-on experience with a sorting application

Facing the multicore-challenge
PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation

Parallel Computing
An object-oriented bulk synchronous parallel library for multicore programming

Concurrency and Computation: Practice & Experience
Softshell: dynamic scheduling on GPUs

ACM Transactions on Graphics (TOG) - Proceedings of ACM SIGGRAPH Asia 2012
Data-Parallel Decompression of Triangle Mesh Topology

Computer Graphics Forum
Scalable Programmable Motion Effects on GPUs

Computer Graphics Forum
From physics model to results: an optimizing framework for cross-architecture code generation

Proceedings of the Extreme Scaling Workshop
From physics model to results: An optimizing framework for cross-architecture code generation

Scientific Programming

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present BSGP, a new programming language for general purpose computation on the GPU. A BSGP program looks much the same as a sequential C program. Programmers only need to supply a bare minimum of extra information to describe parallel processing on GPUs. As a result, BSGP programs are easy to read, write, and maintain. Moreover, the ease of programming does not come at the cost of performance. A well-designed BSGP compiler converts BSGP programs to kernels and combines them using optimally allocated temporary streams. In our benchmark, BSGP programs achieve similar or better performance than well-optimized CUDA programs, while the source code complexity and programming time are significantly reduced. To test BSGP's code efficiency and ease of programming, we implemented a variety of GPU applications, including a highly sophisticated X3D parser that would be extremely difficult to develop with existing GPU programming languages.