Aggressive Value Prediction on a GPU

Authors:
Enqiang Sun;David Kaeli
Affiliations:
Department of Electrical and Computer Engineering, Northeastern University, Boston, USA;Department of Electrical and Computer Engineering, Northeastern University, Boston, USA
Venue:
International Journal of Parallel Programming
Year:
2014

Citing 16
Cited 0

Exceeding the dataflow limit via value prediction

Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
The performance potential of data dependence speculation & collapsing

Proceedings of the 29th annual ACM/IEEE international symposium on Microarchitecture
The predictability of data values

MICRO 30 Proceedings of the 30th annual ACM/IEEE international symposium on Microarchitecture
Data speculation support for a chip multiprocessor

Proceedings of the eighth international conference on Architectural support for programming languages and operating systems
Value prediction for speculative multithreaded architectures

Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
Techniques for speculative run-time parallelization of loops

SC '98 Proceedings of the 1998 ACM/IEEE conference on Supercomputing
Toward efficient and robust software speculative parallelization on multiprocessors

Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
The Potential for Using Thread-Level Data Speculation to Facilitate Automatic Parallelization

HPCA '98 Proceedings of the 4th International Symposium on High-Performance Computer Architecture
In Search of Speculative Thread-Level Parallelism

PACT '99 Proceedings of the 1999 International Conference on Parallel Architectures and Compilation Techniques
The R-LRPD Test: Speculative Parallelization of Partially Parallel Loops

The R-LRPD Test: Speculative Parallelization of Partially Parallel Loops
SPEC CPU2006 benchmark tools

ACM SIGARCH Computer Architecture News
Parallelization spectroscopy: analysis of thread-level parallelism in hpc programs

Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
Potential Impact of Value Prediction on Communication in Many-Core Architectures

IEEE Transactions on Computers
Multi GPU implementation of iterative tomographic reconstruction algorithms

ISBI'09 Proceedings of the Sixth IEEE international conference on Symposium on Biomedical Imaging: From Nano to Macro
Speculative Execution on GPU: An Exploratory Study

ICPP '10 Proceedings of the 2010 39th International Conference on Parallel Processing
Benchmarking modern multiprocessors

Benchmarking modern multiprocessors

Quantified Score

Hi-index	0.00

Visualization

Abstract

To obtain significant execution speedups, GPUs rely heavily on the inherent data-level parallelism present in the targeted application. However, application programs may not always be able to fully utilize these parallel computing resources due to intrinsic data dependencies or complex data pointer operations. In this paper, we explore how to leverage aggressive software-based value prediction techniques on a GPU to accelerate programs that lack inherent data parallelism. This class of applications are typically difficult to map to parallel architectures due to the presence of data dependencies and complex data pointer manipulation present in these applications. Our experimental results show that, despite the overhead incurred due to software speculation and the communication overhead between the CPU and GPU, we obtain up to 6.5$$\times $$ speedup on a selected set of kernels taken from the SPEC CPU2006, PARSEC and Sequoia benchmark suites.