Efficient and accurate data dependence profiling using software signatures

Authors:
Rajeshwar Vanka;James Tuck
Affiliations:
North Carolina State University, Raleigh, NC;North Carolina State University, Raleigh, NC
Venue:
Proceedings of the Tenth International Symposium on Code Generation and Optimization
Year:
2012

Citing 15
Cited 1

Static and dynamic evaluation of data dependence analysis

ICS '93 Proceedings of the 7th international conference on Supercomputing
Context-sensitive interprocedural points-to analysis in the presence of function pointers

PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Which pointer analysis should I use?

Proceedings of the 2000 ACM SIGSOFT international symposium on Software testing and analysis
A compiler framework for speculative analysis and optimizations

PLDI '03 Proceedings of the ACM SIGPLAN 2003 conference on Programming language design and implementation
Compiler support for speculative multithreading architecture with probabilistic points-to analysis

Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
A compiler framework for speculative optimizations

ACM Transactions on Architecture and Code Optimization (TACO)
A General Compiler Framework for Speculative Optimizations Using Data Speculative Code Motion

Proceedings of the international symposium on Code generation and optimization
Bulk Disambiguation of Speculative Threads in Multiprocessors

Proceedings of the 33rd annual international symposium on Computer Architecture
A probabilistic pointer analysis for speculative optimizations

Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
LogTM-SE: Decoupling Hardware Transactional Memory from Caches

HPCA '07 Proceedings of the 2007 IEEE 13th International Symposium on High Performance Computer Architecture
Embla - Data Dependence Profiling for Parallel Programming

CISIS '08 Proceedings of the 2008 International Conference on Complex, Intelligent and Software Intensive Systems
Compiler-Driven Dependence Profiling to Guide Program Parallelization

Languages and Compilers for Parallel Computing
BulkCompiler: high-performance sequential consistency through cooperative compiler and hardware support

Proceedings of the 42nd Annual IEEE/ACM International Symposium on Microarchitecture
SD3: A Scalable Approach to Dynamic Data-Dependence Profiling

MICRO '43 Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture

Online dynamic dependence analysis for speculative polyhedral parallelization

Euro-Par'13 Proceedings of the 19th international conference on Parallel Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Speculative optimizations relax conservative constraints, like ambiguous memory-carried dependences that will rarely occur at runtime, to allow compilers to generate higher performing code. Data dependence profiling enables these techniques by providing runtime dependence information. However, prior data dependence profilers (DDPs) tend to be slow or have significant inaccuracy. Such techniques try to track all pairs of dependences that occur at runtime. To have high accuracy, it usually requires significant storage and performance overhead. Of course, accuracy can be sacrificed via sampling or fixed size storage to provide higher performance; but, these knobs are tough to control and often lead to a significant loss in accuracy. Hence, we search for a better trade-off between speed and accuracy, and we achieve this by considering a new approach based on efficient set operations using software signatures. Rather than tracking pair-wise dependences, we identify important dependence relationships at compile time, group said relationships into sets, and then construct and operate directly on the sets at runtime. By grouping many pair-wise relationships into a single set operation, we achieve greater accuracy with higher performance. Furthermore, we can adjust the size of signatures and the number of set operations through various optimization techniques to trade off speed and accuracy. Using a manual design space search, we evaluated many promising configurations. We found a fast profiler which matches the accuracy of prior works with a slowdown of 2.97x. Also, the most accurate and balanced profilers have slowdowns of 18x and 8x respectively, with an accuracy of 0.15 and 0.17 Normalized Average Euclidean Distance, respectively. To the best of our knowledge, the set-based profiler is the most accurate one to date, with equal or better performance than prior works.