Recovering memory access patterns of executable programs

Authors:
Alain Ketterlin;Philippe Clauss
Affiliations:
-;-
Venue:
Science of Computer Programming
Year:
2014

Citing 22
Cited 0

Efficiently computing static single assignment form and the control dependence graph

ACM Transactions on Programming Languages and Systems (TOPLAS)
Beyond induction variables: detecting and classifying sequences using a demand-driven SSA form

ACM Transactions on Programming Languages and Systems (TOPLAS)
Nesting of reducible and irreducible loops

ACM Transactions on Programming Languages and Systems (TOPLAS)
Making graphs reducible with controlled node splitting

ACM Transactions on Programming Languages and Systems (TOPLAS)
Advanced compiler design and implementation

Advanced compiler design and implementation
Constraint-based array dependence analysis

ACM Transactions on Programming Languages and Systems (TOPLAS)
Aggregate structure identification and its application to program analysis

Proceedings of the 26th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
On loops, dominators, and dominance frontier

PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
A fast algorithm for finding dominators in a flowgraph

ACM Transactions on Programming Languages and Systems (TOPLAS)
Optimizing compilers for modern architectures: a dependence-based approach

Optimizing compilers for modern architectures: a dependence-based approach
Handling irreducible loops: optimized node splitting versus DJ-graphs

ACM Transactions on Programming Languages and Systems (TOPLAS)
High Performance Compilers for Parallel Computing

High Performance Compilers for Parallel Computing
Efficient program tracing

Computer
Intraprocedural Static Slicing of Binary Executables

ICSM '97 Proceedings of the International Conference on Software Maintenance
Data Dependence Analysis of Assembly Code

PACT '98 Proceedings of the 1998 International Conference on Parallel Architectures and Compilation Techniques
Modern Compiler Implementation in C

Modern Compiler Implementation in C
Pin: building customized program analysis tools with dynamic instrumentation

Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
Identifying potential parallelism via loop-centric profiling

Proceedings of the 4th international conference on Computing frontiers
Shadow Profiling: Hiding Instrumentation Costs with Parallelism

Proceedings of the International Symposium on Code Generation and Optimization
A practical automatic polyhedral parallelizer and locality optimizer

Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation
Efficient memory tracing by program skeletonization

ISPASS '11 Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software
Polyhedral parallelization of binary code

ACM Transactions on Architecture and Code Optimization (TACO) - HIPEAC Papers

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper deals with the binary analysis of executable programs, with the goal of understanding how they access memory. It explains how to statically build a formal model of all memory accesses. Starting with a control flow graph of each procedure, well-known techniques are used to structure this graph into a hierarchy of loops in all cases. The paper shows that much more information can be extracted by performing a complete data-flow analysis over machine registers after the program has been put in static single assignment (SSA) form. By using the SSA form, registers used in addressing memory can be symbolically expressed in terms of other previously set registers. By including the loop structures in the analysis, loop indices and trip counts can also often be expressed symbolically. The whole process produces a formal model made of loops where memory accesses are linear expressions of loop counters and registers. The paper provides a quantitative evaluation of the results when applied to several dozens of SPEC benchmark programs. Because static analysis has no access to input data, the paper ends by describing a lightweight instrumentation strategy that collects at run time enough information to rebuild an exact trace. The section on applications also describes how the techniques developed in this paper can be used to perform automatic parallelization of binary code.