Field-sensitive program dependence analysis

Authors:
Shay Litvak;Nurit Dor;Rastislav Bodik;Noam Rinetzky;Mooly Sagiv
Affiliations:
Tel Aviv University & Panaya Inc., Tel-Aviv, Israel;Panaya Inc., Rannana, Israel;University of California, Berkeley, Berkeley, CA, USA;Queen Mary University of London, London, United Kingdom;Tel Aviv University, Tel Aviv, Israel
Venue:
Proceedings of the eighteenth ACM SIGSOFT international symposium on Foundations of software engineering
Year:
2010

Citing 27
Cited 2

Experiments on slicing-based debugging aids

Papers presented at the first workshop on empirical studies of programmers on Empirical studies of programmers
Integrating noninterfering versions of programs

ACM Transactions on Programming Languages and Systems (TOPLAS)
Dependence analysis for pointer variables

PLDI '89 Proceedings of the ACM SIGPLAN 1989 Conference on Programming language design and implementation
Interprocedural slicing using dependence graphs

ACM Transactions on Programming Languages and Systems (TOPLAS)
Identifying the semantic and textual differences between two versions of a program

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Using program slicing in software maintenance

Using program slicing in software maintenance
Using Program Slicing in Software Maintenance

IEEE Transactions on Software Engineering
Incremental program testing using program dependence graphs

POPL '93 Proceedings of the 20th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Automated support for legacy code understanding

Communications of the ACM
Speeding up slicing

SIGSOFT '94 Proceedings of the 2nd ACM SIGSOFT symposium on Foundations of software engineering
Precise interprocedural dataflow analysis via graph reachability

POPL '95 Proceedings of the 22nd ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Precise interprocedural dataflow analysis with applications to constant propagation

TAPSOFT '95 Selected papers from the 6th international joint conference on Theory and practice of software development
Advanced compiler design and implementation

Advanced compiler design and implementation
Aggregate structure identification and its application to program analysis

Proceedings of the 26th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Pointer analysis for programs with structures and casting

Proceedings of the ACM SIGPLAN 1999 conference on Programming language design and implementation
Pointer analysis: haven't we solved this problem yet?

PASTE '01 Proceedings of the 2001 ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
ABAP objects: introduction to programming SAP applications

ABAP objects: introduction to programming SAP applications
The program dependence graph in a software development environment

SDE 1 Proceedings of the first ACM SIGSOFT/SIGPLAN software engineering symposium on Practical software development environments
Program slices: formal, psychological, and practical investigations of an automatic program abstraction method

Program slices: formal, psychological, and practical investigations of an automatic program abstraction method
Evaluating variations on program slicing for debugging (data-flow, ada)

Evaluating variations on program slicing for debugging (data-flow, ada)
Efficient field-sensitive pointer analysis for C

Proceedings of the 5th ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
A framework for numeric analysis of array operations

Proceedings of the 32nd ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Compilers: Principles, Techniques, and Tools (2nd Edition)

Compilers: Principles, Techniques, and Tools (2nd Edition)
Thin slicing

Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
Customization change impact analysis for erp professionals via program slicing

ISSTA '08 Proceedings of the 2008 international symposium on Software testing and analysis
Enterprise Resource Planning

Enterprise Resource Planning
Fluid updates: beyond strong vs. weak updates

ESOP'10 Proceedings of the 19th European conference on Programming Languages and Systems

Fault localization for data-centric programs

Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering
A data dependence test based on the projection of paths over shape graphs

Journal of Parallel and Distributed Computing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Statement st transitively depends on statement stseed if the execution of stseed may affect the execution of st. Computing transitive program dependences is a fundamental operation in many automatic software analysis tools. Existing tools find it challenging to compute transitive dependences for programs manipulating large aggregate structure variables, and their limitations adversely affect analysis of certain important classes of software systems, e.g., large-scale enterprise resource planning (ERP) systems. This paper presents an efficient conservative interprocedural static analysis algorithm for computing field-sensitive transitive program dependences in the presence of large aggregate structure variables. Our key insight is that program dependences coming from operations on whole substructures can be precisely (i.e., field-sensitively) represented at the granularity of substructures instead of individual fields. Technically, we adapt the interval domain to concisely record dependences between multiple pairs of fields of aggregate structure variables by exploiting the fields' spatial arrangement. We prove that our algorithm is as precise as any algorithm which works at the granularity of individual fields, the most-precise known approach for this problem. Our empirical study, in which we analyzed industrial ERP programs with over 100,000 lines of code in average, shows significant improvements in both the running times and memory consumption over existing approaches: The baseline is an efficient field-insensitive whole-structure that incurs a 62% false error rate. An atomization-based algorithm, which disassemble every aggregate structure variable into the collection of its individual fields, can remove all these false errors at the cost of doubling the average analysis time, from 30 to 60 minutes. In contrast, our new precise algorithm removes all false errors by increasing the time only to 35 minutes. In terms of memory consumption, our algorithm increases the footprint by less than 10%, compared to 50% overhead of the atomizing algorithm.