Commutativity analysis: a new analysis technique for parallelizing compilers

Authors:
Martin C. Rinard;Pedro C. Diniz
Affiliations:
Massachusetts Institute of Technology, Cambridge;Univ. of Southern California, Marina del Rey
Venue:
ACM Transactions on Programming Languages and Systems (TOPLAS)
Year:
1997

Citing 45
Cited 41

Object-oriented concurrent programming ABCL/1

OOPLSA '86 Conference proceedings on Object-oriented programming systems, languages and applications
Guided self-scheduling: A practical scheduling scheme for parallel supercomputers

IEEE Transactions on Computers
Portable programs for parallel processors

Portable programs for parallel processors
Detecting conflicts between structure accesses

PLDI '88 Proceedings of the ACM SIGPLAN 1988 conference on Programming Language design and Implementation
Commutativity-Based Concurrency Control for Abstract Data Types

IEEE Transactions on Computers
Lazy task creation: a technique for increasing the granularity of parallel programs

LFP '90 Proceedings of the 1990 ACM conference on LISP and functional programming
Analysis of pointers and structures

PLDI '90 Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation
Making asynchronous parallelism safe for the world

POPL '90 Proceedings of the 17th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Program optimization and parallelization using idioms

POPL '91 Proceedings of the 18th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Coarse-grain parallel programming in Jade

PPOPP '91 Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming
SPLASH: Stanford parallel applications for shared-memory

ACM SIGARCH Computer Architecture News
Compiling Fortran D for MIMD distributed-memory machines

Communications of the ACM
The design and analysis of DASH: a scalable directory-based multiprocessor

The design and analysis of DASH: a scalable directory-based multiprocessor
Eliminating false data dependences using the Omega test

PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
Abstractions for recursive pointer data structures: improving the analysis and transformation of imperative programs

PLDI '92 Proceedings of the ACM SIGPLAN 1992 conference on Programming language design and implementation
Parallel hierarchical N-body methods

Parallel hierarchical N-body methods
Interprocedural modification side effect analysis with pointer aliasing

PLDI '93 Proceedings of the ACM SIGPLAN 1993 conference on Programming language design and implementation
Data locality and load balancing in COOL

PPOPP '93 Proceedings of the fourth ACM SIGPLAN symposium on Principles and practice of parallel programming
Parallel hierarchical N-body methods and their implications for multiprocessors

Parallel hierarchical N-body methods and their implications for multiprocessors
Parallelizing complex scans and reductions

PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
A general data dependence test for dynamic, pointer-based data structures

PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Context-sensitive interprocedural points-to analysis in the presence of function pointers

PLDI '94 Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation
Aslantest: a symbolic execution tool for testing Aslan formal specifications

ISSTA '94 Proceedings of the 1994 ACM SIGSOFT international symposium on Software testing and analysis
Efficient context-sensitive pointer analysis for C programs

PLDI '95 Proceedings of the ACM SIGPLAN 1995 conference on Programming language design and implementation
Software caching and computation migration in Olden

PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Flattening and parallelizing irregular, recurrent loop nests

PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
Compiler optimizations for eliminating barrier synchronization

PPOPP '95 Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming
The design, implementation and evaluation of Jade: a portable, implicitly parallel programming language

The design, implementation and evaluation of Jade: a portable, implicitly parallel programming language
The SPLASH-2 programs: characterization and methodological considerations

ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture
Detecting coarse-grain parallelism using an interprocedural parallelizing compiler

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Communication optimizations for parallel computing using data access information

Supercomputing '95 Proceedings of the 1995 ACM/IEEE conference on Supercomputing
Experience with processes and monitors in Mesa

Communications of the ACM
Symbolic execution and program testing

Communications of the ACM
Dependence Analysis for Supercomputing

Dependence Analysis for Supercomputing
Performance Analysis of Parallelizing Compilers on the Perfect Benchmarks Programs

IEEE Transactions on Parallel and Distributed Systems
Symbolic range propagation

IPPS '95 Proceedings of the 9th International Symposium on Parallel Processing
Experience in the Automatic Parallelization of Four Perfect-Benchmark Programs

Proceedings of the Fourth International Workshop on Languages and Compilers for Parallel Computing
Recognizing and Parallelizing Bounded Recurrences

Proceedings of the Fourth International Workshop on Languages and Compilers for Parallel Computing
Analysis of Dynamic Structures for Efficient Parallel Execution

Proceedings of the 6th International Workshop on Languages and Compilers for Parallel Computing
On the Complexity of Commutativity Analysis

COCOON '96 Proceedings of the Second Annual International Conference on Computing and Combinatorics
Commutativity Analysis: A Technique for Automatically Parallelizing Pointer-Based Computations

IPPS '96 Proceedings of the 10th International Parallel Processing Symposium
Lock Coarsening: Eliminating Lock Overhead in Automatically Parallelized Object-Based Programs

LCPC '96 Proceedings of the 9th International Workshop on Languages and Compilers for Parallel Computing
Gprof: A call graph execution profiler

SIGPLAN '82 Proceedings of the 1982 SIGPLAN symposium on Compiler construction
FX-87 PERFORMANCE MEASUREMENTS: DATAFLOW IMPLEMENTATION

FX-87 PERFORMANCE MEASUREMENTS: DATAFLOW IMPLEMENTATION
Program reduction using symbolic execution

ACM SIGSOFT Software Engineering Notes

Eliminating synchronization bottlenecks in object-based programs using adaptive replication

ICS '99 Proceedings of the 13th international conference on Supercomputing
Eliminating synchronization overhead in automatically parallelized programs using dynamic feedback

ACM Transactions on Computer Systems (TOCS)
Effective fine-grain synchronization for automatically parallelized programs using optimistic synchronization primitives

ACM Transactions on Computer Systems (TOCS)
Mapping irregular applications to DIVA, a PIM-based data-intensive architecture

SC '99 Proceedings of the 1999 ACM/IEEE conference on Supercomputing
Symbolic bounds analysis of pointers, array indices, and accessed memory regions

PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
Fractal symbolic analysis

ICS '01 Proceedings of the 15th international conference on Supercomputing
Containers on the Parallelization of General-Purpose Java Programs

International Journal of Parallel Programming
Eliminating synchronization bottlenecks using adaptive replication

ACM Transactions on Programming Languages and Systems (TOPLAS)
Beyond Arrays - A Container-Centric Approach for Parallelization of Real-World Symbolic Applications

LCPC '98 Proceedings of the 11th International Workshop on Languages and Compilers for Parallel Computing
Analysis of Multithreaded Programs

SAS '01 Proceedings of the 8th International Symposium on Static Analysis
Inter-procedural Analysis for Parallelization of Java Programs

ParNum '99 Proceedings of the 4th International ACPC Conference Including Special Tracks on Parallel Numerics and Parallel Computing in Image Processing, Video Processing, and Multimedia: Parallel Computation
A Comparison of Locality Transformations for Irregular Codes

LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Detecting Read-Only Methods in Java

LCR '00 Selected Papers from the 5th International Workshop on Languages, Compilers, and Run-Time Systems for Scalable Computers
Automatic pool allocation for disjoint data structures

Proceedings of the 2002 workshop on Memory system performance
Fractal symbolic analysis

ACM Transactions on Programming Languages and Systems (TOPLAS)
Automating commutativity analysis at the design level

ISSTA '04 Proceedings of the 2004 ACM SIGSOFT international symposium on Software testing and analysis
Symbolic bounds analysis of pointers, array indices, and accessed memory regions

ACM Transactions on Programming Languages and Systems (TOPLAS)
Look left, look right, look left again: an application of fractal symbolic analysis to linear algebra code restructuring

International Journal of Parallel Programming
Optimistic parallelism requires abstractions

Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
Transactional boosting: a methodology for highly-concurrent transactional objects

Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming
Full functional verification of linked data structures

Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation
Commutativity analysis for software parallelization: letting program transformations see the big picture

Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
Automatic Parallelization with Separation Logic

ESOP '09 Proceedings of the 18th European Symposium on Programming Languages and Systems: Held as Part of the Joint European Conferences on Theory and Practice of Software, ETAPS 2009
Optimistic parallelism requires abstractions

Communications of the ACM - The Status of the P versus NP Problem
A type and effect system for deterministic parallel Java

Proceedings of the 24th ACM SIGPLAN conference on Object oriented programming systems languages and applications
Coarse-grained transactions

Proceedings of the 37th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Automatic atomic region identification in shared memory SPMD programs

Proceedings of the ACM international conference on Object oriented programming systems languages and applications
The tao of parallelism in algorithms

Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation
ALTER: exploiting breakable dependences for parallelization

Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation
Verification of semantic commutativity conditions and inverse operations on linked data structures

Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation
Localizing globals and statics to make C programs thread-safe

CASES '11 Proceedings of the 14th international conference on Compilers, architectures and synthesis for embedded systems
Enhancing locality for recursive traversals of recursive structures

Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications
Specification-based sketching with Sketch

Proceedings of the 13th Workshop on Formal Techniques for Java-Like Programs
Internally deterministic parallel algorithms can be fast

Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
Deterministic parallelism via liquid effects

Proceedings of the 33rd ACM SIGPLAN conference on Programming Language Design and Implementation
Yada: Straightforward parallel programming

Parallel Computing
Automatically enhancing locality for tree traversals with traversal splicing

Proceedings of the ACM international conference on Object oriented programming systems languages and applications
Testing mined specifications

Proceedings of the ACM SIGSOFT 20th International Symposium on the Foundations of Software Engineering
Parallelizing Sequential Programs with Statistical Accuracy Tests

ACM Transactions on Embedded Computing Systems (TECS) - Special Section on Probabilistic Embedded Computing
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles

ACM SIGOPS 24th Symposium on Operating Systems Principles
The scalable commutativity rule: designing scalable software for multicore processors

Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles

Quantified Score

Hi-index	0.00

Visualization

Abstract

This article presents a new analysis technique, commutativity analysis, for automatically parallelizing computations that manipulate dynamic, pointer-based data structures. Commutativity analysis views the computation as composed of operations on objects. It then analyzes the program at this granularity to discover when operations commute (i.e., generate the same final result regardless of the order in which they execute). If all of the operations required to perform a given computation commute, the compiler can automatically generate parallel code. We have implemented a prototype compilation system that uses commutativity analysis as its primary analysis technique. We have used this system to automatically parallelize three complete scientific computations: the Barnes-Hut N-body solver, the Water liquid simulation code, and the String seismic simulation code. This article presents performance results for the generated parallel code running on the Stanford DASH machine. These results provide encouraging evidence that commutativity analysis can serve as the basis for a successful parallelizing compiler.