CoreDet: a compiler and runtime system for deterministic multithreaded execution

Authors:
Tom Bergan;Owen Anderson;Joseph Devietti;Luis Ceze;Dan Grossman
Affiliations:
University of Washington, Computer Science and Engineering, Seattle, WA, USA;University of Washington, Computer Science and Engineering, Seattle, WA, USA;University of Washington, Computer Science and Engineering, Seattle, WA, USA;University of Washington, Computer Science and Engineering, Seattle, WA, USA;University of Washington, Computer Science and Engineering, Seattle, WA, USA
Venue:
Proceedings of the fifteenth edition of ASPLOS on Architectural support for programming languages and operating systems
Year:
2010

Citing 25
Cited 51

Debugging Parallel Programs with Instant Replay

IEEE Transactions on Computers
Algorithms for scalable synchronization on shared-memory multiprocessors

ACM Transactions on Computer Systems (TOCS)
The SPLASH-2 programs: characterization and methodological considerations

ISCA '95 Proceedings of the 22nd annual international symposium on Computer architecture
Deterministic replay of Java multithreaded applications

SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
The design, implementation, and evaluation of Jade

ACM Transactions on Programming Languages and Systems (TOPLAS)
RecPlay: a fully integrated practical record/replay system

ACM Transactions on Computer Systems (TOCS)
Weak ordering—a new definition

ISCA '90 Proceedings of the 17th annual international symposium on Computer Architecture
Hoard: a scalable memory allocator for multithreaded applications

ASPLOS IX Proceedings of the ninth international conference on Architectural support for programming languages and operating systems
StreamIt: A Language for Streaming Applications

CC '02 Proceedings of the 11th International Conference on Compiler Construction
A "flight data recorder" for enabling full-system multiprocessor deterministic replay

Proceedings of the 30th annual international symposium on Computer architecture
LLVM: A Compilation Framework for Lifelong Program Analysis & Transformation

Proceedings of the international symposium on Code generation and optimization: feedback-directed and runtime optimization
The Java memory model

Proceedings of the 32nd ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Threads cannot be implemented as a library

Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
SHIM: a deterministic model for heterogeneous embedded systems

Proceedings of the 5th ACM international conference on Embedded software
Macroscopic data structure analysis and optimization

Macroscopic data structure analysis and optimization
Recording shared memory dependencies using strata

Proceedings of the 12th international conference on Architectural support for programming languages and operating systems
Data parallel Haskell: a status report

Proceedings of the 2007 workshop on Declarative aspects of multicore programming
Foundations of the C++ concurrency memory model

Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation
Rerun: Exploiting Episodes for Lightweight Memory Race Recording

ISCA '08 Proceedings of the 35th Annual International Symposium on Computer Architecture
DeLorean: Recording and Deterministically Replaying Shared-Memory Multiprocessor Execution Ef?ciently

ISCA '08 Proceedings of the 35th Annual International Symposium on Computer Architecture
The PARSEC benchmark suite: characterization and architectural implications

Proceedings of the 17th international conference on Parallel architectures and compilation techniques
DMP: deterministic shared memory multiprocessing

Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
Kendo: efficient deterministic multithreading in software

Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
A type and effect system for deterministic parallel Java

Proceedings of the 24th ACM SIGPLAN conference on Object oriented programming systems languages and applications
Finding and reproducing Heisenbugs in concurrent programs

OSDI'08 Proceedings of the 8th USENIX conference on Operating systems design and implementation

Determinating timing channels in compute clouds

Proceedings of the 2010 ACM workshop on Cloud computing security workshop
Concurrent programming with revisions and isolation types

Proceedings of the ACM international conference on Object oriented programming systems languages and applications
Deterministic process groups in dOS

OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Efficient system-enforced deterministic parallelism

OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Stable deterministic multithreading through schedule memoization

OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
Safe nondeterminism in a deterministic-by-default parallel language

Proceedings of the 38th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
InstantCheck: Checking the Determinism of Parallel Programs Using On-the-Fly Incremental Hashing

MICRO '43 Proceedings of the 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture
RCDC: a relaxed consistency deterministic computer

Proceedings of the sixteenth international conference on Architectural support for programming languages and operating systems
Karma: scalable deterministic record-replay

Proceedings of the international conference on Supercomputing
Deterministic OpenMP for race-free parallelism

HotPar'11 Proceedings of the 3rd USENIX conference on Hot topic in parallelism
Dthreads: efficient deterministic multithreading

SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Efficient deterministic multithreading through schedule relaxation

SOSP '11 Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles
Accentuating the positive: atomicity inference and enforcement using correct executions

Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications
Safe parallel programming using dynamic dependence hints

Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications
Two for the price of one: a model for parallel and incremental computation

Proceedings of the 2011 ACM international conference on Object oriented programming systems languages and applications
Toward a formal semantic framework for deterministic parallel programming

DISC'11 Proceedings of the 25th international conference on Distributed computing
A Deterministic Interpreter Simulating A Distributed real time system using VDM

ICFEM'11 Proceedings of the 13th international conference on Formal methods and software engineering
Resource-sensitive synchronization inference by abduction

POPL '12 Proceedings of the 39th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages
A virtual memory foundation for scalable deterministic parallelism

Proceedings of the Second Asia-Pacific Workshop on Systems
DoublePlay: Parallelizing Sequential Logging and Replay

ACM Transactions on Computer Systems (TOCS) - Special Issue APLOS 2011
Internally deterministic parallel algorithms can be fast

Proceedings of the 17th ACM SIGPLAN symposium on Principles and Practice of Parallel Programming
Data races vs. data race bugs: telling the difference with portend

ASPLOS XVII Proceedings of the seventeenth international conference on Architectural Support for Programming Languages and Operating Systems
Enhancing TCP throughput of highly available virtual machines via speculative communication

VEE '12 Proceedings of the 8th ACM SIGPLAN/SIGOPS conference on Virtual Execution Environments
Efficient system-enforced deterministic parallelism

Communications of the ACM
A data-centric approach to synchronization

ACM Transactions on Programming Languages and Systems (TOPLAS)
Exploiting parallelism in deterministic shared memory multiprocessing

Journal of Parallel and Distributed Computing
Sound and precise analysis of parallel programs through schedule specialization

Proceedings of the 33rd ACM SIGPLAN conference on Programming Language Design and Implementation
Verifying GPU kernels by test amplification

Proceedings of the 33rd ACM SIGPLAN conference on Programming Language Design and Implementation
Chimera: hybrid program analysis for determinism

Proceedings of the 33rd ACM SIGPLAN conference on Programming Language Design and Implementation
Disciplined concurrent programming using tasks with effects

HotPar'12 Proceedings of the 4th USENIX conference on Hot Topics in Parallelism
Concurrency attacks

HotPar'12 Proceedings of the 4th USENIX conference on Hot Topics in Parallelism
TACHYON: tandem execution for efficient live patch testing

Security'12 Proceedings of the 21st USENIX conference on Security symposium
Execution privatization for scheduler-oblivious concurrent programs

Proceedings of the ACM international conference on Object oriented programming systems languages and applications
All about Eve: execute-verify replication for multi-core servers

OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
The tasks with effects model for safe concurrency

Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming
RaceFree: an efficient multi-threading model for determinism

Proceedings of the 18th ACM SIGPLAN symposium on Principles and practice of parallel programming
GPUDet: a deterministic GPU architecture

Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems
DDOS: taming nondeterminism in distributed systems

Proceedings of the eighteenth international conference on Architectural support for programming languages and operating systems
Proving the correctness of nonblocking data structures

Communications of the ACM
Efficient software-based fault tolerance approach on multicore platforms

Proceedings of the Conference on Design, Automation and Test in Europe
Proving the Correctness of Nonblocking Data Structures

Queue - Concurrency
Effective dynamic detection of alias analysis errors

Proceedings of the 2013 9th Joint Meeting on Foundations of Software Engineering
Proof-Directed Parallelization Synthesis by Separation Logic

ACM Transactions on Programming Languages and Systems (TOPLAS)
Lazy tree mapping: generalizing and scaling deterministic parallelism

Proceedings of the 4th Asia-Pacific Workshop on Systems
Making parallel programs reliable with stable multithreading

Communications of the ACM
OCTET: capturing and controlling cross-thread dependences efficiently

Proceedings of the 2013 ACM SIGPLAN international conference on Object oriented programming systems languages & applications
Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles

ACM SIGOPS 24th Symposium on Operating Systems Principles
Parrot: a practical runtime for deterministic, stable, and reliable threads

Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles
DEFINED: deterministic execution for interactive control-plane debugging

USENIX ATC'13 Proceedings of the 2013 USENIX conference on Annual Technical Conference
Deterministic galois: on-demand, portable and parameterless

Proceedings of the 19th international conference on Architectural support for programming languages and operating systems
Efficient deterministic multithreading without global barriers

Proceedings of the 19th ACM SIGPLAN symposium on Principles and practice of parallel programming

Quantified Score

Hi-index	0.05

Visualization

Abstract

The behavior of a multithreaded program does not depend only on its inputs. Scheduling, memory reordering, timing, and low-level hardware effects all introduce nondeterminism in the execution of multithreaded programs. This severely complicates many tasks, including debugging, testing, and automatic replication. In this work, we avoid these complications by eliminating their root cause: we develop a compiler and runtime system that runs arbitrary multithreaded C/C++ POSIX Threads programs deterministically. A trivial non-performant approach to providing determinism is simply deterministically serializing execution. Instead, we present a compiler and runtime infrastructure that ensures determinism but resorts to serialization rarely, for handling interthread communication and synchronization. We develop two basic approaches, both of which are largely dynamic with performance improved by some static compiler optimizations. First, an ownership-based approach detects interthread communication via an evolving table that tracks ownership of memory regions by threads. Second, a buffering approach uses versioned memory and employs a deterministic commit protocol to make changes visible to other threads. While buffering has larger single-threaded overhead than ownership, it tends to scale better (serializing less often). A hybrid system sometimes performs and scales better than either approach individually. Our implementation is based on the LLVM compiler infrastructure. It needs neither programmer annotations nor special hardware. Our empirical evaluation uses the PARSEC and SPLASH2 benchmarks and shows that our approach scales comparably to nondeterministic execution.