A lightweight and portable approach to making concurrent failures reproducible

Authors:
Qingzhou Luo;Sai Zhang;Jianjun Zhao;Min Hu
Affiliations:
School of Software, Shanghai Jiao Tong University;Computer Science & Engineering Department, University of Washington;School of Software, Shanghai Jiao Tong University;School of Software, Shanghai Jiao Tong University
Venue:
FASE'10 Proceedings of the 13th international conference on Fundamental Approaches to Software Engineering
Year:
2010

Abstract

Concurrent programs often exhibit bugs due to unintended interference among concurrent threads. Such bugs are hard to reproduce because they typically occur only under very specific interleavings of the executing threads, and a bug (or software failure) in a concurrent program is very hard to fix without being able to reproduce it. In this paper, we present an approach, called ConCrash, that automatically and deterministically reproduces concurrent failures by recording the logical thread schedule and generating unit tests. For a given bug (failure), ConCrash records the logical thread scheduling order and preserves object states in memory at runtime. ConCrash then reproduces the failure offline using only the saved information, without the need for JVM-level or OS-level support. To reduce runtime overhead, ConCrash employs a static data race detection technique to identify potential race conditions and instruments only those locations. We implemented the ConCrash approach in a prototype tool for Java and evaluated it on a number of multi-threaded Java benchmarks. We successfully reproduced a number of real concurrent bugs (e.g., deadlocks, data races and atomicity violations) with acceptable overhead.
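
The record-and-replay idea in the abstract can be illustrated with a minimal Java sketch. This is not ConCrash's implementation: the class ScheduleSketch, its Recorder and Replayer types, and the access method are hypothetical names invented for this example. The sketch assumes every instrumented shared access goes through access, and it omits the object-state preservation, unit-test generation and static race detection the abstract describes. In record mode the order in which threads reach shared accesses is logged; in replay mode each thread waits until the log says it is its turn.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;

/** Hypothetical sketch: log the order of shared accesses, then force that order on replay. */
public class ScheduleSketch {

    /** Record mode: appends the name of each thread that reaches an instrumented access. */
    static final class Recorder {
        private final List<String> order = new ArrayList<>();

        synchronized void access(String thread, Runnable sharedAccess) {
            order.add(thread);
            sharedAccess.run();
        }

        synchronized List<String> schedule() {
            return new ArrayList<>(order);
        }
    }

    /** Replay mode: a thread blocks until the recorded schedule names it as next. */
    static final class Replayer {
        private final List<String> schedule;
        private int next = 0;

        Replayer(List<String> schedule) {
            this.schedule = new ArrayList<>(schedule);
        }

        synchronized void access(String thread, Runnable sharedAccess) throws InterruptedException {
            while (next < schedule.size() && !schedule.get(next).equals(thread)) {
                wait();                 // not this thread's turn yet
            }
            sharedAccess.run();
            next++;
            notifyAll();                // let the thread scheduled next proceed
        }
    }

    /** Starts one thread per name in the schedule; each performs as many accesses as it has entries. */
    static List<String> replay(List<String> schedule) throws InterruptedException {
        Replayer replayer = new Replayer(schedule);
        List<String> observed = new ArrayList<>();   // only touched inside synchronized access()
        List<Thread> threads = new ArrayList<>();
        for (String name : new LinkedHashSet<>(schedule)) {
            long count = schedule.stream().filter(name::equals).count();
            Thread t = new Thread(() -> {
                try {
                    for (long i = 0; i < count; i++) {
                        replayer.access(name, () -> observed.add(name));
                    }
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                }
            });
            threads.add(t);
            t.start();
        }
        for (Thread t : threads) {
            t.join();
        }
        return observed;
    }

    public static void main(String[] args) throws InterruptedException {
        Recorder recorder = new Recorder();
        Runnable twoAccesses = () -> {
            String name = Thread.currentThread().getName();
            for (int i = 0; i < 2; i++) {
                recorder.access(name, () -> { });
            }
        };
        Thread a = new Thread(twoAccesses, "T1");
        Thread b = new Thread(twoAccesses, "T2");
        a.start();
        b.start();
        a.join();
        b.join();

        List<String> schedule = recorder.schedule();   // varies from run to run
        System.out.println("recorded: " + schedule);
        System.out.println("replayed: " + replay(schedule));
    }
}
```

The recorded order may differ between runs, but replaying a given schedule yields the same access order every time, which is the property needed to reproduce an interleaving-dependent failure without JVM-level or OS-level support.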