Healing data races on-the-fly

Authors:
Bohuslav Krena;Zdenek Letko;Rachel Tzoref;Shmuel Ur;Tomáš Vojnar
Affiliations:
Brno University of Technology, Czech Republic;Brno University of Technology, Czech Republic;Haifa University Campus, Haifa, Israel;Haifa University Campus, Haifa, Israel;Brno University of Technology, Czech Republic
Venue:
Proceedings of the 2007 ACM workshop on Parallel and distributed systems: testing and debugging
Year:
2007

Citing 30
Cited 13

Automatic detection of nondeterminacy in parallel programs

PADD '88 Proceedings of the 1988 ACM SIGPLAN and SIGOPS workshop on Parallel and distributed debugging
An empirical comparison of monitoring algorithms for access anomaly detection

PPOPP '90 Proceedings of the second ACM SIGPLAN symposium on Principles & practice of parallel programming
Improving the accuracy of data race detection

PPOPP '91 Proceedings of the third ACM SIGPLAN symposium on Principles and practice of parallel programming
Detecting data races on weak memory systems

ISCA '91 Proceedings of the 18th annual international symposium on Computer architecture
Detecting access anomalies in programs with critical sections

PADD '91 Proceedings of the 1991 ACM/ONR workshop on Parallel and distributed debugging
What are race conditions?: Some issues and formalizations

ACM Letters on Programming Languages and Systems (LOPLAS)
Compile-time support for efficient data race detection in shared-memory parallel programs

PADD '93 Proceedings of the 1993 ACM/ONR workshop on Parallel and distributed debugging
Eraser: a dynamic data race detector for multithreaded programs

ACM Transactions on Computer Systems (TOCS)
Detecting data races in Cilk programs that use locks

Proceedings of the tenth annual ACM symposium on Parallel algorithms and architectures
Protocol-based data-race detection

SPDT '98 Proceedings of the SIGMETRICS symposium on Parallel and distributed tools
RecPlay: a fully integrated practical record/replay system

ACM Transactions on Computer Systems (TOCS)
Compile-time detection of race conditions in a parallel program

ICS '89 Proceedings of the 3rd international conference on Supercomputing
Model checking

Model checking
Toward integration of data race detection in DSM systems

Journal of Parallel and Distributed Computing - Special issue on software support for distributed computing
A Protocol-Centric Approach to on-the-Fly Race Detection

IEEE Transactions on Parallel and Distributed Systems
Reduction: a method of proving properties of parallel programs

Communications of the ACM
Detecting race conditions in large programs

PASTE '01 Proceedings of the 2001 ACM SIGPLAN-SIGSOFT workshop on Program analysis for software tools and engineering
Object race detection

OOPSLA '01 Proceedings of the 16th ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications
Efficient and precise datarace detection for multithreaded object-oriented programs

PLDI '02 Proceedings of the ACM SIGPLAN 2002 Conference on Programming language design and implementation
Types for atomicity

Proceedings of the 2003 ACM SIGPLAN international workshop on Types in languages design and implementation
Efficient on-the-fly data race detection in multithreaded C++ programs

Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming
Concurrent Bug Patterns and How to Test Them

IPDPS '03 Proceedings of the 17th International Symposium on Parallel and Distributed Processing
Atomizer: a dynamic atomicity checker for multithreaded programs

Proceedings of the 31st ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Efficient Verification of Sequential and Concurrent C Programs

Formal Methods in System Design
Finding missing synchronization in a distributed computation using controlled re-execution

Distributed Computing
Static analysis of atomicity for programs with non-blocking synchronization

Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming
Exploiting Purity for Atomicity

IEEE Transactions on Software Engineering
A theory of data race detection

Proceedings of the 2006 workshop on Parallel and distributed systems: testing and debugging
Instrumenting where it hurts: an automatic concurrent debugging technique

Proceedings of the 2007 international symposium on Software testing and analysis
Goldilocks: efficiently computing the happens-before relation using locksets

FATES'06/RV'06 Proceedings of the First combined international conference on Formal Approaches to Software Testing and Runtime Verification

Dynamic recognition of synchronization operations for improved data race detection

ISSTA '08 Proceedings of the 2008 international symposium on Software testing and analysis
AtomRace: data race and atomicity violation detector and healer

PADTAD '08 Proceedings of the 6th workshop on Parallel and distributed systems: testing, analysis, and debugging
Detecting and tolerating asymmetric races

Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming
ISOLATOR: dynamically ensuring isolation in comcurrent programs

Proceedings of the 14th international conference on Architectural support for programming languages and operating systems
In-field healing of integration problems with COTS components

ICSE '09 Proceedings of the 31st International Conference on Software Engineering
Adaptive locks: Combining transactions and locks for efficient concurrency

Journal of Parallel and Distributed Computing
Automated atomicity-violation fixing

Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation
Research in concurrent software testing: a systematic review

Proceedings of the Workshop on Parallel and Distributed Systems: Testing, Analysis, and Debugging
Exploiting cache traffic monitoring for run-time race detection

Euro-Par'11 Proceedings of the 17th international conference on Parallel processing - Volume Part I
Hardware support for enforcing isolation in lock-based parallel programs

Proceedings of the 26th ACM international conference on Supercomputing
Noise-based testing and analysis of multi-threaded C/C++ programs on the binary level

Proceedings of the 2012 Workshop on Parallel and Distributed Systems: Testing, Analysis, and Debugging
Automated concurrency-bug fixing

OSDI'12 Proceedings of the 10th USENIX conference on Operating Systems Design and Implementation
Exception handlers for healing component-based systems

ACM Transactions on Software Engineering and Methodology (TOSEM) - Testing, debugging, and error handling, formal methods, lifecycle concerns, evolution and maintenance

Quantified Score

Hi-index	0.00

Visualization

Abstract

Testing of concurrent software is extremely difficult. Despite all the progress in the testing and verification technology, concurrent bugs, the most common of which are deadlocks and races, make it to the field. This paper describes a set of techniques, implemented in a tool called ConTest, allowing concurrent programs to self-heal at run-time. Concurrent bugs have the very desirable property for healing that some of the interleaving produce correct results while in others bugs manifest. Healing concurrency problems is about limiting, or changing the probability of interleaving, such that bugs will be seen less. When healing concurrent programs, if a deadlock does not result from limiting the interleaving, we are sure that the result of the healed program could have been in the original program and therefore no new functional bug has been introduced. In this initial work which deals with different types of data races, we suggest three types of healing mechanisms: (1) changing the probability of interleaving by introducing sleep or yield statements or by changing thread priorities, (2) removing interleaving using synchronisation commands like locking and unlocking certain mutexes or waits and notifies, and (3) removing the result of "bad interleaving" by replacing the value of variables by the one that "should" have been taken. We also classify races according to the relevant healing strategies to apply.