BALLERINA: automatic generation and clustering of efficient random unit tests for multithreaded code

Authors:
Adrian Nistor;Qingzhou Luo;Michael Pradel;Thomas R. Gross;Darko Marinov
Affiliations:
University of Illinois at Urbana-Champaign, USA;University of Illinois at Urbana-Champaign, USA;ETH Zurich, Switzerland;ETH Zurich, Switzerland;University of Illinois at Urbana-Champaign, USA
Venue:
Proceedings of the 34th International Conference on Software Engineering
Year:
2012

Abstract

Testing multithreaded code is hard and expensive. Each multithreaded unit test creates two or more threads, each executing one or more methods on shared objects of the class under test. Such unit tests can be generated at random, but basic generation produces tests that are either slow or do not trigger concurrency bugs. Worse, such tests produce many false alarms, which require human effort to filter out. We present BALLERINA, a novel technique for automatic generation of efficient multithreaded random tests that effectively trigger concurrency bugs. BALLERINA makes tests efficient by having only two threads, each executing a single, randomly selected method. BALLERINA increases the chances that such simple parallel code finds bugs by appending it to more complex, randomly generated sequential code. We also propose a clustering technique to reduce the manual effort in inspecting failures of automatically generated multithreaded tests. We evaluate BALLERINA on 14 real-world bugs from 6 popular codebases: Groovy, Java JDK, jFreeChart, Log4j, Lucene, and Pool. The experiments show that tests generated by BALLERINA can find bugs on average 2X-10X faster than various configurations of basic random generation, and our clustering technique reduces the number of inspected failures on average 4X-8X. Using BALLERINA, we found three previously unknown bugs in Apache Pool and Log4j, one of which has already been confirmed and fixed.
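
The test shape described in the abstract (a randomly generated sequential prefix, followed by two threads that each execute a single randomly selected method on the shared object) can be illustrated with a minimal Java sketch. This is not BALLERINA's implementation: the `Counter` class under test, the `METHODS` pool, and the names `GeneratedTest`, `generate`, and `run` are all hypothetical, and the sketch reports only exceptions thrown in the two threads. It omits the failure clustering step entirely.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.atomic.AtomicReference;
import java.util.function.Consumer;

// Illustrative sketch only; every name here is hypothetical, not BALLERINA's API.
class TwoThreadTestSketch {

    // Hypothetical class under test; deliberately not thread-safe.
    static final class Counter {
        private int value;
        public void increment() { value++; }
        public void decrement() { value--; }
        public void reset() { value = 0; }
    }

    // Pool of methods the generator may select from (an assumption of this sketch).
    static final List<Consumer<Counter>> METHODS = List.<Consumer<Counter>>of(
            Counter::increment, Counter::decrement, Counter::reset);

    // One generated test: a sequential prefix plus exactly two parallel calls.
    static final class GeneratedTest {
        final List<Consumer<Counter>> prefix = new ArrayList<>();
        Consumer<Counter> first;
        Consumer<Counter> second;
    }

    static Consumer<Counter> pick(Random rnd) {
        return METHODS.get(rnd.nextInt(METHODS.size()));
    }

    // Randomly choose the prefix calls and the two parallel calls.
    static GeneratedTest generate(Random rnd, int prefixLength) {
        GeneratedTest t = new GeneratedTest();
        for (int i = 0; i < prefixLength; i++) {
            t.prefix.add(pick(rnd));
        }
        t.first = pick(rnd);
        t.second = pick(rnd);
        return t;
    }

    // Build a thread that waits for the start signal, then makes one call.
    static Thread worker(Counter shared, Consumer<Counter> call,
                         CountDownLatch start, AtomicReference<Throwable> failure) {
        return new Thread(() -> {
            try {
                start.await();
                call.accept(shared);
            } catch (Throwable e) {
                failure.compareAndSet(null, e); // keep the first failure seen
            }
        });
    }

    // Run the prefix in the calling thread, then the two calls in parallel.
    // Returns the first exception observed, or null if none occurred.
    static Throwable run(GeneratedTest t) {
        Counter shared = new Counter();
        for (Consumer<Counter> call : t.prefix) {
            call.accept(shared);
        }
        AtomicReference<Throwable> failure = new AtomicReference<>();
        CountDownLatch start = new CountDownLatch(1);
        Thread a = worker(shared, t.first, start, failure);
        Thread b = worker(shared, t.second, start, failure);
        a.start();
        b.start();
        start.countDown(); // release both threads at roughly the same time
        try {
            a.join();
            b.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return e;
        }
        return failure.get();
    }

    public static void main(String[] args) {
        GeneratedTest t = generate(new Random(42), 5);
        System.out.println("prefix calls: " + t.prefix.size() + ", failure: " + run(t));
    }
}
```

Keeping the parallel part to two single calls keeps each run short, while the sequential prefix drives the shared object into a less trivial state before the threads start. A real generator would also need an oracle beyond uncaught exceptions, which this sketch leaves out.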