On the danger of coverage directed test case generation

Authors:
Matt Staats;Gregory Gay;Michael Whalen;Mats Heimdahl
Affiliations:
Korea Advanced Institute of Science & Technology, Daejeon, Republic of Korea;University of Minnesota, Minneapolis, MN;University of Minnesota, Minneapolis, MN;University of Minnesota, Minneapolis, MN
Venue:
FASE'12 Proceedings of the 15th international conference on Fundamental Approaches to Software Engineering
Year:
2012

Abstract

In the avionics domain, the use of structural coverage criteria is legally required in determining test suite adequacy. With the success of automated test generation tools, it is tempting to use these criteria as the basis for test generation. To more firmly establish the effectiveness of such approaches, we have generated and evaluated test suites to satisfy two coverage criteria using counterexample-based test generation and a random generation approach, contrasted against purely random test suites of equal size. Our results yield two key conclusions. First, coverage criteria satisfaction alone is a poor indication of test suite effectiveness. Second, the use of structural coverage as a supplement, not a target, for test generation can have a positive impact. These observations point to the dangers inherent in the increase in test automation in critical systems and the need for more research into how coverage criteria, generation approach, and system structure jointly influence test effectiveness.