An upper bound on software testing effectiveness

Authors:
Tsong Yueh Chen;Robert Merkel
Affiliations:
Swinburne University of Technology, Hawthorn Victoria, Australia;Swinburne University of Technology, Hawthorn Victoria, Australia
Venue:
ACM Transactions on Software Engineering and Methodology (TOSEM)
Year:
2008

Citing 11
Cited 14

Data Diversity: An Approach to Software Fault Tolerance

IEEE Transactions on Computers - Fault-Tolerant Computing
Partition Testing Does Not Inspire Confidence (Program Testing)

IEEE Transactions on Software Engineering
On the Expected Number of Failures Detected by Subdomain Testing and Random Testing

IEEE Transactions on Software Engineering
Evaluating Testing Methods by Delivered Reliability

IEEE Transactions on Software Engineering
Pursuing failure: the distribution of program failures in a profile space

Proceedings of the 8th European software engineering conference held jointly with 9th ACM SIGSOFT international symposium on Foundations of software engineering
A Formal Analysis of the Fault-Detecting Ability of Testing Methods

IEEE Transactions on Software Engineering
On the analytical comparison of testing techniques

ISSTA '04 Proceedings of the 2004 ACM SIGSOFT international symposium on Software testing and analysis
Object distance and its application to adaptive random testing of object-oriented programs

Proceedings of the 1st international workshop on Random testing
On the statistical properties of testing effectiveness measures

Journal of Systems and Software - Special issue: Quality software
A Domain Strategy for Computer Program Testing

IEEE Transactions on Software Engineering
Adaptive random testing

ASIAN'04 Proceedings of the 9th Asian Computing Science conference on Advances in Computer Science: dedicated to Jean-Louis Lassez on the Occasion of His 5th Cycle Birthday

Adaptive random testing by balancing

Proceedings of the 2nd international workshop on Random testing: co-located with the 22nd IEEE/ACM International Conference on Automated Software Engineering (ASE 2007)
Enhancing adaptive random testing for programs with high dimensional input domains or failure-unrelated parameters

Software Quality Control
Distributing test cases more evenly in adaptive random testing

Journal of Systems and Software
Enhanced lattice-based adaptive random testing

Proceedings of the 2009 ACM symposium on Applied Computing
Adaptive random testing based on distribution metrics

Journal of Systems and Software
Adaptive Random Testing: The ART of test case diversity

Journal of Systems and Software
On the estimation of adequate test set size using fault failure rates

Journal of Systems and Software
Non-parametric statistical fault localization

Journal of Systems and Software
Adaptive random testing: an illusion of effectiveness?

Proceedings of the 2011 International Symposium on Software Testing and Analysis
How well does test case prioritization integrate with statistical fault localization?

Information and Software Technology
Comparison of adaptive random testing and random testing under various testing and debugging scenarios

Software—Practice & Experience
An orchestrated survey of methodologies for automated software test case generation

Journal of Systems and Software
Automated cookie collection testing

ACM Transactions on Software Engineering and Methodology (TOSEM)
Do we need to handle every temporal violation in scientific workflow systems?

ACM Transactions on Software Engineering and Methodology (TOSEM)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Failure patterns describe typical ways in which inputs revealing program failure are distributed across the input domain—in many cases, clustered together in contiguous regions. Based on these observations several debug testing methods have been developed. We examine the upper bound of debug testing effectiveness improvements possible through making assumptions about the shape, size and orientation of failure patterns. We consider the bounds for testing strategies with respect to minimizing the F-measure, maximizing the P-measure, and maximizing the E-measure. Surprisingly, we find that the empirically measured effectiveness of some existing methods that are not based on these assumptions is close to the theoretical upper bound of these strategies. The assumptions made to obtain the upper bound, and its further implications, are also examined.