Effectiveness for detecting faults within and outside the scope of testing techniques: an independent replication

Authors:
Cecilia Apa;Oscar Dieste;Edison G. Espinosa G.;Efraín R. Fonseca C.
Affiliations:
Universidad de la República, Montevideo, Uruguay 565;Universidad Politécnica de Madrid, Madrid, Spain 28660;Escuela Politécnica del Ejército Sede Latacunga, Latacunga, Ecuador;Escuela Politécnica del Ejército, Sangolquí, Ecuador
Venue:
Empirical Software Engineering
Year:
2014

Citing 8
Cited 0

Comparing the Effectiveness of Software Testing Strategies

IEEE Transactions on Software Engineering
Software modeling and measurement: the Goal/Question/Metric paradigm

Software modeling and measurement: the Goal/Question/Metric paradigm
Comparing and combining software defect detection techniques: a replicated empirical study

ESEC '97/FSE-5 Proceedings of the 6th European SOFTWARE ENGINEERING conference held jointly with the 5th ACM SIGSOFT international symposium on Foundations of software engineering
A controlled experiment in program testing and code walkthroughs/inspections

Communications of the ACM
An Empirical Evaluation of Three Defect-Detection Techniques

Proceedings of the 5th European Software Engineering Conference
The Case Against Cross-Over Designs in Software Engineering

STEP '03 Proceedings of the Eleventh Annual International Workshop on Software Technology and Engineering Practice
A Survey of Controlled Experiments in Software Engineering

IEEE Transactions on Software Engineering
Replications types in experimental disciplines

Proceedings of the 2010 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement

Quantified Score

Hi-index	0.00

Visualization

Abstract

The verification and validation activity plays a fundamental role in improving software quality. Determining which the most effective techniques for carrying out this activity are has been an aspiration of experimental software engineering researchers for years. This paper reports a controlled experiment evaluating the effectiveness of two unit testing techniques (the functional testing technique known as equivalence partitioning (EP) and the control-flow structural testing technique known as branch testing (BT)). This experiment is a literal replication of Juristo et al. (2013). Both experiments serve the purpose of determining whether the effectiveness of BT and EP varies depending on whether or not the faults are visible for the technique (InScope or OutScope, respectively). We have used the materials, design and procedures of the original experiment, but in order to adapt the experiment to the context we have: (1) reduced the number of studied techniques from 3 to 2; (2) assigned subjects to experimental groups by means of stratified randomization to balance the influence of programming experience; (3) localized the experimental materials and (4) adapted the training duration. We ran the replication at the Escuela Politécnica del Ejército Sede Latacunga (ESPEL) as part of a software verification & validation course. The experimental subjects were 23 master's degree students. EP is more effective than BT at detecting InScope faults. The session/program and group variables are found to have significant effects. BT is more effective than EP at detecting OutScope faults. The session/program and group variables have no effect in this case. The results of the replication and the original experiment are similar with respect to testing techniques. There are some inconsistencies with respect to the group factor. They can be explained by small sample effects. The results for the session/program factor are inconsistent for InScope faults. We believe that these differences are due to a combination of the fatigue effect and a technique x program interaction. Although we were able to reproduce the main effects, the changes to the design of the original experiment make it impossible to identify the causes of the discrepancies for sure. We believe that further replications closely resembling the original experiment should be conducted to improve our understanding of the phenomena under study.