Software reliability: measurement, prediction, application
Software reliability: measurement, prediction, application
Modern mathematical statistics
Modern mathematical statistics
A Two-Person Inspection Method to Improve Programming Productivity
IEEE Transactions on Software Engineering
Experience with Fagan's inspection method
Software—Practice & Experience
Estimating software fault content before coding
ICSE '92 Proceedings of the 14th international conference on Software engineering
Assessing Software Designs Using Capture-Recapture Methods
IEEE Transactions on Software Engineering - Special issue on software reliability
Software Inspection
Software defect-removal efficiency
Computer
Software Inspections: An Effective Verification Process
IEEE Software
Lessons from Three Years of Inspection Data
IEEE Software
An experiment to assess cost-benefits of inspection meetings and their alternatives: a pilot study
METRICS '96 Proceedings of the 3rd International Symposium on Software Metrics: From Measurement to Empirical Results
An Analysis of Competing Software Reliability Models
IEEE Transactions on Software Engineering
Design and code inspections to reduce errors in program development
IBM Systems Journal
Defect content estimations from review data
Proceedings of the 20th international conference on Software engineering
Evaluating Capture-Recapture Models with Two Inspectors
IEEE Transactions on Software Engineering
Empirical Software Engineering
An Empirical Method for Selecting Software Reliability Growth Models
Empirical Software Engineering
ASSISTing Management Decisions in the Software Inspection Process
Information Technology and Management
On the independence of software inspectors
Journal of Systems and Software
An Empirical Study of Experience-Based Software Defect Content Estimation Methods
ISSRE '99 Proceedings of the 10th International Symposium on Software Reliability Engineering
Adding Diversity to Software Inspections
ICCI '03 Proceedings of the 2nd IEEE International Conference on Cognitive Informatics
An experiment to investigate interacting versus nominal groups in software inspection
CASCON '03 Proceedings of the 2003 conference of the Centre for Advanced Studies on Collaborative research
Managing software quality through a hybrid defect content and effectiveness model
Proceedings of the Second ACM-IEEE international symposium on Empirical software engineering and measurement
Empirical Software Engineering
Hi-index | 0.00 |
An important requirement to control the inspection of software artifacts is to be able to decide, based on objective information, whether inspection can stop or whether it should continue to achieve a suitable level of artifact quality. Several studies in software engineering have considered the use of capture-recapture models to predict the number of remaining defects in an inspected document as a decision criterion about reinspection. However, no study on software engineering artifacts compares the actual number of remaining defects to the one predicted by a capture-recapture model. Simulations have been performed but no definite conclusions can be drawn regarding the degree of accuracy of such models under realistic inspection conditions, and the factors affecting this accuracy. Furthermore, none of these studies performed an exhaustive comparison of existing models. In this study, we focus on traditional inspections and estimate, based on actual inspections' data, the degree of accuracy of all relevant, state-of-the-art, capture-recapture models for which statistical estimators exist. We compare the various models' accuracies and look at the impact of the number of inspectors on these accuracies. Results show that models' accuracies are strongly affected by the number of inspectors and, therefore, one must consider this factor before using capture-recapture models. When the number of inspectors is below 4, no model is sufficiently accurate and underestimation may be substantial. In addition, some models perform better than others in a large number of conditions and plausible reasons are discussed. Based on our analyses, we recommend using a model taking into account that defects have different probabilities of being detected and a corresponding Jackknife estimator.