Statistical significance testing: a panacea for software technology experiments?

Authors:
James Miller
Affiliations:
Software Technology, Engineering and Measurement research center (STEAM), Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Alberta, Canada T6H 5M3
Venue:
Journal of Systems and Software - Special issue: Applications of statistics in software engineering
Year:
2004

Citing 9
Cited 9

Statistical reasoning

Statistical reasoning
Shotgun correlations in software measures

Software Engineering Journal
Building Knowledge through Families of Experiments

IEEE Transactions on Software Engineering
Multi-method research: an empirical investigation of object-oriented technology

Journal of Systems and Software
Applying meta-analytical procedures to software engineering experiments

Journal of Systems and Software
Studying programmer behavior experimentally: the problems of proper methodology

Communications of the ACM
Modeling Development Effort in Object-Oriented Systems Using Design Properties

IEEE Transactions on Software Engineering - Special section on the seventh international software metrics symposium
Investigating Reading Techniques for Object-Oriented Framework Learning

IEEE Transactions on Software Engineering
Practical assessment of the models for identification of defect-prone classes in object-oriented commercial systems using design metrics

Journal of Systems and Software

A Cognitive-Based Mechanism for Constructing Software Inspection Teams

IEEE Transactions on Software Engineering
Empirical evaluation of optimization algorithms when used in goal-oriented automated test data generation techniques

Empirical Software Engineering
A Survey and Analysis of the P3P Protocol's Agents, Adoption, Maintenance, and Future

IEEE Transactions on Dependable and Secure Computing
Software engineering article types: An analysis of the literature

Journal of Systems and Software
Estimating software readiness using predictive models

Information Sciences: an International Journal
Comparison of different documentation styles for frameworks of object-oriented code

Behaviour & Information Technology
Replicating software engineering experiments: a poisoned chalice or the Holy Grail

Information and Software Technology
Current state and potential of variability management practices in software-intensive SMEs: Results from a regional industrial survey

Information and Software Technology
Adopting model driven software development in industry: a case study at two companies

MoDELS'06 Proceedings of the 9th international conference on Model Driven Engineering Languages and Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

Empirical software engineering has a long history of utilizing statistical significance testing, and in many ways, it has become the backbone of the topic. What is less obvious is how much consideration has been given to its adoption. Statistical significance testing was initially designed for testing hypotheses in a very different area, and hence the question must be asked: does it transfer into empirical software engineering research? This paper attempts to address this question. The paper finds that this transference is far from straightforward, resulting in several problems in its deployment within the area. Principally problems exist in: formulating hypotheses, the calculation of the probability values and its associated cut-off value, and the construction of the sample and its distribation. Hence, the paper concludes that the topic should explore other avenues of analysis, in an attempt to establish which analysis approaches are preferable under which conditions, when conducting empirical software engineering studies.