A practical guide for using statistical tests to assess randomized algorithms in software engineering

Authors:
Andrea Arcuri; Lionel Briand
Affiliations:
Simula Research Laboratory, Lysaker, Norway; Simula Research Laboratory & University of Oslo, Lysaker, Norway
Venue:
Proceedings of the 33rd International Conference on Software Engineering
Year:
2011

Abstract

Randomized algorithms have been used to successfully address many different types of software engineering problems. This type of algorithm employs a degree of randomness as part of its logic. Randomized algorithms are useful for difficult problems where a precise solution cannot be derived deterministically within reasonable time. However, randomized algorithms produce different results on every run when applied to the same problem instance. It is hence important to assess the effectiveness of randomized algorithms by collecting data from a large enough number of runs. The use of rigorous statistical tests is then essential to support the conclusions derived by analyzing such data. In this paper, we provide a systematic review of the use of randomized algorithms in selected software engineering venues in 2009. The goal of the review is not to perform a complete survey but to get a representative snapshot of current practice in software engineering research. We show that randomized algorithms are used in a significant percentage of papers but that, in most cases, randomness is not properly accounted for. This casts doubt on the validity of most empirical results assessing randomized algorithms. There are numerous statistical tests, based on different assumptions, and it is not always clear when and how to use these tests. We hence provide practical guidelines to support empirical research on randomized algorithms in software engineering.
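
The workflow the abstract describes (collect many runs, then test the difference statistically) can be illustrated with a minimal Python sketch. The abstract does not name a specific test, so everything here is an assumption made for illustration: `random_search` is a hypothetical toy algorithm, the choice of 30 runs is arbitrary, and the two-sided permutation test and the Vargha-Delaney A12 effect size are picked only because they need no distributional assumptions and can be written with the standard library alone.

```python
import random
from statistics import mean


def random_search(budget, rng):
    """Hypothetical randomized algorithm: best of `budget` uniform samples (higher is better)."""
    return max(rng.random() for _ in range(budget))


def collect_runs(budget, n_runs, seed):
    """Run the algorithm n_runs times, giving each run its own seeded generator."""
    master = random.Random(seed)
    return [
        random_search(budget, random.Random(master.getrandbits(32)))
        for _ in range(n_runs)
    ]


def permutation_p_value(a, b, n_perm=2000, seed=0):
    """Two-sided permutation test on the absolute difference of means."""
    rng = random.Random(seed)
    observed = abs(mean(a) - mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:len(a)]) - mean(pooled[len(a):]))
        if diff >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return (hits + 1) / (n_perm + 1)


def a12(a, b):
    """Vargha-Delaney A12: chance that a run from `a` beats a run from `b` (ties count half)."""
    wins = sum((x > y) + 0.5 * (x == y) for x in a for y in b)
    return wins / (len(a) * len(b))


# Two configurations of the same toy algorithm, 30 independent runs each.
runs_small = collect_runs(budget=10, n_runs=30, seed=1)
runs_large = collect_runs(budget=100, n_runs=30, seed=2)

p = permutation_p_value(runs_large, runs_small)
effect = a12(runs_large, runs_small)
print(f"p-value = {p:.4f}, A12 = {effect:.3f}")
```

Reporting an effect size next to the p-value is a deliberate choice in this sketch: the p-value only indicates whether a difference is likely to be real, while A12 indicates how often one configuration actually beats the other. Comparing a single run of each configuration would give neither.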