Data accumulation and software effort prediction

Authors:
Stephen G. MacDonell;Martin Shepperd
Affiliations:
AUT University, Auckland, New Zealand;Brunel University, Uxbridge, United Kingdom
Venue:
Proceedings of the 2010 ACM-IEEE International Symposium on Empirical Software Engineering and Measurement
Year:
2010

Citing 9
Cited 1

Re-Planning for a Successful Project Schedule

METRICS '99 Proceedings of the 6th International Symposium on Software Metrics
An Empirical Study of Effort Estimation during Project Execution

METRICS '99 Proceedings of the 6th International Symposium on Software Metrics
Using Prior-Phase Effort Records for Re-estimation During Software Projects

METRICS '03 Proceedings of the 9th International Symposium on Software Metrics
Group Processes in Software Effort Estimation

Empirical Software Engineering
Controlling Software Projects: Management, Measurement, and Estimates

Controlling Software Projects: Management, Measurement, and Estimates
Effort Prediction in Iterative Software Development Processes -- Incremental Versus Global Prediction Models

ESEM '07 Proceedings of the First International Symposium on Empirical Software Engineering and Measurement
Applying moving windows to software effort estimation

ESEM '09 Proceedings of the 2009 3rd International Symposium on Empirical Software Engineering and Measurement
Distributed global development parametric cost modeling

ICSP'07 Proceedings of the 2007 international conference on Software process
Investigating the use of chronological splitting to compare software cross-company and single-company effort predictions: a replicated study

EASE'09 Proceedings of the 13th international conference on Evaluation and Assessment in Software Engineering

How to treat timing information for software effort estimation?

Proceedings of the 2013 International Conference on Software and System Process

Quantified Score

Hi-index	0.00

Visualization

Abstract

BACKGROUND: In reality project managers are constrained by the incremental nature of data collection. Specifically, project observations are accumulated one project at a time. Likewise within-project data are accumulated one stage or phase at a time. However, empirical researchers have given limited attention to this perspective. PROBLEM: Consequently, our analyses may be biased. On the one hand, our predictions may be optimistic due to the availability of the entire data set, but on the other hand pessimistic due to the failure to capitalize upon the temporal nature of the data. Our goals are (i) to explore the impact of ignoring time when building cost prediction models and (ii) to show the benefits of re-estimating using completed phase data during a project. METHOD: Using a small industrial data set of sixteen software projects from a single organization we compare predictive models developed using a time-aware approach with a more traditional leave-one-out analysis. We then investigate the impact of using requirements, design and implementation phase data on estimating subsequent phase effort. RESULTS: First, we find that failure to take the temporal nature of data into account leads to unreliable estimates of their predictive efficacy. Second, for this organization, prior-phase effort data could be used to improve the management of subsequent process tasks. CONCLUSION: We should collect time-related data and use it in our analyses. Failure to do so may lead to incorrect conclusions being drawn, and may also inhibit industrial take up of our research work.