What Should You Optimize When Building an Estimation Model?

Authors:
Chris Lokan
Affiliations:
Australian Defence Force Academy
Venue:
METRICS '05 Proceedings of the 11th IEEE International Software Metrics Symposium
Year:
2005

Citing 0
Cited 5

A constrained regression technique for cocomo calibration

Proceedings of the Second ACM-IEEE international symposium on Empirical software engineering and measurement
An empirical evaluation of outlier deletion methods for analogy-based cost estimation

Proceedings of the 7th International Conference on Predictive Models in Software Engineering
Search-based software engineering: Trends, techniques and applications

ACM Computing Surveys (CSUR)
How to treat timing information for software effort estimation?

Proceedings of the 2013 International Conference on Software and System Process
Revisiting software development effort estimation based on early phase development activities

Proceedings of the 10th Working Conference on Mining Software Repositories

Quantified Score

Hi-index	0.00

Visualization

Abstract

When estimation models are derived from existing data, they are commonly evaluated using statistics such as mean magnitude of relative error. But when the models are derived in the first place, it is usually by optimizing something else 驴 typically, as in statistical regression, by minimizing the sum of squared deviations. How do estimation models for typical software engineering data fare, on various common accuracy statistics, if they are derived using other "fitness functions"? In this study, estimation models are built using a variety of fitness functions, and evaluated using a wide range of accuracy statistics. We find that models based on minimizing actual errors generally out-perform models based on minimizing relative errors. Given the nature of software engineering data sets, minimizing the sum of absolute deviations seems an effective compromise.