ACSC '09 Proceedings of the Thirty-Second Australasian Conference on Computer Science - Volume 91
On the dataset shift problem in software engineering prediction models
Empirical Software Engineering
EASE'09 Proceedings of the 13th international conference on Evaluation and Assessment in Software Engineering
EASE'08 Proceedings of the 12th international conference on Evaluation and Assessment in Software Engineering
Hi-index | 0.00 |
Several studies have been conducted to determine if company-specific cost models deliver better prediction accuracy than cross-company cost models. However, mixed results have left the question still open for further investigation. We suspect this to be a consequence of heterogenous data used to build cross-company cost models. In this paper, we build cross-company cost models using homogenous data by grouping projects by their business sector. Our results suggest that it is worth to train models using only homogenous data rather than all projects available.