A new perspective on data homogeneity in software cost estimation: a study in the embedded systems domain

  • Authors:
  • Ayşe Bakır; Burak Turhan; Ayşe B. Bener

  • Affiliations:
  • Department of Computer Engineering, Boğaziçi University, Bebek, Istanbul, Turkey 34342; Software Engineering Group, Institute for Information Technology, National Research Council of Canada, Ottawa, Canada K1A0R6; Department of Computer Engineering, Boğaziçi University, Bebek, Istanbul, Turkey 34342

  • Venue:
  • Software Quality Control
  • Year:
  • 2010

Abstract

Cost estimation and effort allocation are key challenges for successful project planning and management in software development. Both industry and the research community have therefore been working on various models and techniques to predict project cost accurately. Recently, researchers have started debating whether prediction performance depends more on the structure of the data than on the models used. In this article, we focus on a new aspect of data homogeneity, "cross- versus within-application domain", and investigate what kind of training data should be used for software cost estimation in the embedded systems domain. In addition, we examine the effect of training dataset size on prediction performance. Based on our empirical results, we conclude that it is better to use cross-domain data for embedded software cost estimation and that the optimum training data size depends on the method used.