A cost model to estimate the effort of data mining projects (DMCoMo)

  • Authors:
  • Oscar Marbán;Ernestina Menasalvas;Covadonga Fernández-Baizán

  • Affiliations:
  • Facultad de Informática, Universidad Politécnica de Madrid (U.P.M.), Campus de Montegancedo s/n., 28660 Boadilla del Monte, Madrid, Spain;Facultad de Informática, Universidad Politécnica de Madrid (U.P.M.), Campus de Montegancedo s/n., 28660 Boadilla del Monte, Madrid, Spain;Facultad de Informática, Universidad Politécnica de Madrid (U.P.M.), Campus de Montegancedo s/n., 28660 Boadilla del Monte, Madrid, Spain

  • Venue:
  • Information Systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

CRISP-DM is the standard to develop Data Miningprojects. CRISP-DM proposes processes and tasks that you have to carry out to develop a Data Miningproject. A task proposed by CRISP-DM is the cost estimation of the Data Miningproject. In software development a lot of methods are described to estimate the costs of project development (SLIM, SEER-SEM, PRICE-S and COCOMO). These methods are not appropriate in the case of Data Miningprojects because in Data Miningsoftware development is not the first goal. Some methods have been proposed to estimate some phases of a Data Miningproject, but there is no method to estimate the global cost of a generic Data Miningproject. The lack of Data Miningproject estimation methods is because of many real-life project failures due to the non-realistic estimation at the beginning of the projects. Consequently, in this paper we propose to design and validate a parametric cost estimation model, similar to COCOMO or SLIM in software development, for Data Miningprojects (DMCoMo). The drivers of the model will be proposed first and later the equation of the model will be proposed.