Analyzing software effort estimation using k means clustered regression approach

Authors:
Geeta Nagpal;Moin Uddin;Arvinder Kaur
Affiliations:
National Institute of Technology, Jalandhar;Delhi Technological University, Delhi;University School of Information Technology, Indraprastha University, Delhi
Venue:
ACM SIGSOFT Software Engineering Notes
Year:
2013

Citing 11
Cited 1

Introduction to Grey system theory

The Journal of Grey System
Software Engineering Economics

Software Engineering Economics
On-Demand Forecasting of Stock Prices Using a Real-Time Predictor

IEEE Transactions on Knowledge and Data Engineering
Using Grey Relational Analysis to Predict Software Effort with Small Data Sets

METRICS '05 Proceedings of the 11th IEEE International Software Metrics Symposium
A flexible method for software effort estimation by analogy

Empirical Software Engineering
A General Empirical Solution to the Macro Software Sizing and Estimating Problem

IEEE Transactions on Software Engineering
Software Function, Source Lines of Code, and Development Effort Prediction: A Software Science Validation

IEEE Transactions on Software Engineering
Fuzzy grey relational analysis for software effort estimation

Empirical Software Engineering
Target tracking using a hierarchical grey-fuzzy motion decision-making method

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Alternating cluster estimation: a new tool for clustering and function approximation

IEEE Transactions on Fuzzy Systems
The gray prediction search algorithm for block motion estimation

IEEE Transactions on Circuits and Systems for Video Technology

LMES: A localized multi-estimator model to estimate software development effort

Engineering Applications of Artificial Intelligence

Quantified Score

Hi-index	0.00

Visualization

Abstract

Software estimation is an area where more assurances have been broken than in any other area of software development. Numerous studies attempting new and reliable software effort estimation techniques have been proposed but no consensus as to which techniques are the most appropriate has been reached so far. Due to the intangible nature of "software", effort estimation with a high level of accuracy remains a dream for developers. It is unlikely to expect very accurate estimates of development effort because of the inherent uncertainty in software projects and the complex and dynamic interaction of factors that impact software development. Heterogeneity exists in software engineering datasets because data is obtained from diverse sources. This can be reduced by defining certain relationships between the data values by classifying them into different clusters. This study focuses on how the combination of clustering and regression techniques can reduce the potential problem in effectiveness of predictive efficiency due to heterogeneity of the data. Using a clustered approach creates subsets of data having a degree of homogeneity that enhances prediction accuracy. It was also observed in this study that ridge regression performs better than other regression techniques. Another key finding is that by selecting a subset of highly predictive attributes using Grey relational analysis a significant improvement in prediction can be achieved.