A comprehensive empirical evaluation of missing value imputation in noisy software measurement data
Journal of Systems and Software
SPDW+: a seamless approach for capturing quality metrics in software development environments
Software Quality Control
Software mining and fault prediction
Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
Information Sciences: an International Journal
Hi-index | 0.01 |
Data is extremely important in empirical software engineering. Techniques that provide insight into potential anomalies or inaccuracies in a dataset are becoming an increasingly important way for a data analyst to cope with flawed data. We present a novel hybrid procedure for quantitative outcome correction along with controlled experiments using a real-world software measurement dataset to demonstrate the usefulness of our technique. Instances that are deemed to be noisy relative to the dependent variable, which represents the number of faults recorded in the program module, are cleansed by replacing the original value with a more appropriate alternative value.