Wrappers for feature subset selection
Artificial Intelligence - Special issue on relevance
Data mining: practical machine learning tools and techniques with Java implementations
Data mining: practical machine learning tools and techniques with Java implementations
SEW '02 Proceedings of the 27th Annual NASA Goddard Software Engineering Workshop (SEW-27'02)
Data Mining: Concepts and Techniques
Data Mining: Concepts and Techniques
Comparing Fault-Proneness Estimation Models
ICECCS '05 Proceedings of the 10th IEEE International Conference on Engineering of Complex Computer Systems
Mining metrics to predict component failures
Proceedings of the 28th international conference on Software engineering
Data Mining Static Code Attributes to Learn Defect Predictors
IEEE Transactions on Software Engineering
Empirical Analysis of Object-Oriented Design Metrics for Predicting High and Low Severity Faults
IEEE Transactions on Software Engineering
Mining software repositories for comprehensible software fault prediction models
Journal of Systems and Software
Predicting defect-prone software modules using support vector machines
Journal of Systems and Software
Proceedings of the 30th international conference on Software engineering
Journal of Software Maintenance and Evolution: Research and Practice
Journal of Systems and Software
Software quality analysis by combining multiple projects and learners
Software Quality Control
Review: A systematic review of software fault prediction studies
Expert Systems with Applications: An International Journal
Testing techniques selection based on ODC fault types and software metrics
Journal of Systems and Software
Hi-index | 0.00 |
It would be valuable to use metrics to identify the fault-proneness of software modules. However, few research works are on how to select appropriate metrics for fault-proneness prediction currently. We conduct a large-scale comparative experiment of nine different software metrics reduction methods over eleven public-domain data sets from the NASA metrics data repository. The Naive Bayes data miner, with a log-filtering preprocessor on the numeric data, is utilized to construct the prediction model. Comparisons are based on the analysis of variance. Our conclusion is that, reduction methods of software metrics are important to build adaptable and robust software fault-proneness prediction models. Given our results on Naive Bayes and log-filtering, discrete wavelet transformation outperforms other reduction methods, and correlationbased feature selection with genetic search algorithm and information gain can also obtain better predicted performance.