A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability

  • Authors:
  • S. García;A. Fernández;J. Luengo;F. Herrera

  • Affiliations:
  • University of Jaén, Department of Computer Science, 23071, Jaén, Spain;University of Granada, Department of Computer Science and Artificial Intelligence, 18071, Granada, Spain;University of Granada, Department of Computer Science and Artificial Intelligence, 18071, Granada, Spain;University of Granada, Department of Computer Science and Artificial Intelligence, 18071, Granada, Spain

  • Venue:
  • Soft Computing - A Fusion of Foundations, Methodologies and Applications
  • Year:
  • 2009

Quantified Score

Hi-index 0.01

Visualization

Abstract

The experimental analysis on the performance of a proposed method is a crucial and necessary task to carry out in a research. This paper is focused on the statistical analysis of the results in the field of genetics-based machine Learning. It presents a study involving a set of techniques which can be used for doing a rigorous comparison among algorithms, in terms of obtaining successful classification models. Two accuracy measures for multi-class problems have been employed: classification rate and Cohen’s kappa. Furthermore, two interpretability measures have been employed: size of the rule set and number of antecedents. We have studied whether the samples of results obtained by genetics-based classifiers, using the performance measures cited above, check the necessary conditions for being analysed by means of parametrical tests. The results obtained state that the fulfillment of these conditions are problem-dependent and indefinite, which supports the use of non-parametric statistics in the experimental analysis. In addition, non-parametric tests can be satisfactorily employed for comparing generic classifiers over various data-sets considering any performance measure. According to these facts, we propose the use of the most powerful non-parametric statistical tests to carry out multiple comparisons. However, the statistical analysis conducted on interpretability must be carefully considered.