A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability

Authors:
S. García;A. Fernández;J. Luengo;F. Herrera
Affiliations:
University of Jaén, Department of Computer Science, 23071, Jaén, Spain;University of Granada, Department of Computer Science and Artificial Intelligence, 18071, Granada, Spain;University of Granada, Department of Computer Science and Artificial Intelligence, 18071, Granada, Spain;University of Granada, Department of Computer Science and Artificial Intelligence, 18071, Granada, Spain
Venue:
Soft Computing - A Fusion of Foundations, Methodologies and Applications
Year:
2009

Abstract

Experimental analysis of the performance of a proposed method is a crucial and necessary task in research. This paper focuses on the statistical analysis of results in the field of genetics-based machine learning. It presents a study of a set of techniques that can be used to compare algorithms rigorously in terms of their ability to obtain successful classification models. Two accuracy measures for multi-class problems are employed: classification rate and Cohen’s kappa. Two interpretability measures are also employed: the size of the rule set and the number of antecedents. We study whether the samples of results obtained by genetics-based classifiers, under the performance measures cited above, satisfy the conditions required for analysis by means of parametric tests. The results show that the fulfillment of these conditions is problem-dependent and cannot be determined in advance, which supports the use of non-parametric statistics in the experimental analysis. In addition, non-parametric tests can be satisfactorily employed to compare generic classifiers over several data-sets with any performance measure. Given these facts, we propose the use of the most powerful non-parametric statistical tests to carry out multiple comparisons. However, the statistical analysis conducted on interpretability must be considered carefully.
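As an illustration of the measures named in the abstract, the following minimal Python sketch computes classification rate and Cohen's kappa from a confusion matrix, and a Friedman statistic over several data-sets. The abstract does not name the specific non-parametric tests it recommends, so the Friedman test is an assumption here, chosen as one common rank-based test for comparing several classifiers. The function names and the layout of the inputs are likewise hypothetical.

```python
def classification_rate(confusion):
    """Fraction of correctly classified examples (diagonal / total)."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


def cohen_kappa(confusion):
    """Cohen's kappa: agreement corrected for agreement expected by chance."""
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    observed = sum(confusion[i][i] for i in range(n)) / total
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(confusion[r][c] for r in range(n)) for c in range(n)]
    expected = sum(row_totals[i] * col_totals[i] for i in range(n)) / total ** 2
    return (observed - expected) / (1 - expected)


def average_ranks(scores):
    """Mean rank of each algorithm; scores[d][j] is algorithm j on data-set d.

    Higher scores get better (lower) ranks; ties share the average rank.
    """
    k = len(scores[0])
    rank_sums = [0.0] * k
    for row in scores:
        order = sorted(range(k), key=lambda j: -row[j])
        pos = 0
        while pos < k:
            end = pos
            while end + 1 < k and row[order[end + 1]] == row[order[pos]]:
                end += 1
            shared = (pos + end) / 2 + 1  # mean of 1-based ranks pos+1 .. end+1
            for j in order[pos:end + 1]:
                rank_sums[j] += shared
            pos = end + 1
    return [s / len(scores) for s in rank_sums]


def friedman_statistic(scores):
    """Friedman chi-square statistic (assumed test, not named in the abstract)."""
    n, k = len(scores), len(scores[0])
    ranks = average_ranks(scores)
    return 12 * n / (k * (k + 1)) * (sum(r * r for r in ranks) - k * (k + 1) ** 2 / 4)


# Illustrative, made-up numbers: rows are true classes, columns are predictions.
confusion = [[20, 5], [10, 15]]
# Illustrative, made-up accuracies: 4 data-sets (rows) by 3 classifiers (columns).
scores = [[0.90, 0.80, 0.70], [0.85, 0.80, 0.60], [0.95, 0.90, 0.70], [0.70, 0.65, 0.60]]
```

Kappa is lower than the classification rate whenever part of the observed agreement could arise by chance, which is why the two measures can rank classifiers differently on multi-class problems. The Friedman statistic would then be compared against a chi-square distribution with k - 1 degrees of freedom before any post-hoc multiple comparison is attempted.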