An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs-one and one-vs-all schemes

Authors:
Mikel Galar;Alberto Fernández;Edurne Barrenechea;Humberto Bustince;Francisco Herrera
Affiliations:
Departamento de Automática y Computación, Universidad Pública de Navarra, Campus Arrosadía s/n, P.O. Box 31006, Pamplona, Spain;Department of Computer Science, University of Jaén, P.O. Box 23071, Jaén, Spain;Departamento de Automática y Computación, Universidad Pública de Navarra, Campus Arrosadía s/n, P.O. Box 31006, Pamplona, Spain;Departamento de Automática y Computación, Universidad Pública de Navarra, Campus Arrosadía s/n, P.O. Box 31006, Pamplona, Spain;Department of Computer Science and Artificial Intelligence, University of Granada, P.O. Box 18071, Granada, Spain
Venue:
Pattern Recognition
Year:
2011

Citing 53
Cited 22

Instance-Based Learning Algorithms

Machine Learning
Modern Information Retrieval

Modern Information Retrieval
Rule Induction with CN2: Some Recent Improvements

EWSL '91 Proceedings of the European Working Session on Machine Learning
Reducing multiclass to binary: a unifying approach for margin classifiers

The Journal of Machine Learning Research
Round robin classification

The Journal of Machine Learning Research
Pattern Classification (2nd Edition)

Pattern Classification (2nd Edition)
In Defense of One-Vs-All Classification

The Journal of Machine Learning Research
Editorial: special issue on learning from imbalanced data sets

ACM SIGKDD Explorations Newsletter - Special issue on learning from imbalanced datasets
Effectiveness of error correcting output coding methods in ensemble and monolithic learning machines

Pattern Analysis & Applications
Ensembles of nested dichotomies for multi-class problems

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Probability Estimates for Multi-class Classification by Pairwise Coupling

The Journal of Machine Learning Research
Introduction to Machine Learning (Adaptive Computation and Machine Learning)

Introduction to Machine Learning (Adaptive Computation and Machine Learning)
Using AUC and Accuracy in Evaluating Learning Algorithms

IEEE Transactions on Knowledge and Data Engineering
A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression

Bioinformatics
Improving Multiclass Pattern Recognition by the Combination of Two Strategies

IEEE Transactions on Pattern Analysis and Machine Intelligence
Discriminant ECOC: A Heuristic Method for Application Dependent Design of Error Correcting Output Codes

IEEE Transactions on Pattern Analysis and Machine Intelligence
Data Complexity in Pattern Recognition (Advanced Information and Knowledge Processing)

Data Complexity in Pattern Recognition (Advanced Information and Knowledge Processing)
Nesting Algorithm for Multi-Classification Problems

Soft Computing - A Fusion of Foundations, Methodologies and Applications
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)

Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Statistical Comparisons of Classifiers over Multiple Data Sets

The Journal of Machine Learning Research
Sharing Visual Features for Multiclass and Multiview Object Detection

IEEE Transactions on Pattern Analysis and Machine Intelligence
A lot of randomness is hiding in accuracy

Engineering Applications of Artificial Intelligence
Round robin ensembles

Intelligent Data Analysis
Fingerprint classification using one-vs-all support vector machines dynamically ordered with naïve Bayes classifiers

Pattern Recognition
An incremental node embedding technique for error correcting output codes

Pattern Recognition
Learning valued preference structures for solving classification problems

Fuzzy Sets and Systems
Efficient Multiclass ROC Approximation by Decomposition via Confusion Matrix Perturbation Analysis

IEEE Transactions on Pattern Analysis and Machine Intelligence
An experimental comparison of performance measures for classification

Pattern Recognition Letters
KEEL: a software tool to assess evolutionary algorithms for data mining problems

Soft Computing - A Fusion of Foundations, Methodologies and Applications - Special Issue on Evolutionary and Metaheuristics based Data Mining (EMBDM); Guest Editors: José A. Gámez, María J. del Jesús, José M. Puerta
A genetic programming-based approach to the classification of multiclass microarray datasets

Bioinformatics
A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability

Soft Computing - A Fusion of Foundations, Methodologies and Applications
Learning from Imbalanced Data

IEEE Transactions on Knowledge and Data Engineering
Combining predictions in pairwise classification: An optimal adaptive voting strategy and its relation to weighted voting

Pattern Recognition
Binary Decomposition Methods for Multipartite Ranking

ECML PKDD '09 Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases: Part I
Solving multiclass learning problems via error-correcting output codes

Journal of Artificial Intelligence Research
Flexible learning of problem solving heuristics through adaptive search

IJCAI'83 Proceedings of the Eighth international joint conference on Artificial intelligence - Volume 1
FR3: a fuzzy rule learner for inducing reliable classifiers

IEEE Transactions on Fuzzy Systems
Domains of competence of fuzzy rule based classification systems with data complexity measures: A case of study using a fuzzy hybrid genetic based machine learning method

Fuzzy Sets and Systems
A review on the combination of binary classifiers in multiclass problems

Artificial Intelligence Review
A multi-class classification strategy for Fisher scores: Application to signer independent sign language recognition

Pattern Recognition
Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power

Information Sciences: an International Journal
Multi-class pairwise linear dimensionality reduction using heteroscedastic schemes

Pattern Recognition
Solving multi-class problems with linguistic fuzzy rule based classification systems based on pairwise learning and preference relations

Fuzzy Sets and Systems
Genetics-based machine learning for rule induction: state of the art, taxonomy, and comparative study

IEEE Transactions on Evolutionary Computation
Beyond accuracy, f-score and ROC: a family of discriminant measures for performance evaluation

AI'06 Proceedings of the 19th Australian joint conference on Artificial Intelligence: advances in Artificial Intelligence
Domain of competence of XCS classifier system in complexity measurement space

IEEE Transactions on Evolutionary Computation
Multiclass Support Vector Machines for EEG-Signals Classification

IEEE Transactions on Information Technology in Biomedicine
Support vector learning for fuzzy rule-based classification systems

IEEE Transactions on Fuzzy Systems
A comparison of methods for multiclass support vector machines

IEEE Transactions on Neural Networks
New results on error correcting output codes of kernel machines

IEEE Transactions on Neural Networks
Binary tree of SVM: a new fast multiclass training and classification algorithm

IEEE Transactions on Neural Networks
Nesting One-Against-One Algorithm Based on SVMs for Pattern Classification

IEEE Transactions on Neural Networks
Efficient classification for multiclass problems using modular neural networks

IEEE Transactions on Neural Networks

Using confusion matrices and confusion graphs to design ensemble classification models from large datasets

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Comparing multi-class classifiers: on the similarity of confusion matrices for predictive toxicology applications

IDEAL'11 Proceedings of the 12th international conference on Intelligent data engineering and automated learning
Empirical comparison of four classifier fusion strategies for positive-versus-negative ensembles

Proceedings of the South African Institute of Computer Scientists and Information Technologists Conference on Knowledge, Innovation and Leadership in a Diverse, Multidisciplinary Environment
A first study on decomposition strategies with data with class noise using decision trees

HAIS'12 Proceedings of the 7th international conference on Hybrid Artificial Intelligent Systems - Volume Part II
Efficient classifiers for multi-class classification problems

Decision Support Systems
A noise-detection based AdaBoost algorithm for mislabeled data

Pattern Recognition
Transductive cost-sensitive lung cancer image classification

Applied Intelligence
A class centric feature and classifier ensemble selection approach for music genre classification

SSPR'12/SPR'12 Proceedings of the 2012 Joint IAPR international conference on Structural, Syntactic, and Statistical Pattern Recognition
Training inter-related classifiers for automatic image classification and annotation

Pattern Recognition
Comparison of fuzzy combiner training methods

ICCCI'12 Proceedings of the 4th international conference on Computational Collective Intelligence: technologies and applications - Volume Part I
Analysing the classification of imbalanced data-sets with multiple classes: Binarization techniques and ad-hoc approaches

Knowledge-Based Systems
Data weighting method on the basis of binary encoded output to solve multi-class pattern classification problems

Expert Systems with Applications: An International Journal
The one-against-all partition based binary tree support vector machine algorithms for multi-class classification

Neurocomputing
Base Model Combination Algorithm for Resolving Tied Predictions for K-Nearest Neighbor OVA Ensemble Models

INFORMS Journal on Computing
Dynamic classifier selection for One-vs-One strategy: Avoiding non-competent classifiers

Pattern Recognition
A survey of multiple classifier systems as hybrid systems

Information Fusion
Identification of grapevine varieties using leaf spectroscopy and partial least squares

Computers and Electronics in Agriculture
Diversity measures for one-class classifier ensembles

Neurocomputing
Feature selection for high-dimensional multi-category data using PLS-based local recursive feature elimination

Expert Systems with Applications: An International Journal
Novel multiclass classification for home-based diagnosis of sleep apnea hypopnea syndrome

Expert Systems with Applications: An International Journal
Empowering difficult classes with a similarity-based aggregation in multi-class classification problems

Information Sciences: an International Journal
Clustering-based ensembles for one-class classification

Information Sciences: an International Journal

Quantified Score

Hi-index	0.01

Visualization

Abstract

Classification problems involving multiple classes can be addressed in different ways. One of the most popular techniques consists in dividing the original data set into two-class subsets, learning a different binary model for each new subset. These techniques are known as binarization strategies. In this work, we are interested in ensemble methods by binarization techniques; in particular, we focus on the well-known one-vs-one and one-vs-all decomposition strategies, paying special attention to the final step of the ensembles, the combination of the outputs of the binary classifiers. Our aim is to develop an empirical analysis of different aggregations to combine these outputs. To do so, we develop a double study: first, we use different base classifiers in order to observe the suitability and potential of each combination within each classifier. Then, we compare the performance of these ensemble techniques with the classifiers' themselves. Hence, we also analyse the improvement with respect to the classifiers that handle multiple classes inherently. We carry out the experimental study with several well-known algorithms of the literature such as Support Vector Machines, Decision Trees, Instance Based Learning or Rule Based Systems. We will show, supported by several statistical analyses, the goodness of the binarization techniques with respect to the base classifiers and finally we will point out the most robust techniques within this framework.