Selective voting in convex-hull ensembles improves classification accuracy

Authors:
Ralph L. Kodell;Chuanlei Zhang;Eric R. Siegel;Radhakrishnan Nagarajan
Affiliations:
Department of Biostatistics, #781, University of Arkansas for Medical Sciences, 4301 W. Markham St., Little Rock, AR 72205, United States;Department of Biostatistics, #781, University of Arkansas for Medical Sciences, 4301 W. Markham St., Little Rock, AR 72205, United States;Department of Biostatistics, #781, University of Arkansas for Medical Sciences, 4301 W. Markham St., Little Rock, AR 72205, United States;Division of Biomedical Informatics, #782, University of Arkansas for Medical Sciences, 4301 W. Markham St., Little Rock, AR 72205, United States
Venue:
Artificial Intelligence in Medicine
Year:
2012

Citing 14
Cited 1

The nature of statistical learning theory

The nature of statistical learning theory
Bagging predictors

Machine Learning
The Random Subspace Method for Constructing Decision Forests

IEEE Transactions on Pattern Analysis and Machine Intelligence
Random Forests

Machine Learning
Ensembling neural networks: many could be better than all

Artificial Intelligence
Ensemble selection from libraries of models

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Classification by ensembles from random partitions of high-dimensional data

Computational Statistics & Data Analysis
Ensemble methods for classification of patients for personalized medicine with high-dimensional data

Artificial Intelligence in Medicine
Genetic algorithm-based feature set partitioning for classification problems

Pattern Recognition
Collective-agreement-based pruning of ensembles

Computational Statistics & Data Analysis
A model-free ensemble method for class prediction with application to biomedical decision making

Artificial Intelligence in Medicine
Ensemble-based classifiers

Artificial Intelligence Review
Application of majority voting to pattern recognition: an analysis of its behavior and performance

IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans
Nearest neighbor pattern classification

IEEE Transactions on Information Theory

Subpopulation-specific confidence designation for more informative biomedical classification

Artificial Intelligence in Medicine

Quantified Score

Hi-index	0.00

Visualization

Abstract

Objective: Classification algorithms can be used to predict risks and responses of patients based on genomic and other high-dimensional data. While there is optimism for using these algorithms to improve the treatment of diseases, they have yet to demonstrate sufficient predictive ability for routine clinical practice. They generally classify all patients according to the same criteria, under an implicit assumption of population homogeneity. The objective here is to allow for population heterogeneity, possibly unrecognized, in order to increase classification accuracy and further the goal of tailoring therapies on an individualized basis. Methods and materials: A new selective-voting algorithm is developed in the context of a classifier ensemble of two-dimensional convex hulls of positive and negative training samples. Individual classifiers in the ensemble are allowed to vote on test samples only if those samples are located within or behind pruned convex hulls of training samples that define the classifiers. Results: Validation of the new algorithm's increased accuracy is carried out using two publicly available datasets having cancer as the outcome variable and expression levels of thousands of genes as predictors. Selective voting leads to statistically significant increases in accuracy from 86.0% to 89.8% (p