A model-free ensemble method for class prediction with application to biomedical decision making

  • Authors:
  • Ralph L. Kodell;Bruce A. Pearce;Songjoon Baek;Hojin Moon;Hongshik Ahn;John F. Young;James J. Chen

  • Affiliations:
  • Department of Biostatistics, #781, University of Arkansas for Medical Sciences, 4301 W. Markham St., COPH 3218, Little Rock, AR 72205, United States;Information Technology Staff, National Center for Toxicological Research, Jefferson, AR 72079, United States;Division of Personalized Nutrition and Medicine, National Center for Toxicological Research, Jefferson, AR 72079, United States;Department of Mathematics and Statistics, California State University - Long Beach, Long Beach, CA 90840, United States;Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY 11794, United States;Division of Personalized Nutrition and Medicine, National Center for Toxicological Research, Jefferson, AR 72079, United States;Division of Personalized Nutrition and Medicine, National Center for Toxicological Research, Jefferson, AR 72079, United States

  • Venue:
  • Artificial Intelligence in Medicine
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Objective: A classification algorithm that utilizes two-dimensional convex hulls of training-set samples is presented. Methods and material: For each pair of predictor variables, separate convex hulls of positive and negative samples in the training set are formed, and these convex hulls are used to classify test points according to a nearest-neighbor criterion. An ensemble of these two-dimensional convex-hull classifiers is formed by trimming the "mC"2 possible classifiers derived from the m predictors to a set of classifiers comprised of only unique predictor variables. Because only two-dimensional spaces are required to be populated by training-set samples, the ''curse of dimensionality'' is not an issue. At the same time, the power of ensemble voting is exploited by combining the classifications of the unique two-dimensional classifiers to reach a final classification. Results: The algorithm is illustrated by application to three publicly available biomedical data sets with genomic predictors and is shown to have prediction accuracy that is competitive with a number of published classification procedures. Conclusion: Because of its superior performance in terms of sensitivity and negative predictive value compared to its competitors, the convex-hull ensemble classifier demonstrates good potential for medical screening, where often the major emphasis is placed on having reliable negative predictions.