EDA-Based Logistic Regression Applied to Biomarkers Selection in Breast Cancer

  • Authors:
  • Santiago González;Victor Robles;Jose Maria Peña;Oscar Cubo

  • Affiliations:
  • Department of Computer Architecture, Universidad Politécnica de Madrid, Madrid, Spain;Department of Computer Architecture, Universidad Politécnica de Madrid, Madrid, Spain;Department of Computer Architecture, Universidad Politécnica de Madrid, Madrid, Spain;Department of Computer Architecture, Universidad Politécnica de Madrid, Madrid, Spain

  • Venue:
  • IWANN '09 Proceedings of the 10th International Work-Conference on Artificial Neural Networks: Part II: Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living
  • Year:
  • 2009

Abstract

Logistic regression (LR) is a simple and efficient supervised learning algorithm for estimating the probability of an outcome variable. It is widely accepted and used in medicine for classifying diseases from DNA microarray data. Classical LR does not perform well when applied directly to microarrays, because the number of variables exceeds the number of samples. However, good results can be obtained with this algorithm by reducing the number of genes and selecting specific variables (using filtering methods). In this contribution we propose a novel approach for fitting (penalized) LR models based on estimation of distribution algorithms (EDAs). A breast cancer dataset is used to compare both accuracy and gene selection.
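
The abstract does not specify the penalty or the EDA variant, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a ridge (L2) penalty and a simple continuous univariate Gaussian EDA: sample candidate coefficient vectors, keep the best ones, and re-estimate the sampling distribution from them. The function names, hyperparameters, and synthetic data are all hypothetical.

```python
import numpy as np

def penalized_nll(beta, X, y, lam):
    """Ridge-penalized negative log-likelihood of a logistic model.

    beta[0] is the intercept (not penalized); beta[1:] are the coefficients.
    """
    z = beta[0] + X @ beta[1:]
    # log(1 + exp(z)) computed stably via logaddexp
    nll = np.sum(np.logaddexp(0.0, z) - y * z)
    return nll + lam * np.sum(beta[1:] ** 2)

def fit_lr_eda(X, y, lam=0.1, pop_size=200, n_select=50, n_gen=150, seed=0):
    """Fit the coefficients with a univariate Gaussian EDA (assumed variant)."""
    rng = np.random.default_rng(seed)
    dim = X.shape[1] + 1
    mean, std = np.zeros(dim), np.ones(dim)
    best_beta = mean.copy()
    best_score = penalized_nll(best_beta, X, y, lam)
    for _ in range(n_gen):
        # Sample a population from the current independent Gaussian model
        pop = rng.normal(mean, std, size=(pop_size, dim))
        scores = np.array([penalized_nll(b, X, y, lam) for b in pop])
        order = np.argsort(scores)
        if scores[order[0]] < best_score:
            best_score = scores[order[0]]
            best_beta = pop[order[0]].copy()
        # Re-estimate the model from the selected (lowest-score) individuals
        elite = pop[order[:n_select]]
        mean = elite.mean(axis=0)
        std = elite.std(axis=0) + 1e-6  # small floor avoids a zero-variance model
    return best_beta, best_score

# Synthetic stand-in data (not the breast cancer dataset from the paper)
rng = np.random.default_rng(1)
X_demo = rng.normal(size=(60, 5))
y_demo = (2.0 * X_demo[:, 0] - 2.0 * X_demo[:, 1] > 0).astype(float)

beta_hat, score = fit_lr_eda(X_demo, y_demo)
pred = (beta_hat[0] + X_demo @ beta_hat[1:] > 0).astype(float)
accuracy = float(np.mean(pred == y_demo))
```

A ridge penalty shrinks coefficients but does not set them exactly to zero, so with this particular assumption any gene selection would have to rank genes by coefficient magnitude; a sparsity-inducing penalty would behave differently.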