A Robust Meta-classification Strategy for Cancer Diagnosis from Gene Expression Data

  • Authors:
  • Gabriela Alexe;Gyan Bhanot;Babu Venkataraghavan;Ramakrishna Ramaswamy;Jorge Lepre;Arnold J. Levine;Gustavo Stolovitzky

  • Affiliations:
  • IBM T.J. Watson Research Center;IBM T.J. Watson Research Center;Institute for Advanced Study;Institute for Advanced Study;IBM T.J. Watson Research Center;Institute for Advanced Study and Robert Wood Johnson School of Medicine and Dentistry;IBM T.J. Watson Research Center

  • Venue:
  • CSB '05 Proceedings of the 2005 IEEE Computational Systems Bioinformatics Conference
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

One of the major challenges in cancer diagnosis from microarray data is to develop robust classification models which are independent of the analysis techniques used and can combine data from different laboratories. We propose a meta-classification scheme which uses a robust multivariate gene selection procedure and integrates the results of several machine learning tools trained on raw and pattern data. We validate our method by applying it to distinguish diffuse large B-cell lymphoma (DLBCL) from follicular lymphoma (FL) on two independent datasets: the HuGeneFL Affmetrixy dataset of Shipp et al. (www. genome.wi.mit.du/MPR /lymphoma) and the Hu95Av2 Affymetrix dataset (DallaFaveraýs laboratory, Columbia University). Our meta-classification technique achieves higher predictive accuracies than each of the individual classifiers trained on the same dataset and is robust against various data perturbations. We also find that combinations of p53 responsive genes (e.g., p53, PLK1 and CDK2) are highly predictive of the phenotype.