Bayesian dimension reduction models for microarray data

  • Author: Albert D. Shieh
  • Affiliation: Department of Statistics, Harvard University, Cambridge, MA
  • Venue: ICANNGA'09, Proceedings of the 9th International Conference on Adaptive and Natural Computing Algorithms
  • Year: 2009

Abstract

High dimensionality, missing values, noise, and outliers are standard problems in gene expression data and are usually dealt with separately. In this paper, we propose an ideal point model that performs feature extraction, imputes missing values, and is robust to noise and outliers within a single unsupervised framework. To obtain a parsimonious model, we make the simplifying assumption that each gene is either expressed or not expressed. We present a fast Bayesian method for estimating the large number of parameters in the ideal point model. We apply the ideal point model to a leukemia data set, where it outperforms independent component analysis (ICA), a state-of-the-art unsupervised feature extraction method.
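
The abstract does not give the model's equations, so the following is only a minimal sketch of how a binary ideal point model of this general kind is often written, not the paper's method. It assumes the standard two-parameter probit form, P(y_ij = 1) = Φ(β_j · x_i − α_j), with rows as samples and columns as genes. The threshold-based `binarize` step, the function names, and the parameter names `x`, `beta`, and `alpha` are all hypothetical. Missing entries are simply skipped in the likelihood, which is one common way such a model can accommodate missing values.

```python
import math


def binarize(expr, threshold):
    """Map each value to 1 (expressed) or 0 (not expressed).

    Assumption: a fixed threshold decides the call. Missing values
    (None) are kept as None.
    """
    return [[None if v is None else int(v > threshold) for v in row]
            for row in expr]


def normal_cdf(z):
    """Standard normal CDF, computed from the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def log_likelihood(y, x, beta, alpha):
    """Probit ideal point log-likelihood over observed entries only.

    y[i][j]  binary call for sample i and gene j (None if missing)
    x[i]     latent position of sample i (list of d floats)
    beta[j]  discrimination vector of gene j (list of d floats)
    alpha[j] difficulty (offset) of gene j
    """
    total = 0.0
    for i, row in enumerate(y):
        for j, y_ij in enumerate(row):
            if y_ij is None:
                continue  # missing entries contribute nothing
            eta = sum(b * xi for b, xi in zip(beta[j], x[i])) - alpha[j]
            # Clamp to avoid log(0) for extreme values of eta.
            p = min(max(normal_cdf(eta), 1e-12), 1.0 - 1e-12)
            total += math.log(p) if y_ij == 1 else math.log(1.0 - p)
    return total


# Toy data: 2 samples, 3 genes, two missing entries.
expr = [[2.0, None, -1.0],
        [0.5, 3.0, None]]
y = binarize(expr, threshold=1.0)
x = [[0.0], [0.0]]             # one latent dimension per sample
beta = [[1.0], [1.0], [1.0]]
alpha = [0.0, 0.0, 0.0]
ll = log_likelihood(y, x, beta, alpha)
```

In a Bayesian treatment, priors would be placed on `x`, `beta`, and `alpha`, and the fitted latent positions `x` would serve as the extracted low-dimensional features. The estimation procedure itself is not described in the abstract and is not sketched here.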