Inferential, robust non-negative matrix factorization analysis of microarray data

  • Authors:
  • Paul Fogel; S. Stanley Young; Douglas M. Hawkins; Nathalie Ledirac

  • Affiliations:
  • Consultant, 4 rue Le Goff, F-75005, Paris, France; National Institute of Statistical Sciences, PO Box 14006, Research Triangle Park, NC 27709-4006, USA; School of Statistics, University of Minnesota, 313 Ford Hall, 224 Church Street NE, Minneapolis, MN 55455, USA; Laboratoire de Toxicologie Cellulaire et Moléculaire, Centre de Recherche INRA, 400 Route des Chappes, 06903 Sophia-Antipolis, France

  • Venue:
  • Bioinformatics
  • Year:
  • 2007

Abstract

Motivation: Modern methods such as microarrays, proteomics and metabolomics often produce datasets with many more predictor variables than observations. Research in these areas is often exploratory; even so, there is interest in statistical methods that accurately point to effects that are likely to replicate. Correlations among predictors are used to improve the statistical analysis. We exploit two ideas: non-negative matrix factorization methods that create ordered sets of predictors; and statistical testing within each ordered set, done sequentially, which removes the need for multiple-testing correction within the set.

Results: Simulations and theory point to increased statistical power. Computational algorithms are described in detail. The analysis and biological interpretation of a real dataset are given. In addition to the increased power, a benefit of our method is that the organized gene lists are likely to lead to a better understanding of the biology.

Availability: A SAS JMP executable script is available from http://www.niss.org/irMF

Contact: young@niss.org

Supplementary information: http://www.niss.org/irMF
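The two ideas named in the abstract can be illustrated with a minimal sketch. This is not the authors' irMF algorithm or their SAS JMP script. It assumes the standard Lee-Seung multiplicative updates as the factorization and a generic fixed-sequence procedure as the sequential test. The function names (`nmf_multiplicative`, `ordered_gene_set`, `fixed_sequence_test`) and the p-values in the usage lines are hypothetical, chosen only for illustration.

```python
import numpy as np


def nmf_multiplicative(V, k, n_iter=500, seed=0, eps=1e-9):
    """Approximate a non-negative matrix V (genes x samples) as W @ H.

    Assumption: Lee-Seung multiplicative updates for the Frobenius loss,
    used here as a stand-in for the factorization step.
    """
    rng = np.random.default_rng(seed)
    n_genes, n_samples = V.shape
    W = rng.random((n_genes, k)) + eps
    H = rng.random((k, n_samples)) + eps
    for _ in range(n_iter):
        # Each update multiplies by a non-negative ratio, so W and H
        # stay non-negative throughout.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def ordered_gene_set(W, component):
    """Return gene indices sorted by decreasing loading on one component."""
    return np.argsort(-W[:, component])


def fixed_sequence_test(pvalues_in_order, alpha=0.05):
    """Test hypotheses in a pre-specified order, each at level alpha.

    Stops at the first p-value above alpha and returns the number of
    hypotheses rejected before that point.
    """
    n_rejected = 0
    for p in pvalues_in_order:
        if p > alpha:
            break
        n_rejected += 1
    return n_rejected


# Usage with made-up data: factor a small random matrix, order the genes
# on the first component, then walk hypothetical p-values in that order.
rng = np.random.default_rng(1)
V = rng.random((20, 6))
W, H = nmf_multiplicative(V, k=2)
order = ordered_gene_set(W, component=0)
hypothetical_pvalues = np.array([0.001, 0.01, 0.2, 0.003])
n_rejected = fixed_sequence_test(hypothetical_pvalues, alpha=0.05)
```

In a fixed-sequence procedure each test can be run at the full level alpha because testing stops at the first non-rejection. This is valid only when the order is fixed without reference to the outcome being tested.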