Robust semiparametric mixing for detecting differentially expressed genes in microarray experiments

  • Authors:
  • Marco Alfò;Alessio Farcomeni;Luca Tardella

  • Affiliations:
  • Dipartimento di Statistica, Probabilití e Statistiche Applicate, Universití di Roma "La Sapienza", P.le Aldo Moro, 5 00185 Rome, Italy;Dipartimento di Statistica, Probabilití e Statistiche Applicate, Universití di Roma "La Sapienza", P.le Aldo Moro, 5 00185 Rome, Italy;Dipartimento di Statistica, Probabilití e Statistiche Applicate, Universití di Roma "La Sapienza", P.le Aldo Moro, 5 00185 Rome, Italy

  • Venue:
  • Computational Statistics & Data Analysis
  • Year:
  • 2007

Quantified Score

Hi-index 0.03

Visualization

Abstract

An important goal of microarray studies is the detection of genes that show significant changes in observed expressions when two or more classes of biological samples such as treatment and control are compared. Using the c-fold rule, a gene is declared to be differentially expressed if its average expression level varies by more than a constant factor c between treatment and control (typically c=2). While often used, however, this simple rule is not completely convincing. By modeling this filter, a binary variable is defined at the genexexperiment level, allowing for a more powerful treatment of the corresponding information. A gene-specific random term is introduced to control for both dependence among genes and variability with respect to the c-fold threshold. Inference is carried out via a two-level finite mixture model under a likelihood approach. Then, parameter estimates are also derived using the counting distribution under a Bayesian nonparametric approach which allows to keep under control some error rate of erroneous discoveries. The effectiveness of both proposed approaches is illustrated through a large-scale simulation study and a well known benchmark data set.