Robust semiparametric mixing for detecting differentially expressed genes in microarray experiments

Authors:
Marco Alfò;Alessio Farcomeni;Luca Tardella
Affiliations:
Dipartimento di Statistica, Probabilití e Statistiche Applicate, Universití di Roma "La Sapienza", P.le Aldo Moro, 5 00185 Rome, Italy;Dipartimento di Statistica, Probabilití e Statistiche Applicate, Universití di Roma "La Sapienza", P.le Aldo Moro, 5 00185 Rome, Italy;Dipartimento di Statistica, Probabilití e Statistiche Applicate, Universití di Roma "La Sapienza", P.le Aldo Moro, 5 00185 Rome, Italy
Venue:
Computational Statistics & Data Analysis
Year:
2007

Citing 4
Cited 0

Identification of interaction patterns and classification with applications to microarray data

Computational Statistics & Data Analysis
Efficient design and analysis of two colour factorial microarray experiments

Computational Statistics & Data Analysis
Identifying differentially expressed genes in unreplicated multiple-treatment microarray timecourse experiments

Computational Statistics & Data Analysis
Nearest neighbor pattern classification

IEEE Transactions on Information Theory

Quantified Score

Hi-index	0.03

Visualization

Abstract

An important goal of microarray studies is the detection of genes that show significant changes in observed expressions when two or more classes of biological samples such as treatment and control are compared. Using the c-fold rule, a gene is declared to be differentially expressed if its average expression level varies by more than a constant factor c between treatment and control (typically c=2). While often used, however, this simple rule is not completely convincing. By modeling this filter, a binary variable is defined at the genexexperiment level, allowing for a more powerful treatment of the corresponding information. A gene-specific random term is introduced to control for both dependence among genes and variability with respect to the c-fold threshold. Inference is carried out via a two-level finite mixture model under a likelihood approach. Then, parameter estimates are also derived using the counting distribution under a Bayesian nonparametric approach which allows to keep under control some error rate of erroneous discoveries. The effectiveness of both proposed approaches is illustrated through a large-scale simulation study and a well known benchmark data set.