Identification of interaction patterns and classification with applications to microarray data
Computational Statistics & Data Analysis
Efficient design and analysis of two colour factorial microarray experiments
Computational Statistics & Data Analysis
Computational Statistics & Data Analysis
Nearest neighbor pattern classification
IEEE Transactions on Information Theory
Hi-index | 0.03 |
An important goal of microarray studies is the detection of genes that show significant changes in observed expressions when two or more classes of biological samples such as treatment and control are compared. Using the c-fold rule, a gene is declared to be differentially expressed if its average expression level varies by more than a constant factor c between treatment and control (typically c=2). While often used, however, this simple rule is not completely convincing. By modeling this filter, a binary variable is defined at the genexexperiment level, allowing for a more powerful treatment of the corresponding information. A gene-specific random term is introduced to control for both dependence among genes and variability with respect to the c-fold threshold. Inference is carried out via a two-level finite mixture model under a likelihood approach. Then, parameter estimates are also derived using the counting distribution under a Bayesian nonparametric approach which allows to keep under control some error rate of erroneous discoveries. The effectiveness of both proposed approaches is illustrated through a large-scale simulation study and a well known benchmark data set.