
Title
Bayesian Robust Inference for Differential Gene Expression
Speaker
Raphael Gottardo, University of Washington
Abstract
In this talk, I will consider the problem of identifying differentially expressed genes under different conditions using gene expression microarrays. Because of the many steps involved in the experimental process, from hybridization to image analysis, cDNA microarray data often contain outliers. For example, an outlying data value could occur because of scratches or dust on the surface, imperfections in the glass, or imperfections in the array production. We develop a robust Bayesian hierarchical model for testing for differential expression. Errors are modeled explicitly using a t-distribution, which accounts for outliers. The model includes an exchangeable prior for the variances, which allow different variances for the genes but still shrink extreme empirical variances. Our model can be used for testing for differentially expressed genes among multiple samples, and it can distinguish between the different possible patterns of differential expression when there are three or more samples. Parameter estimation is carried out using a novel version of Markov chain Monte Carlo that is appropriate when the model puts mass on subspaces of the full parameter space. The method will be illustrated using a publicly available gene expression data set. We will compare our method to six other baseline and commonly used techniques, namely the t-test, the Bonferroni-adjusted t-test, Significance Analysis of Microarrays (SAM), Efron's empirical Bayes, and EBarrays in both its Lognormal-Normal and Gamma-Gamma forms. Our method performs better than these alternatives, on the basis of between-replicate agreement and disagreement.
Joint with Adrian E. Raftery, Ka Yee Yeung and Roger Bumgarner.
Meet the speaker in Room 212 Cockins Hall at 4:30 p.m. Refreshments will be served.