دورية أكاديمية

Modeling considerations for using expression data from multiple species.

التفاصيل البيبلوغرافية
العنوان: Modeling considerations for using expression data from multiple species.
المؤلفون: Siewert, Elizabeth1,2, Kechris, Katerina J.1
المصدر: Statistics in Medicine. Oct2013, Vol. 32 Issue 23, p4057-4070. 14p.
مستخلص: Although genome-wide expression data sets from multiple species are now more commonly generated, there have been few studies on how to best integrate this type of correlated data into models. Starting with a single-species, linear regression model that predicts transcription factor binding sites as a case study, we investigated how best to take into account the correlated expression data when extending this model to multiple species. Using a multivariate regression model, we accounted for the phylogenetic relationships among the species in two ways: (i) a repeated-measures model, where the error term is constrained; and (ii) a Bayesian hierarchical model, where the prior distributions of the regression coefficients are constrained. We show that both multiple-species models improve predictive performance over the single-species model. When compared with each other, the repeated-measures model outperformed the Bayesian model. We suggest a possible explanation for the better performance of the model with the constrained error term. Copyright © 2013 John Wiley & Sons, Ltd. [ABSTRACT FROM AUTHOR]
قاعدة البيانات: Academic Search Index