The most common analysis of structural brain MRIs involves massively univariate modelling. Such analyses separately approach different levels of resolution (whole brain, regional, and voxel) and do not provide an easy solution to understanding whether some areas of the brain are more or less affected than others. Here we explore applying hierarchical bayesian modelling to simultaneously analyze brain MRI studies at multiple levels of resolution while allowing for the explicit interrogation of whether brain areas are differentially affected. In addition, we show that hierarchical modelling provides improved parameter recapture, sign error rate, and model fit.
We test our models on volumes derived from automatically segmented T2 MRI scans of C57BL/6J mouse brains. Most are publicly available from the Ontario Brain Institute (https://www.braincode.ca/content/open-data-releases). We examined five studies with more than 20 male and 20 female mice to assess reliability of sex-difference estimation. Estimates were compared against the estimated sex-differences from all 587 C57BL/6J mice.
Models:
We examine two variants of the no-pooling model. Models shown in lme4 mixed-effects syntax where appropriate. All models are fit via rstan3 and rstanarm4.
We examine three bayesian hierarchical models:
Metrics:
We compare the six models across the following dimensions:
In this work we show the advantage of bayesian hierarchical posterior
distributions (fig 1. left) for performing inference across structures.
We show how trivial it is to compare posterior estimates (fig 2.
right), respecting uncertainty without multiplicity correction.
The results show that the bayesian models uniformly outperform the no-pooling models in terms of model fit (fig. 2). The information pooling of effects across structures, and intercepts across individuals substantially improves the mean-squared-error of model fit. Adding the whole brain volume as a covariate substantially remedies the performance deficit (especially fig. 2 panel “all”).
Gold-standard effect recovery for hierarchical models was superior to no-pooling (fig. 3), having higher correlations and lower mean-squared-error. With brain volume covaried, no-pooling performs comparably with the bayesian models, although the effect diffusion tree performs marginally better in both domains. The same holds for predictive performance (fig. 4) except, no-pooling with brain volume covaried performs better on one study and the full data-set. These two cases have larger sample sizes, potentially indicating reduced benefit from pooling with increased data.
The bayesian models have marginally reduced sign error rates (fig. 5), relative to the no-pooling model, although comparable to the no-pooling with brain volume covaried. Again the effect diffusion tree performs best.
1. Lein, Ed S, Michael J Hawrylycz, Nancy Ao, Mikael Ayres, Amy Bensinger, Amy Bernard, Andrew F Boe, et al. “Genome-Wide Atlas of Gene Expression in the Adult Mouse Brain.” (2007). Nature 445 (7124). Nature Publishing Group: 168.
2. Gelman, Andrew, Jennifer Hill, and Masanao Yajima. "Why we (usually) don't have to worry about multiple comparisons." Journal of Research on Educational Effectiveness 5.2 (2012): 189-211.
3. Stan Development Team. "Rstan: the R interface to Stan". (2018) R package version 2.18..
4. Goodrich B, Gabry J, Ali I & Brilleman S. (2018). rstanarm: Bayesian applied regression modeling via Stan. R package version 2.17.4. http://mc-stan.org/.
5. Vehtari A, Gelman A, Gabry J (2017). “Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC.” Statistics and Computing, *27*, 1413-1432. doi: 10.1007/s11222-016-9696-4 (URL:http://doi.org/10.1007/s11222-016-9696-4).