3165

Towards clinically useful individual regional brain atrophy rates: bridging long- and short-term longitudinal volume change estimates
Jonas Richiardi1,2, Bénédicte Maréchal1,2,3, Ricardo Corredor1,2,3, Mazen Fouad A-wali Mahdi2, Reto Meuli1, and Tobias Kober1,2,3

1Department of Medical Radiology, Lausanne University Hospital, Lausanne, Switzerland, 2Advanced Clinical Imaging Technology, Siemens Healthcare AG Switzerland, Lausanne, Switzerland, 3LTS5, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland

Synopsis

Several brain diseases show prominent, regionally-specific atrophy over time, which is often predictive of patient outcomes. However, reliable estimation of atrophy rates typically relies on repeated observations over several years. This is incompatible with clinical practice in terms of cost and prognosis utility. Here, we propose a statistical regularization approach to dramatically improve the quality of regional atrophy estimates, using only two longitudinal measurements. We evaluate the approach on open data from 43 MR scanners and 599 subjects, showing that we halve the error in most regions, while maintaining discriminability between controls and Alzheimer patients.

Introduction

MRI-visible atrophy of several brain structures is a hallmark of progression for many prevalent diseases such as Alzheimer’s disease1 or multiple sclerosis2. To evaluate whether atrophy rate is within normal limits, one can construct reference ranges, typically using large longitudinal datasets. However, this “long term” reference range, which yields high quality estimates by using more than two images, often years apart, often does not match “short term” estimates from single pairs of images a few months apart, which exhibit much more variance due to being estimated from few images. Solutions include directly estimating reference ranges from the noisy “short term” estimates3, or combining several short-interval scans to reduce variability4, but this is impractical in routine clinical settings. Here, we propose an alternative, whereby we use “long term” estimates to improve “short term” estimates.

Materials and Methods

We used a subset of the TADPOLE dataset5, comprising 1.5T ADNI6 T1-weighted MPRAGE images with uniform preprocessing (gradwarp correction, B1 correction, and N3) from 178 controls (CN), 284 patients with mild cognitive impairment (MCI), and 138 AD patients, distributed between 43 ADNI sites (Figure 1), and up to 9 longitudinal acquisitions. We registered each subject’s images to their first time point using affine registration. Focusing on regions of particular interest for dementia, we computed volumes for cortical gray matter, brain, ventricles, hippocampus, putamen, and left and right temporal lobe gray matter with the MorphoBox prototype7. We also used diagnosis at baseline, sex, and age at time of acquisition.

We assumed log-linearity of atrophy due to the elderly study sample (IQR 71-80 years). Using only the CN subjects, we trained a mixed-effects model with random intercepts and random slopes - in lme48 notation: log(volume) ~ 1 + Age + Sex + (1 + Age | Subject), which accounts for longitudinal measurements. “Long-term” atrophy rate in percent change per year is then directly given by the coefficient for Age. We used a parametric bootstrap with 200 resamplings to establish 5th/95th percentiles around the Age coefficient, yielding a reference range for atrophy.

For each region, relative percent change $$$p$$$ was computed as $$$p=\frac{V_{new}-V_{old}}{V_{old}}$$$, from which naïve “short-term” annual atrophy rates $$$r$$$ were computed as $$$r=\frac{p}{d}365$$$, with $$$d$$$ the time in days between the two acquisitions.

To obtain the regularized estimate $$$r_{reg}$$$, we used a shrinkage estimator of the rate of atrophy, thus trading lower variance for increased bias: $$$r_{reg}=(1-\lambda) r + \lambda m$$$, where $$$m$$$ is the fixed-effect coefficient for Age in the long-term random effects model trained on healthy controls. Equivalently, $$$m$$$ is the mean of the distribution of subject-specific long-term atrophy estimates. $$$\lambda$$$ can be computed analytically from the variance of the random effects and the total measurement variance. This could also be set in cross-validation by minimizing mean absolute error (MAE). Here, without loss of generality, we set $$$ \lambda = 0.5$$$ a priori for all structures.

Results

Long-term reference range

On average, the “long-term” estimates for atrophy rate across regions for controls stood between -0.5%/year and -0.2%/year (excluding ventricles). The hippocampus had more marked atrophy, estimated between -0.6%/year and -0.4%/year. Ventricular enlargement was estimated between 2.8% and 3.5% per year (Figure 2). These figures are in line with biological plausibility and extant literature31.

Error reduction

The shrunk short-term estimators of atrophy had approximately half the error (as measured by MAE) than the naïve short-term estimators (Figure 3). In all regions, the shrunk estimators on average moved towards the long-term estimator. For the ventricles, the mean absolute error was reduced from 4.5% to 2.2% by the shrunk short term estimator, bringing the estimation error below the long-term hypertrophy rate estimate (4.3% annual volume change). This suggests that the estimator can be used to distinguish between artefactual variation (apparent volume change) and actual ventricular enlargement, although other regions still have estimation error above the expected annual change. This trend held across diagnosis status (Figure 4).

Clinical discrimination

To evaluate atrophy rates as an imaging marker, we computed the area under the receiver-operating characteristic (ROC) curve (AUC) for the discrimination between CN and AD subjects, using image pairs more than 364 days apart. Consistent with the literature, hippocampal atrophy is the best predictor (AUC 75%). Other regions give worse performance but are significantly above chance (Figure 5).

Conclusions

We have proposed an improved estimator for short-term atrophy estimation that significantly outperforms the naïve estimator and is easily implemented. This improved estimation means that atrophy/hypertrophy information can become more relevant in clinical settings, as shown here in particular for the ventricular system where physiological expansion is substantial in early stages of AD.

Acknowledgements

Data collection and sharing for this study was funded by the ADNI (http://adni.loni.ucla.edu, PI: Michael Weiner; NIH grant U01 AG024904)

References

1. Smith, S. M. et al. Longitudinal and cross-sectional analysis of atrophy in Alzheimer’s disease: Cross-validation of BSI, SIENA and SIENAX. NeuroImage 36, 1200–1206 (2007).

2. Fox, R. J. et al. Phase 2 Trial of Ibudilast in Progressive Multiple Sclerosis. N. Engl. J. Med. 379, 846–855 (2018).

3. Ledig, C., Schuh, A., Guerrero, R., Heckemann, R. A. & Rueckert, D. Structural brain imaging in Alzheimers disease and mild cognitive impairment: biomarker analysis and shared morphometry database. Sci. Rep. 8, (2018).

4. Schott, J. M. et al. Combining short interval MRI in Alzheimer’s disease. J. Neurol. 253, 1147–1153 (2006).

5. Marinescu, R. V. et al. TADPOLE Challenge: Prediction of Longitudinal Evolution in Alzheimer’s Disease. (2018).

6. Jack, C. R. et al. Update on the magnetic resonance imaging core of the Alzheimer’s disease neuroimaging initiative. Alzheimers Dement. 6, 212–220 (2010).

7. Schmitter, D. et al. An evaluation of volume-based morphometry for prediction of mild cognitive impairment and Alzheimer’s disease. NeuroImage Clin. 7, 7–17 (2015).

8. Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting Linear Mixed-Effects Models Using lme4. J. Stat. Softw. 67, (2015).

Figures

Demographic data for the study. CN: control normals. MCI: mild cognitive impairment. AD: Alzheimer's disease. IQR: interquartile range.

5th/95th percentile parametric bootstrap confidence intervals for “long-term” estimates of annual atrophy rates (left, panel A) and hypertrophy rates (right, panel B) in control normals. Note the relatively more pronounced atrophy in the hippocampus, regional differences in confidence interval width, as well as the pronounced hypertrophy in ventricles.

Atrophy rate estimates and errors across 7 brain structures, all diagnoses together. MAE: mean absolute error. The average shrunk “short-term” estimates (computed from two images and regularized) are closer to the long-term estimate (computed using up to 9 images over several years) than the average naïve short-term estimates. Shrunk estimates approximately half the error of the naïve short-term estimates.

Relative improvement for the shrunk short-term estimator compared to the naïve short-term estimator, in the ventricles (left) and hippocampus (right). Each case shows the percent relative error reduction for the shrunk estimate obtained from a pair of images within the same subject. Most subjects and longitudinal image pairs have improved atrophy rate estimates, irrespective of diagnosis.. Note the horizontal axis is cut. Other brain structures show similar results, with a distribution mode around 100% and improvements for all diagnosis groups.

Discrimination performance when using short-term shrunk atrophy estimates as imaging markers to distinguish control normals from AD patients. AUC: area under the ROC curve. Bars indicate the confidence interval for the AUC value. The 2422 image pairs used for this evaluation are >364 days apart. All estimates perform above chance.

Proc. Intl. Soc. Mag. Reson. Med. 27 (2019)
3165