Keywords: Segmentation, Machine Learning/Artificial Intelligence
Motivation: Accurate segmentation of the hippocampus provides an important biomarker in neurodegenerative diseases, e.g., Alzheimer’s disease. However, currently available tools are not robust to disease-related atrophy.
Goal(s): We aim to demonstrate the accuracy of our InnerEye hippocampal segmentation tool on clinical data.
Approach: We fine-tuned our existing model on manually segmented data and externally validated the model on a clinical dataset of patients referred to a dementia clinic. We compare our model to three commonly used segmentation tools.
Results: Our model provides significant improvements over currently available tools when tested on an external, clinical dataset.
Impact: The hippocampal segmentation model presented in this work provides significant improvements over currently available tools in an external, clinical dataset. Segmentation performance was increased, while run-times were decreased. These results support the tool as a viable alternative in clinical settings.
The InnerEye software is open source and can be found at https://github.com/microsoft/InnerEye-DeepLearning. AS, MGS, FBJW, JST and TAY are supported by the NIHR Biomedical Research Centre at UCLH. AS is supported by Engineering and Physical Sciences Research Council (EPSRC), Impact Acceleration Account (IAA) 2022-25. Data collection and sharing for this project was funded by the Alzheimer's Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: AbbVie, Alzheimer's Association; Alzheimer's Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen; Bristol-Myers Squibb Company; CereSpir, Inc.; Cogstate; Eisai Inc.; Elan Pharmaceuticals, Inc.; Eli Lilly and Company; Euro Immun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Lumosity; Lundbeck; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; Neuro Rx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Takeda Pharmaceutical Company; and Transition Therapeutics. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer's Therapeutic Research Institute at the University of Southern California. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California.
1. Jack CR, Petersen RC, Xu Y, O’Brien PC, Smith GE, Ivnik RJ, et al. Rates of hippocampal atrophy correlate with change in clinical status in aging and AD. Neurology. 2000 Aug 22;55(4):484–90.
2. Schroder A, Moggridge J, Wu J, Salhab HA, Vos S, Bristow M, et al. InnerEye as a Tool for Accurate Hippocampal Segmentation. In: Proceedings of the Annual Meeting of ISMRM. Toronto; 2023.
3. Henschel L, Conjeti S, Estrada S, Diers K, Fischl B, Reuter M. FastSurfer - A fast and accurate deep learning based neuroimaging pipeline. NeuroImage. 2020 Oct 1;219:117012.
4. Oktay O, Nanavati J, Schwaighofer A, Carter D, Bristow M, Tanno R, et al. Evaluation of Deep Learning to Augment Image-Guided Radiotherapy for Head and Neck and Prostate Cancers. JAMA Netw Open. 2020 Nov 30;3(11):e2027426.
5. Hsu YY, Schuff N, Du AT, Mark K, Zhu X, Hardin D, et al. Comparison of automated and manual MRI volumetry of hippocampus in normal aging and dementia. J Magn Reson Imaging JMRI. 2002 Sep;16(3):305–10.
6. Fischl B, Salat DH, Busa E, Albert M, Dieterich M, Haselgrove C, et al. Whole Brain Segmentation: Automated Labeling of Neuroanatomical Structures in the Human Brain. Neuron. 2002 Jan 31;33(3):341–55.
7. Winston GP, Cardoso MJ, Williams EJ, Burdett JL, Bartlett PA, Espak M, et al. Automated hippocampal segmentation in patients with epilepsy: Available free online. Epilepsia. 2013;54(12):2166–73.
8. Boccardi M, Bocchetta M, Apostolova LG, Barnes J, Bartzokis G, Corbetta G, et al. Delphi definition of the EADC‐ADNI Harmonized Protocol for hippocampal segmentation on magnetic resonance. Alzheimers Dement. 2015 Feb;11(2):126–38.
9. Boccardi M, Bocchetta M, Morency FC, Collins DL, Nishikawa M, Ganzola R, et al. Training labels for hippocampal segmentation based on the EADC‐ADNI harmonized hippocampal protocol. Alzheimers Dement. 2015 Feb;11(2):175–83.
10. Jenkinson M, Beckmann CF, Behrens TEJ, Woolrich MW, Smith SM. FSL. NeuroImage. 2012 Aug;62(2):782–90.
Table 1: Data splits for the ADNI datasets. The final three columns provide the percentage of subjects in each data split with each clinical diagnosis: dementia, mild cognitive impairment (MCI) and cognitively normal (CN). We show data splits for the pre-training dataset used to train our original model2, and the two ADNI datasets used in this work: ADNI A and ADNI B.
Figure 1: Internal validation of InnerEye against manual segmentations in the ADNI B dataset.
Figure 2: External clinical validation of InnerEye against manual segmentations in the NHNN dataset.
Figure 3: Qualitative analysis of model performance in two scenarios from the external clinical dataset: a) where we observe “typical” performance from each of the models (Dice scores are within the middle 50th percentile for each model’s Dice scores); and b) where fine-tuned InnerEye is marginally outperformed by another model (HIPPOSEG). Blue segmentations represent the manual segmentations, and model segmentations are represented by colours consistent with Figures 1 and 2: fine-tuned InnerEye in orange, FreeSurfer in green, FastSurfer in red, and HIPPOSEG in purple.
Table 2: Average run-time for each model. Run-times were averaged across all subjects in the external validation dataset. FreeSurfer and FastSurfer perform full brain segmentation, while InnerEye and HIPPOSEG are hippocampal specific.