Automated brain segmentation approaches are increasingly being used for decision support in routine clinical settings. While segmentation may be considered a “solved problem” in research, it is still challenging to assure reliable performance of automated tools in clinical settings, which is a crucial requirement for diagnostic tools. To ensure correct results, automated quality control procedures are of vital importance, but they are often difficult to implement or time-consuming to run. We propose a simple and fast fully automated method to detect segmentation errors, and we evaluate its performance to detect skull-stripping-errors using results of two different brain segmentation algorithms on a large multicenter dataset. Results show that the method is able to detect skull-stripping-errors with high specificity.
The workflow is illustrated in Figure 1. The skull-stripped input image is first registered to a reference image in a template space. Here, the ICBM 2009c atlas4 was used as reference, and full affine transforms were employed5,6. Affine registration was deliberately chosen to prevent the registration from “correcting” the segmentation errors to be detected. Using the obtained transform, a binary mask of the image is transformed into the template space where it is compared to a reference mask by means of similarity metrics. Figure 2 shows the procedure using an example case. Dice coefficient and Hausdorff distance (HDD) were employed due to their complementary sensitivity to segmentation errors; in principle, however, depending on the use case, also other similarity metrics could be applied. Test data was used to define cutoff values for both metrics to classify segmentations as good or bad. The intended use for the pipeline was to reject failed segmentations prior to visual review. Thus thresholds were selected to optimize specificity and minimize false positives.
To evaluate the performance of the pipeline, 364 3D-MP-RAGE (TR=2300ms,TI=900ms,matrix-size=240x256x176;voxel-size=1×1×1mm3) and 3D-FLAIR scans (TR=5000ms,TI=1800ms,240x256x176;voxel-size=1×1×1mm3) were acquired from 181 subjects (146 MS patients, 35 healthy controls) from ten institutions using different 3T scanners (MAGNETOM Verio,Skyra,Prisma,Prismafit,Trio,Vida and BIOGRAPH mMR, all Siemens Healthcare,Erlangen,Germany) covering a large variety of brain size and anatomy. The standardized images were collected as part of the Multiple Sclerosis Partners Advancing Technology and Healthcare Solutions (MS PATHS) project7.
Brain segmentation was performed using two different approaches: A FLAIR-based algorithm (a modified version of autosegMS8,9) and an MP-RAGE-based prototype method (MorphoBox10,11). Both methods provided a skull-stripped image (autosegMS: brain outer contour volume, OCV; MorphoBox: total intracranial volume, TIV). Corresponding reference templates were used. For both methods, skull-stripping and brain segmentation quality were visually reviewed and rated as follows:
1. Storelli L, Rocca MA, Pagani E, et al. Measurement of Whole-Brain and Gray Matter Atrophy in Multiple Sclerosis: Assessment with MR Imaging. Radiology 2018:172468 doi: 10.1148/radiol.2018172468
2. Ledig C, Schuh A, Guerrero R, Heckemann RA, Rueckert D. Structural brain imaging in Alzheimer’s disease and mild cognitive impairment: biomarker analysis and shared morphometry database. Sci. Rep. 2018;8:11258 doi: 10.1038/s41598-018-29295-9
3. Alfaro-Almagro F, Jenkinson M, Bangerter NK, et al. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage 2018 doi: 10.1016/j.neuroimage.2017.10.034
4. Fonov V, Evans A, McKinstry R, Almli C, Collins D. Unbiased nonlinear average age-appropriate brain templates from birth to adulthood. Neuroimage 2009 doi: 10.1016/S1053-8119(09)70884-5
5. Klein S, Staring M, Murphy K, Viergever MA, Pluim JPW. elastix: a toolbox for intensity-based medical image registration. IEEE Trans. Med. Imaging 2010;29:196–205 doi: 10.1109/TMI.2009.2035616
6. Shamonin DP, Bron EE, Lelieveldt BPF, Smits M, Klein S, Staring M. Fast parallel image registration on CPU and GPU for diagnostic classification of Alzheimer’s disease. Front. Neuroinform. 2014;7:50
7. Multiple Sclerosis Partners Advancing Technology and Healthcare Solutions (MS PATHS). https://www.mspaths.com
8. Fisher E, Cothren RM, Tkach JA, Masaryk TJ, Cornhill JF. Knowledge-based 3D segmentation of the brain in MR images for quantitative multiple sclerosis lesion tracking. In: SPIE Proc. Medical Imaging: Image Processing. ; 1997. pp. 19–25
9. Rudick RA, Fisher E, Lee JC, Simon J, Jacobs L. Use of the brain parenchymal fraction to measure whole brain atrophy in relapsing-remitting MS. Multiple Sclerosis Collaborative Research Group. Neurology 1999;53:1698–1704
10. Schmitter D, Roche A, Maréchal B, et al. An evaluation of volume-based morphometry for prediction of mild cognitive impairment and Alzheimer’s disease. NeuroImage Clin. 2015;7:7–17 doi: 10.1016/j.nicl.2014.11.001
11. Roche A, Forbes F. Partial Volume Estimation in Brain MRI Revisited BT
- Medical Image Computing and Computer-Assisted Intervention – MICCAI 2014:
17th International Conference, Boston, MA, USA, September 14-18, 2014,
Proceedings, Part I. In: Golland P, Hata N, Barillot C, Hornegger J, Howe R,
editors. Cham: Springer International Publishing; 2014. pp. 771–778. doi:
10.1007/978-3-319-10404-1_96