0776

Fully Automated Multivendor and Multisite Artificial Intelligence-based 3D Segmentation of the Proximal Arteries from 4D flow MRI

Haben Berhane¹, Michael Scott², Takashi Fujiwara³, Lorna Browne³, Joshua Robinson¹, Cynthia Rigsby¹, Michael Markl², and Alex Barker³
¹Lurie Children's Hospital of Chicago, Chicago, IL, United States, ²Northwestern University, Chicago, IL, United States, ³University of Colorado, Anschutz Medical Campus, Aurora, CO, United States

Synopsis

We trained and validated a multi-label convolutional neural network for the segmentation of the aorta and pulmonary arteries from 4D flow MRI for rapid flow analysis across multiple vendors and centers. Using 67 whole-heart 4D flow MRI scans, including 29 with cardiac pathologies, across two institutions and vendors, we trained and tested our CNN using 10-fold cross validation. For flow analysis, We calculated net flow, peak velocity, and Qp-Qs. Across all flow metrics, we found that automated segmentations showed moderate to strong agreement with the manual segmentations, while taking a fraction of the time.

Introduction

4D Flow MRI provides comprehensive characterization and quantification of hemodynamics in the presence of pediatric congenital heart disease, however the analysis can be cumbersome, requiring extensive and time-consuming manual segmentation. As such, an accurate, automated segmentation algorithm could simplify hemodynamic quantification and possibly improve flow analysis interobserver variability. While previous attempts have demonstrated the ability to accurately and rapidly provide automated segmentations of the thoracic aorta from 4D flow MRI [1], we seek to build on these developments by including: 1) multi-label segmentation of the aorta and pulmonary arteries on, 2) multiple vendor datasets from, 3) multiple sites. With rapid and accurate segmentation of the aorta and pulmonary arteries, 4D flow MRI data can be analyzed quickly across multiple sites. In this study, we trained and validated a multi-label convolutional neural network (CNN) for the segmentation of the aorta and pulmonary arteries (PA) from 4D flow MRI for rapid flow analysis across multiple vendors and centers.

Methods

This retrospective pilot study used 67 whole-heart 4D flow datasets from Site 1: Lurie Children’s Hospital (Lurie), and Site 2: Children’s Hospital of Colorado (CHCO), with patients ranging from infants to young adults (Lurie: N=42, age=15 [2-25] years, TR=40.8-42.8 ms, venc=80-220 cm/s, spatial resolution=1.89-2.38x1.8-2.38x1.9-2.8 mm3, Siemens; CHCO: N=25, age= 14 [3-28] years, TR=37-75 ms, venc=150 cm/s, spatial resolution=1.25-2.34x1.25-2.34x1.56-2.5 mm3, Philips). Overall, the cohort included 37 controls and 29 patients with cardiac pathologies (including 15 tetralogy of Fallot [TOF]). The workflow is described in Figure 1. Each 4D flow dataset underwent standard 4D flow pre-processing. 4D flow-derived 3D phase contrast MR angiograms (PCMRA, Figure 1B) were used to perform manual 3D segmentations of the aorta and the PA (Mimics, Materialise; 3D Slicer). The manually-generated 3D segmentations served as a ground-truth for training and testing while the PCMRAs were inputs to the CNN (Figure 1C). A 10-fold cross validation was used, allowing every dataset to be used for testing (Figure 1D).

The CNN used was a 3D UNet with DenseNet blocks replacing the traditional convolution layers (Figure 2) [2, 3]. Each DenseNet block consisted of a series of 3D convolutions, batch normalization, and a linear rectified unit, applied n number of times, with increasing frequency at deeper sections of the CNN. After every convolution layer, the feature maps were concatenated together and served as inputs for subsequent layers, in order to efficiently reuse feature maps throughout the CNN. Max-pooling was applied to down-sample the feature maps to obtain more generic features, while transposed-convolution was used for up-sampling. After the final convolution layer, a softmax function was used to generate a probability map across all voxels. A multi-labeled dice loss and softmax cross entropy were used as a composite loss function throughout training. A dropout rate of 0.1 was applied after every convolution layer in order to prevent overfitting. An Adam optimizer was used, and the learning rate was kept constant at 0.0001.

Quantitative flow analysis and Qp-Qs calculations were performed by manually placing a plane at the ascending aorta and at the main PA (Ensight, Ansys; Figure 1E). Dice scores were calculated between the ground-truth to the automated CNN output. All values are reported as the mean±strandard deviation for normally distributed data or the median [interquartile range] otherwise. For all flow metrics, Bland-Altman and intraclass correlations (ICC) were performed between the manual and automated segmentations to assess the performance of the CNN.

Results

The CNN took on average 0.34±0.12 seconds to segment the aorta and PA compared to 15 minutes manually. The median Dice score of the entire cohort was 0.89 [0.86-0.93] for the aorta and 0.88 [0.83-0.91] for the PA. Three examples of the manual (red) and automated (blue) segmentation as well as a difference map are provided in Figure 3. Figure 3A shows an example of a Site 1 control which had Dice scores of Ao: 0.91, PA: 0.92. Figure 3B shows the results for a Site 1 TOF patient, obtaining Dice scores of Ao: 0.89, PA: 0.88, and Figure 3C showed a Site 2 patient with Dice scores of Ao: 0.91, PA: 0.93. For flow comparisons, the Bland Altman, the flow metrics, and ICC comparisons are provided in Figure 4 and Table 1, respectively. Four Site 1 and two Site 2 datasets were excluded from the flow analysis due to severe aliasing in the aorta or PA. Table 1A summarizes the flow metrics in the aorta and PA across the entire cohort. Bland-Altman plots showed moderate to good agreement across all comparisons between 10%-24% difference from the mean manually generated reference values (Figure 4), and ICC comparisons (Table 1B) similarly showed between moderate to excellent agreement, between 0.74-0.99 ICC coefficients.

Discussion

In this study, we present initial CNN results showing an automated multi-label segmentation of the aorta and PA from 4D flow MRI in patients with multiple pathologies, from two centers with different vendors. The results indicate a slight performance bias for Site 1, likely due to the data imbalance of roughly 2:1 between Site 1 and 2. Future work will incorporate more datasets and build a more balanced cohort across additional centers and vendors to improve the model.

Acknowledgements

R01HL115828

R01HL133504

F30HL145995

References

1. Berhane, H., et al., Artificial intelligence-based fully automated 3D segmentation of the aorta from 4D flow MRI. SMRA2019, 2019.

2. Çiçek, Ö., et al. 3D U-Net: Learning Dense Volumetric Segmentation from Sparse Annotation. 2016. Cham: Springer International Publishing.

3. Huang, G., et al., Densely Connected Convolutional Networks. 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017. DOI: 10.1109/cvpr.2017.243.

Figures

Figure 1: Workflow. All 4D flow data (A) underwent standard 4D flow preprocessing and 3D phase contrast (PC) MRA (B) calculation. The 3D PCMRA was manually segmented for the aorta and pulmonary arteries (PA) (C), and used as the input for the convolutional neural network (CNN) in order to generate the automated segmentations (D). Training and testing were performed through a 10-fold cross validation. Flow metrics were obtained by manually placing 2D analysis planes at the ascending aorta and main PA (E). The same plane positions were used for both the manual and automated segmentations.

Figure 2: CNN Architecture and Layer Structures. The CNN consisted of a symmetrical traditional U-Net design with the dense blocks replacing the convolution layers. Skip connections were used to retain the feature maps from earlier portions of the CNN. Feature maps were concatenated after each convolution layer and served as the input for all subsequent layers. A convolution channel size of 12 was used for every convolution layer in the dense block. Use of dense blocks regulated the growth of the CNN while efficiently reusing feature maps throughout.

Figure 3: Examples of the automated and manual segmentations as well as difference maps between them. (A) An example Site 1 control patient with Dice scores of Ao: 0.91 and PA: 0.92. (B) An example of a Site 1 Tetralogy of Fallot patient with a left pulmonary artery stent with Dice scores of Ao: 0.89 and PA: 0.88. (C) A Site 2 patient with Dice scores of Ao: 0.91 and PA: 0.93.

Figure 4: Bland-Altman plots of the net flow in the aorta and pulmonary arteries for both sites and Qp-Qs comparison for the entire cohort. Across all comparisons, the bias was low. The limits of agreement for Site 1 were within 10% difference of the mean manually generated values for the pulmonary arteries and 13% difference for the aorta. For Site 2, the limits of agreement were between 10-24% difference from the mean manually generated values. Qp-Qs comparisons across the entire cohort similarly showed limits of agreement, around 24% difference from the mean manually generated values.

Table 1: Flow metrics and intraclass correlation (ICC) between the automated and manual segmentations for the net flow in the aorta and pulmonary arteries (PA) as well as comparisons between Qp-Qs across the entire cohort. In Table 1A, the median and interquartile range for the net flow in the aorta and pulmonary arteries (PA) are provided for both Lurie and CHCO data. In Table 1B, The ICC showed excellent agreement for the net flow in the aorta and PA, with values between 0.97-0.99. The ICC for Qp-Qs comparisons showed moderate agreement, with an ICC coefficient of 0.73.

Proc. Intl. Soc. Mag. Reson. Med. 28 (2020)

0776