4136

Automated Segmentation of Salvageable Ischemic Brain Tissue using Convolutional Neural Networks with DWI and FLAIR MRI

Ryan Andrew Rava^1,2, Kenneth V. Snyder^2,3, Muhammad Waqas^2,3, Elad I. Levy^2,3, Jason M. Davies^2,3, Adnan H. Siddiqui^2,3, and Ciprian N. Ionita^1,2,3
¹Biomedical Engineering, University at Buffalo, Buffalo, NY, United States, ²Canon Stroke and Vascular Research Center, Buffalo, NY, United States, ³Neurosurgery, University at Buffalo, Buffalo, NY, United States

Synopsis

Convolutional neural networks have the potential to predict penumbra volumes within acute ischemic stroke patients to determine their eligibility for mechanical thrombectomy based on the Defuse 3 clinical trial. Currently, computed tomography perfusion is the main method used to quantify penumbra volumes but not all stroke centers have this modality available. In this study, 2 networks were developed to automatically segment penumbra using FLAIR and DWI and performance metrics comparing each network’s predictions with ground truth penumbra (dual network: Dice=0.61, sensitivity=0.68, PPV=0.59, multi-input network: Dice=0.61, sensitivity=0.62, PPV=0.64) indicate a multi-input network is the most capable of segmenting penumbra tissue.

Introduction

Worldwide, 82.4 million individuals suffer an acute ischemic stroke (AIS) annually.¹ Following an AIS, computed tomography perfusion (CTP) is used by stroke centers to asses patient eligibility for mechanical thrombectomy through infarct (dead) and penumbra (salvageable) tissue quantification using contralateral hemisphere comparisons of perfusion parameters.² In order to assess if a patient is eligible for mechanical thrombectomy, removal of a thrombus from neurovasculature, the Defuse 3 clinical trial states the following are required: infarct<70mL, ischemic-to-infarct ratio>1.8, and penumbra>15mL.³ Since infarct and penumbra volumes can be influenced by abnormal patient specific contrast flow conditions and there is subjectivity in optimal CTP thresholds, fluid-attenuation inversion recovery (FLAIR) and diffusion-weighted imaging (DWI) may provide a better alternative for determining reperfusion eligibility.^4-6 Within FLAIR, hyper-intensified regions represent infarct, while DWI hyper-intensified regions represent the entire ischemic region when baseline imaging is conducted within 4 hours of symptom onset.⁷ In this study, we aim to assess the ability of convolutional neural networks (CNNs) to segment penumbra from AIS patients using FLAIR and DWI. This automated method could optimize clinical workflow for patients at risk for cognitive and motor deterioration compared to manual segmentation methods.

Methods

FLAIR and DWI data from 21 AIS patients were obtained for this retrospective study. FLAIR and DWI volumes were resampled from a 5mm to 0.5mm slice thickness to increase the dataset as two-dimensional slices were used to train and test the networks. Volumetric registration was used to align corresponding DWI and FLAIR volumes using MATLAB’s geometric/intensity registration tool (Figure 1).^5,8 Open source BrainSuite software was used to remove high intensity skin regions around the brain in FLAIR, simplifying the network segmentation of the FLAIR lesion (Figure 2).⁹

Following pre-processing, 2 individual approaches were conducted to segment penumbra. The first approach trained 2 separate CNNs, one to segment FLAIR infarct and the other to segment DWI ischemic tissue, and a subtraction of the segmented lesions was taken to isolate penumbra. For each network, a 60:10:30 training:validation:testing split was conducted to prevent overfitting, allocating 13 (1,726 slices), 2 (288 slices), and 6 (863 slices) patients to the training, validation, and testing sets respectively. Training and testing was conducted on slices to increase the network dataset while the split was conducted on patients so penumbra volumes could be reconstructed following network predictions for each patient. A modified U-net architecture was utilized with 2 down-sampling processes, 1 bottom process, and 2 up-sampling processes along with a batch size of 16 and an Adadelta optimizer. FLAIR and DWI slices containing infarct were used as the inputs to their respective networks. Infarct and ischemic tissue volumes were reconstructed from tested slices and compared with their respective ground truth volumes. Infarct predictions were subtracted from ischemic tissue predictions to obtain penumbra predictions which were compared with ground truth penumbra volumes. Comparison metrics computed include: Dice coefficients, sensitivity, positive-predictive-value (PPV), infarct/ischemic tissue/penumbra volume differences, and Spearman correlation coefficients (SCC). 20 iterations of Monte Carlo cross-validation (MCCV) were conducted to assess network variability.

The second approach used to segment penumbra was a multi-input CNN which allocated FLAIR and DWI slices to separate color channels of a matrix to be used as the network input. Training:validation:testing splits, network architecture, batch size, and optimizer all remained the same for the second approach from the first. Output predictions were two-dimensional penumbra segmentations and volumes of penumbra were reconstructed for each patient in the testing set from the slice predictions. Predicted penumbra volumes were compared with ground truth penumbra volumes using the same previously stated metrics. Metrics from the two separate processes were compared to assess potential superiority of one approach over the other.

Results

Table 1 indicates 95% confidence intervals for each network’s segmentation metrics over 20 iterations of MCCV. Figure 3 demonstrates two-dimensional slices of each network process’s performance when segmenting penumbra. Figure 4 indicates three-dimensional overlap of predicted and ground truth penumbra using both network processes. Each volume prediction took 6.7 seconds for the dual network and 2.2 seconds for the multi-input network compared to 10 minutes for manual DWI and FLAIR segmentations.

Discussion

Results indicate similar spatial overlap of estimated penumbra volumes for each network approach based on Dice coefficients. PPVs indicates the multi-input network is less likely to overestimate volumes of penumbra as PPV indicates the portion of the predicted penumbra volume within the ground truth volume. Additionally, a lower sensitivity for the multi-input network indicates less true positive predicted penumbra voxels, but similar Dice values indicate the false positive and false negative predictions equal each other out for the two network processes. Penumbra volume differences indicate the multi-input network to be superior since there is only a slight overestimation of the penumbra volume compared with the dual network method. Using either however would allow for an increase in the number of patients deemed as thrombectomy eligible which is preferred to help as many individuals as possible regain lost neurological function.

Conclusion

CNNs can be utilized to predict the amount of penumbra present in AIS patients to help determine mechanical thrombectomy eligibility. CNNs additionally provide an automated penumbra segmentation method to improve clinical workflow for patients at risk of permanent neurological deficits.

Acknowledgements

No acknowledgement found.

References

1. Benjamin EJ, Muntner P, Bittencourt MSJC. Heart disease and stroke statistics-2019 update: a report from the American Heart Association. 2019;139:e56-e528.

2. Miles K, Eastwood JD, Konig M. Multidetector computed tomography in cerebrovascular disease: CT perfusion imaging. United Kingdom: CRC Press, 2007.

3. Albers GW, Marks MP, Kemp S, et al. Thrombectomy for stroke at 6 to 16 hours with selection by perfusion imaging. New England Journal of Medicine. 2018;378:708-718.

4. Rava R, Snyder K, Mokin M, et al. Assessment of a Bayesian Vitrea CT Perfusion Analysis to Predict Final Infarct and Penumbra Volumes in Patients with Acute Ischemic Stroke: A Comparison with RAPID. American Journal of Neuroradiology. 2020.

5. Rava R, Snyder KV, Mokin M, et al. Assessment of Computed Tomography Perfusion Software in Predicting Spatial Location and Volume of Infarct in Acute Ischemic Stroke Patients: A Comparison of Sphere, Vitrea, and RAPID. Journal of Neurointerventional Surgery. 2020.

6. Legrand L, Tisserand M, Turc G, et al. Do FLAIR vascular hyperintensities beyond the DWI lesion represent the ischemic penumbra? American Journal of Neuroradiology. 2015;36:269-274.

7. Ho KC, Speier W, Zhang H, et al. A machine learning approach for classifying ischemic stroke onset time from imaging. IEEE transactions on medical imaging. 2019;38:1666-1676.

8. Muthukumaran D, Sivakumar M. Medical Image Registration: A Matlab Based Approach. International Journal of Scientific Research in Computer Science, Engineering and Information Technology. 2017;2:29-34.

9. Sandor S, Leahy R. Surface-based labeling of cortical anatomy using a deformable atlas. IEEE transactions on medical imaging. 1997;16:41-54.

Figures

Figure 1: Indicates FLAIR and DWI volumes following registration using MATLAB’s geometric/intensity based registration tool. Pink and green volumes indicate FLAIR and DWI volumes respectively. Note the high degree of spatial overlap of the brain and ventricle edges within all three views (axial, lateral, anteroposterior (AP)). Registration accuracy was assessed through ventricle overlap which was found to have a Dice 95% confidence interval of 0.84±0.05 for the 21 patients in this study.

Figure 2: The left and right images indicates a FLAIR slice before and after the surrounding skin is removed using BrainSuite’s open source brain extraction tool. Note the intensities of the skin being similar to the region of infarct caused by a left M1 middle cerebral artery occlusion (left image). Removing regions similar in intensity to infarct simplifies the network’s segmentation task due to an increase in contrast separating the lesion from the rest of the brain (right image).

Table 1. Dice Coefficients, sensitivity, positive predictive values, infarct/ischemic tissue/penumbra volume differences, and Spearman correlation coefficients, with 95% confidence intervals, for each network’s segmentation performance of either ischemic tissue (DWI), infarct (FLAIR), or penumbra (Dual Network and Multi-Input).

Figure 3: Top images indicate DWI and FLAIR slices fed into each network to segment the penumbra seen in the bottom images. The dual network prediction indicates the subtraction of FLAIR segmented infarct from DWI segmented ischemic tissue. The multi-input prediction indicates predicted penumbra using both DWI and FLAIR as network input and the red line is an outline of the ground truth penumbra for this slice. Dual and multi-input Dice coefficients for this slice are 0.83 and 0.78 respectively.

Figure 4: Indicates the three-dimensional overlap of predicted penumbra from the dual network (left image) and multi-input network (right image) with ground truth penumbra. Green, blue, and maroon regions indicate predicted penumbra, ground truth penumbra, and agreement of penumbra spatial locations respectively. Note the higher degree of maroon regions when utilizing the multi-input network and the higher degree of penumbra overestimation (green regions) when using the dual network process.

Proc. Intl. Soc. Mag. Reson. Med. 29 (2021)

4136