5198

3D CNN for Oxygen Extraction Fraction Mapping with combined QSM and qBOLD

Patrick Kinz¹ and Lothar R Schad¹
¹Computer Assisted Clinical Medicine, Medical Faculty Mannheim, Heidelberg University, Mannheim, Germany

Synopsis

Keywords: Machine Learning/Artificial Intelligence, Oxygenation

We developed a CNN for OEF mapping from QSM+qBOLD data, which utilizes utilizes 3D convolutional layers. Two dimensions for the spatial components of an image and one dimension for the temporal component in the qBOLD data. The results are an improvement over our previous 2D CNN, but even the more advanced network architecture struggles with voxels, that have a very low deoxyhemoglobin content. In this abstract we also study with simulated data when the CNN produces reliable results and when it predicts default values for the reconstructed parameters instead.

Introduction

The oxygen extraction fraction (OEF) can be used as an indicator for the vitality of tissue in pathologies that affect the blood perfusion like a stroke¹ or that affect the tissues metabolism like cancer². To determine the OEF we use a model that combines quantitative BOLD (qBOLD) and quantitative susceptibility mapping (QSM)³.
Traditional fitting methods take a long time to reconstruct the parameter maps for a whole brain and strongly depend on the initial guess. Convolutional Neural Networks (CNNs) provide a way to get fast results without any initial guess. Previous works⁴showed a lot of potential, so we worked on further improving the neural networks.

Methods

Artificial training data was created assuming the equations from the QSM+qBOLD model. It has 5 parameters: Initial signal amplitude S₀, transverse relaxation rate R₂, venous blood oxygenation Y, volume percentage of a voxel filled by deoxygenated blood ν and susceptibility χ_nb of the non-blood tissue surrounding the blood vessels. Y of the venous blood is used instead of OEF since OEF=(Y_a -Y_v )/Y_a and Y_a is assumed to be constant.
To create training data patches of 30*30 pixels were taken from a segmented brain⁵. Random values for the 5 parameters were assigned to each tissue type in a patch. Example parameter maps can be seen in the top row of figure 1. The simulated signal was calculated for a GESFIDE sequence⁶ with a Spin Echo at 40 ms and 16 Gradient echoes spaced every 3 ms. Gaussian noise was added to the resulting signal to achieve an SNR of 100. The parameter maps were also used to calculate a quantitative susceptibility map of each patch, which is used as an additional input. The training set consists of approximately 200,000 patches.
To reconstruct the parameters a simple CNN as shown and described in figure 2 was used in our previous work⁷. Figure 3 shows an improved CNN which utilizes 3D convolutions and has added batch normalization and spatial dropout. Both were implemented using Keras⁸ and TensorFlow⁹.

Results

Figure 1 depicts the ground truth parameter maps of an exemplary patch together with the reconstruction results of both networks. The improved CNN with 3D convolutional layers leads to qualitatively better results for Y and ν, while it maintains a high level of accuracy for the other 3 parameters.
Figure 4 gives an overview over the reconstruction accuracy of the CNN with 3D convolutions. Each subplot is a 2D histogram of all the true parameter values on the x axis vs the predicted parameter values on the y axis. A perfect result would be a thin line at an 45° angle for the subplots on the main diagonal and uniform noise for all other combinations. S₀, R₂ and χnb come close to the ideal, while Y and ν show clear correlations between them. These are further observed in figure 5, where the correlation plots of the CNN with 2D kernels is compared with the CNN with 3D kernels. Both networks show similar tendencies in parameter ranges with low deoxyhemoglobin content in a voxel. In these cases the additional signal loss from the deoxyhemoglobin is very small and the networks do not have enough information to accurately estimate the true values of Y and ν. The CNN with 3D kernels can still accurately predict Y down to ν of almost 2%. At ν below 2% it uses a default value for Y near 85%. Similar behavior is visible for predictions of ν when Y goes above 70%. Interestingly, both networks started to use the same default values of Y=85% and ν=3.5%. The older network never predicted a ν below 3% while the 3D CNN can go down to 1% for strongly deoxygenated blood with Y below 40%.

Discussion and Conclusion

The improved CNN with 3D convolutional layers can extract more information from the signal and has accurate predictions for a wider range of parameters. Other studies¹⁰ found strongly deoxygenated blood with an mean OEF around 50% and mean deoxygenated blood volume ν of 1%. The expected values for Y fall in the stable range of the networks, but the expected values for ν are below the possible range of the 2D CNN and at the edge of the range for the improved 3D CNN. Further improvements of the network architecture might move the range a bit more to lower values. To solve the problem we will continue by studying other sequences with more data points or higher SNR. Training a CNN with the same architecture as the 3D CNN on simulated data for possible sequences will provide a way to compare which sequences provide the most useful information.

Acknowledgements

No acknowledgement found.

References

1. Ibaraki M, Shimosegawa E, Miura S, Takahashi K, Ito H, Kanno I, Hatazawa J. PET measurements of CBF, OEF, and CMRO2 without arterial sampling inhyperacute ischemic stroke: method and error analysis. Ann Nucl Med 2004;18(1):35-44.

2. Vaupel P, Mayer A. Hypoxia in cancer: significance and impact on clinical outcome. Cancer Metastasis Rev 2007;26(2):225-239.

3. Hubertus, S, Thomas, S, Cho, J, Zhang, S, Wang, Y, Schad, LR. Comparison of gradient echo and gradient echo sampling of spin echo sequence for thequantification of the oxygen extraction fraction from a combined quantitative susceptibility mapping and quantitative BOLD (QSM+qBOLD) approach. MagnReson Med 2019;82:1491-1503.

4. Hubertus, S, Thomas, S, Cho, J, Zhang, S, Wang, Y, Schad, LR. Using an artificial neural network for fast mapping of the oxygen extraction fractionwith combined QSM and quantitative BOLD. Magn Reson Med. 2019; 82: 2199– 2211

5. Alfano, B, Comerci, M, Larobina, M, Prinster, A, Hornak, JP, Selvan, SE, Amato, U, Quarantelli, M, Tedeschi, G, Brunetti, A, Salvatore, M. An MRI digitalbrain phantom for validation of segmentation methods. Med Image Anal. 2011 Jun;15(3):329-39

6. Ma, J, Wehrli, F. Method for Image-Based Measurement of the Reversible and Irreversible Contribution to the Transverse-Relaxation Rate. J. Magn.Reson. B 1996; 111:61

7. Kinz, P, Schad, LR, A CNN for Oxygen Extraction Fraction Mapping with combined QSM and qBOLD, ISMRM 2022, Abstract 1986

8. Chollet, F et al, Keras, https://keras.io, 2015

9. Abadi, M et al, TensorFlow: Large-scale machine learning on heterogeneous systems, 2015. Software available from tensorflow.org.

10. Xiang, H, Yablonskiy, DA, Quantitative BOLD: Mapping of Human CerebralDeoxygenated Blood Volume and Oxygen ExtractionFraction: Default State, MagnReson Med 2007;57:115-126.

Figures

Figure 1: Parameter Maps for an exemplary patch used to test the CNNs. The top row shows the ground truth values. The second row shows the results of the 2D CNN described in figure 2 and the third row shows the predictions of the 3D CNN described in figure 3. Both networks do well in reconstructing S₀, R₂ and χ_nb. The results of Y and ν have qualitative improved. Areas with very high Y, like the yellow stripe at the bottom left, contain little deoxyhemoglobin, so that the networks can not accurately reconstruct ν. Instead they produce a default value.

Figure 2: Architecture of the CNN with only 2D convolutional kernels. The network gets simulated GESFIDE data with 30x30 pixels and 16 echoes as input for the qBOLD branch. The 16 echoes are represented as 16 channels. The QSM branch gets a 30x30 pixel susceptibility map as input. Both branches have one convolutional layer with 16 filters with kernel size 3x3 and tanh activation. Afterwards they are joined through concatenation, which is followed by a convolutional layer with 32 filters. A final convolution with linear activation reduces the number of layers to 5 for the parameters.

Figure 3: Architecture of the CNN with 3D convolutional kernels. This network uses more convolutional layers. Each convolutional layer is followed by batch normalization and spatial dropout layers. The qBOLD branch has an additional dimension to separate time and channels. It uses 3D convolutional layers with kernels of size 3*3*5. Zero padding keeps the spatial image size constant, while the time axis shrinks with each convolution until it has length one. Reshaping removes the time dimension to allow for concatenation with the QSM branch.

Figure 4: These histograms show the correlation between true and predicted parameters by the CNN with 3D Kernels. S₀, R₂, and χ_nb show the desired behavior. The true and the predicted values match each other forming a diagonal line while the other parameters show no correlation with the predicted values. Y and ν are correlated with each other, since both influence the amount of deoxyhemoglobin present in a voxel. A more detailed comparison can be found in figure 5.

Figure 5: These plots show the correlation between true and predicted venous oxygen saturation Y and deoxygenated blood volume ν. Both networks perform best in cases with a lot of deoxyhemoglobin in a voxel. If ν is too low, the predicted Y tends more and more towards 85%. And if Y is above a certain limit, the predicted ν is around 3.5%. The CNN with 3D Kernels has a larger range of parameter combinations where predicted value and ground truth match and is able to determine ν below 3% for strongly deoxygenated blood.

Proc. Intl. Soc. Mag. Reson. Med. 31 (2023)

5198

DOI: https://doi.org/10.58530/2023/5198