0509

DeepCEST 3T: Robust neural network prediction of 3T CEST MRI parameters including uncertainty quantification

Felix Glang¹, Anagha Deshmane¹, Sergey Prokudin², Florian Martin¹, Kai Herz¹, Tobias Lindig³, Benjamin Bender³, Klaus Scheffler^1,4, and Moritz Zaiss^1,5
¹Magnetic Resonance Center, Max Planck Institute for Biological Cybernetics, Tübingen, Germany, ²Department of Perceiving Systems, Max Planck Institute for Intelligent Systems, Tübingen, Germany, ³Department of Diagnostic and Interventional Neuroradiology, Eberhard Karls University Tübingen, Tübingen, Germany, ⁴Department of Biomedical Magnetic Resonance, Eberhard Karls University Tübingen, Tübingen, Germany, ⁵Department of Neuroradiology, University Clinic Erlangen, Erlangen, Germany

Synopsis

Analysis of CEST data often requires complex mathematical modeling before contrast generation, which can be error prone and time-consuming. Here, a probabilistic deep learning approach is introduced to shortcut conventional Lorentzian fitting analysis of 3T in-vivo CEST data by learning from previously evaluated data. It is demonstrated that the trained networks generalize to data of a healthy subject and a brain tumor patient, providing CEST contrasts in a fraction of the conventional evaluation time. Additionally, the probabilistic network architecture enables uncertainty quantification, indicating if predictions are trustworthy, which is assessed by perturbation analysis.

Introduction

Calculation of sophisticated MR contrasts often requires complex mathematical modeling, which can be computationally expensive and sensitive to fit algorithm parameters. In this work, we investigate whether neural networks (NNs) can provide not only a shortcut to conventional fitting, but also a quality metric for the predicted values, so-called uncertainty quantification, investigated here in the context of multi-pool Lorentzian fitting of CEST-spectra at 3T. The uncertainties allow radiographers to interpret the generated CEST maps with high confidence.

Methods

Z-spectra at 55 frequency offsets between ±100ppm were acquired from 4 healthy subjects and one brain tumor patient at 3 different sites with identical 3T whole-body MRI systems (MAGNETOM Prisma, Siemens Healthcare, Erlangen, Germany) after written informed consent, using a 3D snapshot-CEST sequence¹ and a low-power pre-saturation block².
Z-spectra were de-noised by principal component analysis using the median truncation criterion³. A four-pool Lorentzian fit model⁴ was used describing direct water saturation (DS), semi-solid magnetization transfer (MT), amide (APT) and relayed NOE peaks. The model includes the water frequency shift δ_DS as free parameter and thus takes B₀ inhomogeneity into account.

Deep feed-forward NNs were set up in tensorflow⁵/keras⁶ to map vectors $$$x$$$ of 55 elements representing raw Z-spectra to vectors $$$\mu(x)=(\mu_1(x),...,\mu_n(x))$$$ of $$$n=10$$$ elements, representing the free parameters of the Lorentzian model. Following approaches of learned output variance via maximum-likelihood estimation employed in computer vision^7–9, the NN has 10 additional output neurons representing uncertainties $$$\sigma(x)=(\sigma_1(x),...,\sigma_n(x))$$$ of each Lorentzian parameter. These uncertainty outputs are indirectly inferred from the training data by training with a Gaussian negative log-likelihood loss function $$-\log p_\theta(\mu^{\text{tgt}},\mu(x),\sigma(x))=\frac{1}{2} \sum_{i=1}^{n}\left(\frac{\mu^{\text{tgt}}_i-\mu_i(x)}{\sigma_i(x)}\right)^2+\sum_{i=1}^{n}\log(\sigma_i(x))+\frac{n}{2}\log(2\pi)$$ with the ground-truth training targets $$$\mu^{\text{tgt}}_{i}$$$ obtained by conventional least squares fitting. The workflow is shown in Figure 1.

A network called NN1 was trained on the combined datasets of 3 healthy subjects and tested on the fourth healthy subject and tumor patient datasets. Training data augmentation with simulated Gaussian noise in the inputs and with additional simulated B₀ shifts in a range of ±0.8 ppm was employed. Overfitting was avoided by early stopping with 10 % of the data used as validation set. NN hyperparameters were optimized by a grid search, resulting in 2 layers with 100 neurons each and ELU activation.
Uncertainty quantification was assessed by input perturbations with Gaussian noise and additional B₀ shifts of various strengths. Additionally, an implant-like B₀-artifact caused by a magnetic dipole in the tumor patient’s skull was simulated.

Results

NN1 generalizes to the unseen healthy subject test dataset, as the predicted parameter maps (Figure 2b) closely match the ground truth Lorentzian fit results (Figure 2a), resulting in low prediction errors (Figure 2d). This is confirmed by scatter (Figure 2d) and Bland-Altmann analysis (Figure 2e).
For input perturbation with noise (Figure 3a), strongly increased uncertainty outputs for all corrupted voxels reflect the larger prediction errors caused by fluctuations in the inputs. For perturbation with simulated B₀ shifts ≥1 ppm (Figure 3b), uncertainties are significantly increased, as these shifts exceed the range covered by the training data. Thus, the uncertainty quantification recognizes corrupted and “out-of-training-data” inputs.
NN1 predictions from the tumor patient dataset (Figure 4d) show similar contrasts as the fit (Figure 4c), with APT hyperintensity in the tumor area. Pearson correlation coefficient r between ground truth and prediction is larger than 0.87 for all parameters. The uncertainty maps given in Figure 4e show increased values in the center of the displayed slice where SNR is lower, in parts of the tumor area – especially in the T₂-FLAIR hyperintense region (Figure 4b) – and in vessels, which can show noise-like pulsation artifacts.
For the simulated implant-like B₀-artifact, strongly increased uncertainties (Figure 5e) indicate failure of the prediction (Figure 5d) close to the dipole location, where the strong field inhomogeneity (Figure 5a) shifts Z-spectra drastically. Thus, contrast that might arise from or is depleted by the B₀ artifact can be identified by means of the uncertainty maps and therefore not be misinterpreted.

Discussion

The trained NN generates CEST contrasts from uncorrected Z-spectra rapidly (~1s for ~50,000 Z-spectra as opposed to ~10min in case of Lorentzian fitting) and accurately, generalizing to data that was not included in training. A limitation is introduced by the fact that training data itself is generated by least squares Lorentzian fitting. This is a simplified model and suffers from typical instabilities caused by low SNR and dependence on initial and boundary values. Thus, the current approach proves the concept of a shortcut to conventional fitting including uncertainty quantification but can be extended to improved Z-spectrum models such as Bloch-McConnell fitting^10–12.

Using NNs as shortcut for least squares fitting has been proposed before¹³, also in the context of MRI^14–16 and CEST^17–19. However, none of these approaches includes uncertainty quantification. The presented probabilistic output layer can be easily adapted to other regression NNs, enabling uncertainty quantification for similar approaches.

Conclusion

The deepCEST NN allows fast estimation of CEST parameters, providing a shortcut to the conventional evaluation method at 3T. Moreover, the introduced uncertainty quantification indicates if the predictions are trustworthy, enabling confident interpretation of contrast changes. This is promising for fast online reconstruction, bringing sophisticated CEST contrasts a step closer to clinical application.

Acknowledgements

Max Planck Society; German Research Foundation (DFG, grant ZA 814/2-1, support to KH); European Union Horizon 2020 research and innovation programme (Grant Agreement No. 667510, support to MZ, AD).

References

1. Zaiss M, Ehses P, Scheffler K. Snapshot-CEST: Optimizing spiral-centric-reordered gradient echo acquisition for fast and robust 3D CEST MRI at 9.4 T. NMR in Biomedicine. 2018;31(4):e3879. doi:10.1002/nbm.3879

2. Deshmane A, Zaiss M, Lindig T, et al. 3D gradient echo snapshot CEST MRI with low power saturation for human studies at 3T. Magnetic Resonance in Medicine. 2019;81(4):2412-2423. doi:10.1002/mrm.27569

3. Breitling J, Deshmane A, Goerke S, et al. Adaptive denoising for chemical exchange saturation transfer MR imaging. NMR in Biomedicine. 2019;0(0):e4133. doi:10.1002/nbm.4133

4. Goerke S, Soehngen Y, Deshmane A, et al. Relaxation-compensated APT and rNOE CEST-MRI of human brain tumors at 3 T. Magnetic Resonance in Medicine. March 2019. doi:10.1002/mrm.27751

5. Abadi M, Barham P, Chen J, et al. TensorFlow: A System for Large-Scale Machine Learning. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI ’16). ; 2016:265-283. https://www.usenix.org/conference/osdi16/technical-sessions/presentation/abadi. Accessed June 4, 2019.

6. Chollet F, others. Keras. https://github.com/fchollet/keras. Published 2015. Accessed January 7, 2019.

7. Kendall A, Gal Y. What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? In: Guyon I, Luxburg UV, Bengio S, et al., eds. Advances in Neural Information Processing Systems 30. Curran Associates, Inc.; 2017:5574–5584.

8. Prokudin S, Gehler P, Nowozin S. Deep Directional Statistics: Pose Estimation with Uncertainty Quantification. In: Ferrari V, Hebert M, Sminchisescu C, Weiss Y, eds. Computer Vision – ECCV 2018. Vol 11213. Cham: Springer International Publishing; 2018:542-559. doi:10.1007/978-3-030-01240-3_33

9. Gast J, Roth S. Lightweight Probabilistic Deep Networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition; 2018:3369-3378.

10. Heo H-Y, Zhang Y, Lee D-H, Hong X, Zhou J. Quantitative assessment of amide proton transfer (APT) and nuclear overhauser enhancement (NOE) imaging with extrapolated semi‐solid magnetization transfer reference (EMR) signals: Application to a rat glioma model at 4.7 tesla. Magnetic Resonance in Medicine. 2016;75(1):137-149. doi:10.1002/mrm.25581

11. Heo H-Y, Zhang Y, Jiang S, Lee D-H, Zhou J. Quantitative assessment of amide proton transfer (APT) and nuclear overhauser enhancement (NOE) imaging with extrapolated semisolid magnetization transfer reference (EMR) signals: II. Comparison of three EMR models and application to human brain glioma at 3 Tesla. Magnetic Resonance in Medicine. 2016;75(4):1630-1639. doi:10.1002/mrm.25795

12. Liu D, Zhou J, Xue R, Zuo Z, An J, Wang DJJ. Quantitative characterization of nuclear overhauser enhancement and amide proton transfer effects in the human brain at 7 Tesla. Magnetic Resonance in Medicine. 2013;70(4):1070-1081. doi:10.1002/mrm.24560

13. Bishop CM, Roach CM. Fast curve fitting using neural networks. Review of Scientific Instruments. 1992;63(10):4450-4456. doi:10.1063/1.1143696

14. Bertleff M, Domsch S, Weingärtner S, et al. Diffusion parameter mapping with the combined intravoxel incoherent motion and kurtosis model using artificial neural networks at 3 T. NMR in Biomedicine. 2017;30(12):e3833. doi:10.1002/nbm.3833

15. Domsch S, Mürle B, Weingärtner S, Zapp J, Wenz F, Schad LR. Oxygen extraction fraction mapping at 3 Tesla using an artificial neural network: A feasibility study. Magnetic Resonance in Medicine. 2018;79(2):890-899. doi:10.1002/mrm.26749

16. Liu H, Xiang Q-S, Tam R, MacKay A, Kramer J, Laule C. 150× acceleration of myelin water imaging data analysis by a neural network, Program Number 0152. In: Proceedings of ISMRM 27th Annual Meeting; 2019.

17. Chen L, Schär M, Chan KWY, et al. High-resolution phosphocreatine mapping of human skeletal muscle by artificial neural network-based chemical exchange saturation transfer MRI at 3T, Program Number 0048. In: Proceedings of ISMRM 27th Annual Meeting; 2019.

18. Deshmane A, Zaiss M, Herz K, Bender B, Lindig T, Scheffler K. Machine Learning accelerates and stabilizes selective CEST fitting at 3T, Program Number 5002. In: Proceedings of ISMRM 27th Annual Meeting; 2019.

19. Glang F, Deshmane A, Martin F, Herz K, Scheffler K, Zaiss M. Can a neural network predict B0 maps from uncorrected CEST-spectra? Program Number 4995. In: Proceedings of ISMRM 27th Annual Meeting; 2019.

Figures

Figure 1: Schematic of the presented data processing pipeline. A deep neural network is used to shortcut CEST-spectra evaluation by training with data that was previously evaluated using conventional techniques (PCA denoising, Lorentzian fitting). The trained network directly maps from raw Z-spectra to Lorentzian parameters, yielding CEST contrast maps in about 1 second where conventional evaluation takes ~10 minutes. Additionally, the neural network architecture is able to provide uncertainty quantification that indicates if predictions are trustworthy.

Figure 2: Results of NN1 applied to a test dataset that was not included in training, compared to least squares fitting results: (a) Parameter maps obtained by least squares fit, (b) NN1 predictions of Lorentzian parameters, (c) difference between fit results and NN1 predictions, (d) scatter plots of prediction vs. fit (legends indicate Pearson correlation coefficients r) and (e) Bland-Altmann plots (red: mean, green: mean±1.96·std across all voxels). Rows correspond to the parameters APT, NOE and MT amplitude and water frequency shift δ_DS.

Figure 3: Input perturbation analysis for NN1: Voxel-wise scatter plots together with marginal histograms of NN uncertainty vs absolute prediction error for (a) different Gaussian noise amplitudes and (b) additional simulated B₀ shifts. Perturbations of each strength were applied to the entire healthy subject test dataset shown in Figure 2. Colors indicate the respective perturbation strength. Results are given exemplarily for the APT peak amplitude.

Figure 4: Application of NN1 in tumor patient. Clinical contrasts: (a) T1ce and (b) T2 FLAIR. (c) Fit results. (d) NN1 prediction. (e) difference between fit and NN1 prediction. (f) NN1 uncertainty.

Figure 5: Performance of NN1 for simulation of an implant-like B₀ artifact. (a) Relative B₀ shift simulated by placing a magnetic dipole (in-plane orientation) in the upper right corner of the displayed slice. The dipole field strength is scaled such that it results in a B₀ shift of 1 ppm in the center of the slice. (b) Conventional fit results, (c) NN predictions, (d) prediction errors and (e) uncertainty estimations for the simulated implant artifact corrupted dataset.

Proc. Intl. Soc. Mag. Reson. Med. 28 (2020)

0509