0330

Unsupervised physics-informed deep learning (N=1) for solving inverse qMRI problems – Relaxometry and field mapping from multi-echo data
Ilyes Benslimane1, Thomas Jochmann2, Robert Zivadinov1,3, and Ferdinand Schweser1,3
1Buffalo Neuroimaging Analysis Center, Department of Neurology, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, The State University of New York, Buffalo, NY, United States, Buffalo, NY, United States, 2Department of Computer Science and Automation, Technische Universität Ilmenau, Ilmenau, Germany, Jena, Thuringia, Germany, 3Center for Biomedical Imaging, Clinical and Translational Science Institute at the University at Buffalo, Buffalo, NY, USA, Buffalo, NY, United States

Synopsis

Modeling the non-linear relationship of the Magnetic Resonance (MR) signal and biophysical sources is computationally expensive and unstable using conventional methods. We develop an unsupervised physics-informed deep learning algorithm that quantifies MR parameters from multi-echo GRE data in a single computational pass. The algorithm produced accurate B0 and R2* field maps without phase wrapping artifacts and with typical contrast variations. The success of this network demonstrates the feasibility of physics-informed quantitative MRI (qMRI) without the need for ground truth training data, typically required by similar networks. This developed tool could provide fast and comprehensive tissue characterization in qMRI.

Introduction

The non-linear mathematical relationship between the MRI signal and the biophysical contrast sources has been a significant challenge of quantitative MRI (qMRI). While Deep Learning (DL) has recently achieved unprecedented performance in solving non-linear problems of various kinds1, network training often relies on extensive ground-truth training data. Such training data is often not available in qMRI, primarily when novel techniques aim to quantify properties that have been inaccessible before.

The present work illustrates how a fully unsupervised physics-informed DL algorithm can solve such problems with data from as low as N=1 subject. As a proof-of-concept application, we implemented a network that quantifies R2*, B0, and the transceiver phase from complex-valued bipolar multi-echo gradient-recalled echo (GRE) data. Established state-of-the-art algorithms for this task are computationally expensive and unstable multi-step procedures.2,3

Methods

Neural Network Architecture: We implemented a custom deep neural network (Tensorflow 2; Fig. 1) with an encoder (inversion) and a decoder (forward-simulation) part. The number of nodes in the input and output layers was equal. We added two input nodes (real and imaginary) for each of the M GRE-echoes, resulting in 2M input/output nodes. The loss function was the mean squared difference between the measured MRI signal (input) and the network prediction, ensuring data fidelity and an appropriate (Gaussian) noise model. We included one node for each tissue quantity to be determined in the layer that separated encoder and decoder parts. To enforce the physical relevance of these quantities, we used the following MR-physics model in the decoder part:

$$S(\mathrm{TE}) = A_{0}\cdot e^{-\mathrm{TE}\cdot R^{*}_{2} + i[\omega\cdot \mathrm{TE} + \omega_{\mathrm{TE} = 0} + \Delta\omega_{gr}(\mathrm{TE})]} \:(\mathrm{Equation\:1})$$

with TE being echo time, ω frequency, ωTE = 0 transceiver phase, and Δωgr(TE) the gradient-polarity-related phase difference. Positivity of quantities, if appropriate, was enforced through activation function choice. We used a fully connected network with three hidden layers in the decoder part. A Gaussian noise layer was added after the input layer to improve conditioning.

Network Training: We trained the network on single-voxel volunteer GRE data (N=1) acquired with a 6.5 minutes axial whole-brain 3D multi-echo GRE sequence at 3T (TR/TE1/ΔTE=30.4/4/2.5 ms; 10 echoes, flip 14°; 558 Hz/px; 0.67x0.9x1.8 mm3; Fig. 2b,e). We used ADAM4 optimization on one GPU (NVIDIA GeForce RTX 2080 Ti).

Evaluation: We extracted the tissue quantities (Figure 3) from the separation layer and compared them to parameters obtained with conventional least-squares fitting of denoised magnitude data (Figures 3 and 4).

Results

The training loss converged after 2500 epochs, and training was stopped after 2 hours at 7000 epochs. Figure 2a,d shows the predicted (output) magnitude and phase images of the 10-th echo, which was highly similar to the input (Fig. 2c,f), illustrating that the parameters at the separation layer appropriately describe the measured signal throughout the brain. Figure 3a-f contrasts the predicted tissue parameters with those obtained by conventional methods. The resulting field map (Fig. 3g) was free of wraps and showed typical tissue contrast, the transceiver phase (Fig. 3h) demonstrated typical non-harmonic behavior, and the gradient-polarity-related phase showed a linear spatial pattern (Fig. 3i), as expected. Relaxation rate and A0 were similar to those obtained with conventional methods. Figure 4 quantifies the prediction errors of the parameter maps compared to those obtained with conventional methods. Computation time for whole-brain prediction from complex-valued data was 6.5 seconds.

Discussion

This study demonstrated the feasibility of solving inverse problems of qMRI with DL entirely based on the theoretical signal model and measured MRI signals. Theoretical models are usually available and can be more complicated than the one used here. Ground-truth training data (labels) are not needed. The implemented network yielded maps in a single computational pass whose computations require multiple, independent processing steps with established techniques, some of them ill-conditioned. Since the proposed network works on raw data, it correctly accounts for the signal noise, different from most magnitude-based processing algorithms. Results were similar to the conventional methods but not identical. However, it remains unclear if the predicted R2* or the conventional R2* is more correct because model inaccuracies in established methods render them a questionable gold standard. Since the implemented network yielded accurate frequency and phase maps throughout the brain without wrapping artifacts, despite its voxel-by-voxel nature, it may perform well with phase data that can be difficult to unwrap, such as in the presence of spatial discontinuities.

Conclusion and Outlook

Network training without ground truth data allows inverting complex theoretical signal models that could not be inverted until recently due to inefficient numerical techniques. Future research will validate the proposed framework systematically, evaluate generalization, and investigate the incorporation of more advanced biophysical multi-compartment models in the decoder part of the network and additional MR measurements in the input layer. The proposed neural network framework may allow fast and comprehensive tissue characterization in qMRI with complex signal models that have been difficult to solve with conventional techniques.

Acknowledgements

This study was supported by an equipment grant from Canon Medical Systems Corporation and Canon Medical Research USA, Inc. Research reported in this publication was funded by the National Center for Advancing Translational Sciences of the National Institutes of Health under Award Number UL1TR001412. The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

References

1. M. T. McCann, K. H. Jin, and M. Unser, “Convolutional Neural Networks for Inverse Problems in Imaging: A Review,”IEEE Signal Processing Magazine, vol.34, no. 6, pp. 85–95, Nov. 2017.

2. Robinson SD, Bredies K, Khabipova D, Dymerska B, Marques JP, Schweser F. An illustrated comparison of processing methods for MR phase imaging and QSM: combining array coil signals and phase unwrapping. NMR Biomed. 2017 Apr;30(4):e3601. PMCID: PMC5348291

3. Schweser F, Robinson SD, de Rochefort L, Li W, Bredies K. An illustrated comparison of processing methods for phase MRI and QSM: removal of background field contributions from sources outside the region of interest. NMR Biomed. 2017 Apr;30(4):e3604. PMCID: PMC5587182

4. D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” Dec. 2014.

Figures

Figure 1: Visual schematic of the neural network. An input dataset of a given MR image has Gaussian noise added before being fed through three dense hidden layers (tanh activation) of alternating size (24, 25 nodes) to calculate the five MR physics outputs (separation layer). These outputs are then forward simulated and compared to the original data for network weight optimization.

Figure 2: Shown from top left to bottom right: (a) predicted magnitude image of the 10th echo of the multi-echo GRE scan (at network output layer), (b) expected magnitude image of the input multi-echo GRE scan, (c) magnitude image discrepancy, (d) predicted phase image, (e) expected input phase image, (f) phase image discrepancy.

Figure 3: Shown from top left to bottom right: (a) Conventionally obtained amplitude image of a multi-echo GRE scan, (b) Network predicted amplitude parameter map, (c) Ratio of predicted to conventionally obtained amplitude images, (d) Conventionally obtained R2* image of a multi-echo GRE scan, (e) Network predicted R2* parameter map, (f) Difference in R2* maps between conventionally obtained and network trained methods, (g) the frequency prediction map, (h) the phase offset prediction, (i) the gradient offset prediction

Figure 4: Bland-Altman plot of conventionally obtained R2* data (x-axis) and network prediction (y-axis). Difference between the methods is non-systematic, suggesting that they are dominated by noise.

Proc. Intl. Soc. Mag. Reson. Med. 29 (2021)
0330