1759

DeepSlider: Deep learning-powered gSlider for improved robustness and performance
Juhyung Park1, Dongmyung Shin1, Hyeong-Geol Shin1, Jiye Kim1, and Jongho Lee1
1Seoul National University, Seoul, Korea, Republic of

Synopsis

We designed a slab-encoding RF pulse set, referred to as DeepSlider, using deep reinforcement learning to improve the robustness and performance. The condition number of the RF encoding matrix, which determines the sensitivity of the design to noise, was optimized via deep reinforcement learning and gradient descent. Additionally, the design was extended from reconstructing the real signal to the complex signal, allowing us to utilize the complex signal instead of the real part, expanding the application of the pulse to phase-based imaging.

Introduction

Acquiring a thin slice is important in high-resolution 2D neuroimaging. Recently, generalized slice dithered enhanced resolution (gSlider) was developed to generate high-resolution sub-slice images (e.g., 660 $$$μ$$$m diffusion MRI) via combinations of slab-encoding RF pulses1,2. Recently, a deep reinforcement learning (DRL)-powered machine-designed RF pulse was proposed, providing a lower specific absorption rate (SAR) RF than conventional RF3. In this study, we utilize this DRL approach to generate an RF pulse set that has an improved condition number in the RF encoding matrix of gSlider, which determines the sensitivity of the design to noise (Fig. 1). The proposed method is referred to as DeepSlider. Additionally, we expand the dimension of DeepSlider reconstruction from real signal to complex signal, proposing another method, DeepSlidercomplex, expanding application of the pulse to phase-based imaging.

Method

[Conventional gSlider] Conventional gSlider RF pulses are generated using publicly available codes4. Design parameters are as follows: TBWP = 12, pulse duration = 11 ms, and five RF encoding basis. The signals, $$$b$$$, obtained from multiple MRI scans with the slab-encoding RF pulses can be modeled as a linear system of an encoding matrix, $$$A$$$, which is determined by the RF pulses, and sub-slice signals, $$$z$$$ (i.e., $$$b = Az$$$). The sub-slice signals are estimated by the pseudo-inverse of the equation.

[DeepSlider] The objective of DeepSlider is to generate five slab-encoding RF pulses that have the lowest condition number for A in order to improve the robustness of the pseudo-inverse against noise. To generate RF pulses, DRL creates RF pulses that maximize a reward function (see below). Then, the DRL-generated RF pulses are grouped into five, further optimized by the gradient descent stage using a loss function (see below). The TBWP and pulse duration are the same as in the gSlider design.

[DRL] DRL consists of a deep neural network agent and Bloch equation environment (Fig. 2a). The reward function is defined as follows:$$reward_{RL}={c_1}\sum_{z_p\in{Z_{pass}}}|M_{xy}(z_p)|-{c_2}\sum_{z_s\in{Z_{stop}}}|M_{xy}(z_s)|-c_3SAR$$where the first, second, and third terms represent passband magnetization, stopband magnetization, and SAR regularizer, respectively. The three coefficients, $$$c_1$$$ = 0.5, $$$c_2$$$ = 0.35, and $$$c_3$$$ = 10-3, are determined empirically. In a single episode, the DRL agent generates an RF pulse as action. Then, the environment returns the reward of the RF pulse. Using this reward, the DRL agent updates the network weights with a proximal policy optimization algorithm. A total of 105 RF pulses are generated.

[Gradient descent] In the gradient descent stage (Fig. 2b), 500 DRL-generated RF pulses, which have the highest rewards, are selected. Then, five RF pulses are randomly selected, resulting in 100 groups. The loss function of each group, which is minimized via the gradient descent, is defined as follows:$$loss=-eig_5(A)-\sum_{i=1}^5reward_{RL,i}$$where the first term $$$eig_5(A)$$$ is the fifth eigenvalue of the encoding matrix , and the second term is the sum of the reward function from DRL. Using this loss function, the RF pulses in each group are refined by gradient descent. After 2000 iterations, an RF pulse set with the minimum loss is chosen as the final result.

[Experiment] To evaluate the DeepSlider results, a point spread function (PSF) is calculated as follows: $$PSF_i=(A^TA+\lambda{I})^{-1}A^T\delta_i$$where $$$\delta_i$$$ is an impulse at the $$$i$$$th sub-slice, $$$\lambda$$$ is a Tikhonov regularizer (= 0.4) and $$$PSF_i$$$ is the impulse response of $$$\delta_i$$$. To explore the effects of noise, PSF errors of gSlider and DeepSlider are estimated by varying SNR from 10 to 150. Also, the mean SAR for the RF pulses and the condition number of the encoding matrix $$$A$$$ are compared.

[Complex DeepSlider] Since gSlider slab-encoding RF pulses are designed for the real part of a complex MRI signal1, expanding it to complex is beneficial when the phase information is valuable. Hence, we design a new method, DeepSlidercomplex, that allows the encoding matrix to be complex. To designed a complex encoding matrix A, the eigenvalue term in the loss function is modified by separating the real and imaginary parts of $$$A$$$ :
$$loss=-eig_5(real(A))-eig_5(imag(A))-\sum_{i=1}^5reward_{RL,i}$$To evaluate DeepSlidercomplex results, the PSF errors of DeepSlidercomplex and DeepSlider are estimated using sub-slice impulse, $$$\delta_i$$$ , which has a phase error from 0 to 80 for the SNR of 20, 50, and 80.

Results

[DeepSlider] DeepSlider successfully reconstructs the sub-slice impulse functions (Fig. 3a), showing slightly lower SAR (= 3.02 (mG)2sec) and improved condition number (1.58) than those of gSlider (SAR: 3.30 (mG)2sec; condition number: 2.02). When the noise is added, DeepSlider shows lower PSF errors than those of gSlider, particularly at low SNRs (Fig. 3b).

[DeepSliderComplex] DeepSlidercomplex shows the potential of reconstructing robust sub-slice impulse functions (Fig. 5), producing the mean SAR of 3.11 (mG)2sec and condition number of 1.62 in real and 1.68 in imaginary. Compared to DeepSlider, DeepSlidercomplex demonstrates slightly increased PSF errors due to the noise in the imaginary axis. However, the results report steady performances for the phase errors.

Discussion

In this study, the slab-encoding RF pules are designed using DRL and gradient descent. The results of DeepSlider show a lower condition number and a lower SAR, suggesting robustness to noise. Furthermore, DeepSlidercomplex shows the potential of applying slab-encoding RF pulses for phase imaging or QSM.

Acknowledgements

We sincerely thank Professors Kawin Setsompop and William A Grissom for sharing their codes of the gSlider RF designing tool. This research was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. NRF-2018R1A2B3008445), and funded by the MSIT (NRF-2018R1A4A1025891).

References

[1] K. Setsompop et al., "High‐resolution in vivo diffusion imaging of the human brain with generalized slice dithered enhanced resolution: Simultaneous multislice (gSlider‐SMS)," Magnetic resonance in medicine, vol. 79, no. 1, pp. 141-151, 2018.

[2] F. Wang et al., "Motion‐robust sub‐millimeter isotropic diffusion imaging through motion corrected generalized slice dithered enhanced resolution (MC‐gSlider) acquisition," Magnetic resonance in medicine, vol. 80, no. 5, pp. 1891-1906, 2018.

[3] Dongmyung Shin and Jongho Lee. DeepRF: Designing an RF pulse using a self-learning machine. Highlight session of 28th ISMRM Virtual Conference & Exhibition, 08-14 August 2020.

[4] https://github.com/wgrissom/gSliderRF

Figures

Figure 1. Overview of DeepSlider. Five slab-encoding RF pulses are designed using deep reinforcement learning and gradient descent. The signals from slab-encoding RF pulses can be modeled as a linear system of an encoding matrix and sub-slice signals z. The design objective is to minimize the condition number of the encoding matrix A, improving stability against noise.

Figure 2. Details of generating slab-encoding RF pulses. (a) In a single episode, DRL agent generates an RF pulse as an action, and Bloch equation environment returns a reward for the RF pulse. The reward is composed of slice profile penalty and SAR regularization. The agent is updated using the reward and generates an RF pulse recursively to maximize the reward. (b) DRL-generated RF pulses are randomly grouped into five RF pulses. Then the five RF pulses are optimized to have the lowest condition number in the encoding matrix (and maximize DRL reward term) by gradient descent.

Figure 3. (a) PSFs for each sub-slice using DeepSlider (first row) and gSlider (second row). DeepSlider successfully reconstructs the impulse functions, showing less than 7.5% of side-lobe signal. (b) PSF errors of gSlider (black line) and DeepSlider (blue line) for SNRs from 10 to 150. DeepSlider results show the robustness of SNR.

Figure 4. (a) Pulse shapes (first column) and excitation profiles (second column) of DeepSlider. (b) Pulse shapes (third column) and excitation profiles (last column) of gSlider. Compared to the conventional method, DeepSlider pulses show lower SAR (3.02 vs 3.30 mG2s) and lower condition number (1.58 vs. 2.02).

Figure 5. The results of DeepSlidercomplex. (a) The pulse shapes of DeepSlidercomplex (first row) and excitation profiles (second row). (b) PSF results. (c) PSF errors of DeepSlidercomplex (green line) and DeepSlider (blue line). Overall, DeepSlidercomplex shows lower errors which are particularly noticeable when the phase error of the impulse increases.

Proc. Intl. Soc. Mag. Reson. Med. 29 (2021)
1759