DeepRF1 is a recently proposed RF pulse design method using deep reinforcement learning and optimization, generating RF defined by a reward (e.g., slice profile and energy constraint) from self-learning. Here, we proposed an accelerated algorithm for DeepRF that utilizes a modified optimal control, replacing the computationally complex gradient ascent-based optimizer. The new algorithm is tested for slice-selective inversion and slice-selective excitation and compared with original DeepRF and SLR RF pulses, reporting improved computation efficiency while preserving performances. Additionally, a short-duration B1-insensitive inversion pulse, which was difficult to produce in conventional RF algorithms, is designed to demonstrate the usefulness of DeepRF.
[1] Dongmyung Shin, et al. “Deep Reinforcement Learning Designed RadioFrequency Waveform in MRI.”, Nature Machine Intelligence, accepted (2021).
[2] Steven Conolly, et al. “Optimal Control Solutions to the Magnetic Resonance Selective Excitation Problem.” IEEE Transactions on Medical Imaging, 5.2 (1986): 106-115.
[3] Rund Armin, et al. “Magnetic resonance RF pulse design by optimal control with physical constraints.” IEEE Transactions on Medical Imaging, 37.2 (2017): 461-472.
[4] Vinding M.S, et al. “DeepControl: 2DRF pulses facilitating inhomogeneity and B0 off‐resonance compensation in vivo at 7 T.” Magnetic Resonance in Medicine, 85.6 (2021): 3308-3317.
[5] Vinding M.S, et al. “Fast numerical design of spatial-selective rf pulses in MRI using Krotov and quasi-Newton based optimal control methods.” The Journal of Chemical Physics, 137.5 (2012): 054203
[6] Vinding M.S, et al. “Local SAR, global SAR, and power‐constrained large‐flip‐angle pulses with optimal control and virtual observation points.” Magnetic Resonance in Medicine, 77 (2017): 374-384
[7] Skinner Thomas E, et al. "Reducing the duration of broadband excitation pulses using optimal control with limited RF amplitude." Journal of Magnetic Resonance, 167.1 (2004): 68-74
[8] Xu Dan, et al. "Designing multichannel, multidimensional, arbitrary flip angle RF pulses using an optimal control approach." Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine 59.3 (2008): 547-560
[9] John Pauly, et al. "Parameter relations for the Shinnar-Le Roux selective excitation pulse design algorithm (NMR imaging)" IEEE Transactions on Medical Imaging, 10.1 (1991): 53-65
[10] Silver M.S, et al. “Highly selective π2 and π pulse generation” Journal of Magnetic Resonance, 59 (1984): 347-351
[11] Paszke, A. et al. “PyTorch: An Imperative Style, High-Performance Deep Learning Library.” Advances in Neural Information Processing Systems, 32 (2019): 8024–8035.Fig. 1. Overview of accelerated DeepRF. In the first step of DeepRF, DRL generates seed for target RF properties defined by reward function. Then the top 256 reward seeds are selected as the input for optimization using either gradient ascent (original DeepRF) or modified optimal control (proposed DeepRF). The gradient ascent costs long computation time (12.4±4.0 hours) for convergence and is replaced by optimal control to reduce time (3.3±0.2 hours) while maintaining performance. Both methods show comparable performance in energy reduction and slice profile (see Figures 2-4).
Figure 2. Comparison of the slice-selective inversion pulses using (a) proposed DeepRF (DRL with mOC) vs. SLR and (b) original DeepRF (DRL with GA) vs. SLR. The RF shapes of the proposed DeepRF and the original DeepRF are not the same but show comparable slice profiles (mean Mz in BW: -0.81, stopband ripple: 0.3%) to those of SLR and also have similar energy reduction (both 11.9% reduction). (c) Summary of the results. The computation time of modified optimal control (mOC) is reduced by 66% when compared to that of gradient ascent (GA).
Figure 3. Comparison of the slice-selective excitation pulses (a) proposed DeepRF (DRL with mOC) vs. SLR and (b) original DeepRF (DRL with GA) vs. SLR. Both of the DeepRF methods show comparable slice profiles (both mean Mxy in BW: 0.94; stopband ripple: 1.4%) and energy reduction (Proposed: 11.4% reduction; Original: 11.0% reduction). (c) Summary of the results. The computation time of modified optimal control (mOC) is reduced by 80% when compared to that of gradient ascent (GA).
Figure 4. Comparison of the short-duration B1-insensitive volume inversion pulses using (a) proposed DeepRF (DRL with mOC), (b) original DeepRF (DRL with GA), and (c) hyperbolic secant (HS) design. The simulated mean Mz over target inversion range (B1: 0.5 - 2.0, off-resonance frequency: -1000 Hz - 1000Hz) is -0.90 in both DeepRF pulses whereas it is only -0.76 in the HS pulse, demonstrating a substantial improvement in the DeepRF pulses. (d) Summary of the results. The computation time of mOC is reduced by 66% when compared to that of GA.