Keywords: AI/ML Image Reconstruction, Brain
Motivation: In MRI reconstruction, deep-learning methods often increase network complexity for improved super-resolution, leading to longer reconstruction times and training difficulties.
Goal(s): Our solution introduces an enhanced lightweight network that maintains high-quality performance.
Approach: We accomplish this by stacking Reverse Residual Attention Fusion (RRAF) with PCA and Enhanced Spatial Attention (ESA) for precise feature extraction, utilizing Transformers with depth-wise dilated convolution for better context information, and employing High-Frequency Image Refinement (HFIR) for detailed information recovery.
Results: Our experiments confirm the effectiveness of our approach.
Impact: Introducing the lightweight network represents an important improvement in MRI SR reconstruction. By integrating Reverse Residual Attention Fusion, it upholds exceptional image quality, streamlines network complexity, reduces reconstruction time, and simplifies training for SR MRI image reconstruction.
This research was supported by a grant from the Zhejiang Natural Science Foundation of China (No. LY23F010005), the ALF foundation in the Stockholm Region, and the Joint China–Sweden Mobility program from STINT (Dnr: CH2019-8397).
1. Eyal C, Siuyan L, Noga A, et al. Resolution enhancement in MRI. Magnetic Resonance Imaging. 2006; 24: 133–154. 2. Chao D, Chen Change L, Kaiming H, et al. Learning a deep convolutional network for image super-resolution. ECCV. 2014; Part IV, LNCS 8692: 184–199. 3. Jiwon K, Jung Kwon L, Kyoung Mu L. Accurate image super-resolution using very deep convolutional networks. CVPR. 2015; 1646-1654. 4. Bee L, Sanghyun S, Heewon K, et al. Enhanced deep residual networks for single image super-resolution. CVPR. 2017; 1132-1140. 5. Hengyuan Z, Xiangtao K, Jingwen H, et al. Efficient image super-sesolution using pixel attention. aXiv:2010.01073v1 [eess.IV] 2 Oct 2020. 6. Zongcai D, Ding L, Jie L, et al. Fast and memory-efficient network towards efficient image super-resolution. CVPR. 2020; 853-862. 7. Xiaole Z, Yulun Z, Tao Z, et al. Channel Splitting Network for Single MR Image Super-Resolution. IEEE Trans Image Process. 2019. 28(11): 5649-5662. 8. Haoqian W, Xiaowan H, Xiaole Z, et al. Wide weighted attention multi-scale network for accurate MR image super-resolution. IEEE Trans Circuits Syst Video Technol, 2022; 32(3): 962-975.
Fig. 1: The network consists of three key components: deep feature extraction, upsampling reconstruction, and SR correction (a). RRAF modules (b) progressively enhance features through residual local feature extraction, Pixel and channel attention (PCA), efficient spatial attention (ESA), channel shuffle (CS), and residual connections. The long-distance skip connection input feature is integrated into the feature refinement process, which culminates with a Transformer module for context enrichment.
Fig. 2: Representative slice demonstrating the results of the reconstruction process following 4× downsampling of a volume IXI (T1w), IXI (T2w) and MICC (T1w) for various SR models. The 1st to 3rd rows depict the reconstruction, a zoomed display of the region highlighted in the red square, and the Mean Squared Error (MSE) evaluation, respectively. (a) sagittal slice for IXI (T1w), (b) axial slice for IXI (T2w), and (c) coronal slice for MICC (T1w). The reconstructed images exhibit the impact of different SR models on image quality and detail preservation.
Fig. 3: Bar graphs illustrating image quality metrics at sample scale 4×, including Peak Signal-to-Noise Ratio (PSNR in dB), Structural Similarity Index (SSIM), and Blur assessment. These graphs enable a comprehensive performance comparison between various reconstruction methods, shedding light on the relative effectiveness of each technique in enhancing image quality and detail preservation.
Fig. 4: Summary of ablation studies, detailing the effects of each module on the Peak Signal-to-Noise Ratio (PSNR in dB) and Structural Similarity Index (SSIM) in the reconstructed image quality. This table provides insights into the contribution of individual components to the overall performance of the reconstruction method.
Fig. 5: Representative reconstruction results showcasing the application of the proposed method for SR reconstruction on 3T LR data using a paired HR 7T dataset. This illustrates the feasibility of obtaining T1w and T2w images akin to 7T quality through LR scans, a valuable prospect in clinical practice.