Yanting Yang1, Jeffery Siyuan Tian2, and Jia Guo1
1Columbia University, New York, NY, United States, 2Computer Science, University of Maryland, College Park, Clarksville, MD, United States
Synopsis
Keywords: AI/ML Image Reconstruction, Machine Learning/Artificial Intelligence
Motivation: Complex-valued deep learning framework has not been fully investigated in human normal-field and low-field MRI reconstruction and preprocessing.
Goal(s): We aim to replace conventional numerical methods with deep learning network, which reconstruct and preprocess the k-space data in parallel.
Approach: An artificial Fourier transform network (AFT-Net) is proposed to directly processes the complex-valued raw data in the sensor domain.
Results: An evaluation of accelerated reconstruction and denoised reconstruction shows that AFT-Net demonstrated the ability to reconstruct the data with significantly accelerate acquisition and random Gaussian noise. The proposed AFT-Net is an efficient and accurate approach for MRI reconstruction and preprocessing from raw data.
Impact: MRI reconstruction and preprocessing with AFT-Net should be able to determine the domain-manifold mapping and process k-space data directly, which shows superior performance and can be served as an efficient and accurate approach for human high-field and low-field MRI acquisition.
Introduction
The domain shift from real to complex domain in the context of deep neural networks has uncovered the potential of utilizing the rich representational capacity of complex numbers and boosted the development of complex-valued architectures[1,2,3]. A similar but inverse domain shift is mirrored in MRI reconstruction, where raw data is acquired in complex-valued k-space. It has been shown that k-space, as a low-dimensional feature space, can be leveraged in deep neural networks to determine the between-manifold mapping of domain transforms in low signal-to-noise settings[4]. However, most complex networks do not include end-to-end domain transformation. Here, we propose a complex-valued MRI reconstruction approach, which aims to get rid of conventional numerical methods in the workflow and preprocess the data in parallel. The framework described here is the artificial Fourier transform (AFT). We utilize AFT combined with complex-valued UNet to design our AFT-Net as shown in Figure 1.Methods
Complex-valued neural network
The real-valued network is extended to complex-valued network by defining the complex operator as $$$\mathrm{W}=\mathrm{W}_{real}+i\mathrm{W}_{imag}$$$, where $$$\mathrm{W}_{real}$$$ and $$$\mathrm{W}_{imag}$$$ are real-valued operators (linear or convolution operator). The output of $$$\mathrm{W}$$$ acting on $$$x$$$ can then denoted as complex matrix multiplications:
$$y=\mathrm{W}*x=(\mathrm{W}_{real}*x_{real}-\mathrm{W}_{imag}*x_{imag})+i(\mathrm{W}_{imag}*x_{real}+\mathrm{W}_{real}*x_{imag}).$$
We extend the ReLU in the same way, which preserves consistency and shows comparable performance[3]. Normalization is used in deep learning to accelerate training and reduce statistical covariance shift[5]. This is mirrored in the complex-valued neural network, where we want to ensure that both real and imaginary parts have equal variance. Extending the normalization equation to matrix notation we have:
$$\tilde{z}=V^{-\frac{1}{2}}(z-\mathbb{E}(z))=(\begin{bmatrix}\mathrm{Cov}(\Re(z),\Re(z))&\mathrm{Cov}(\Re(z),\Im(z))\\\mathrm{Cov}(\Im(z),\Re(z))&\mathrm{Cov}(\Im(z),\Im(z))\end{bmatrix}+{\epsilon}I)^{-\frac{1}{2}}\cdot\begin{bmatrix}\Re(z)-\mathrm{Mean}(\Re(z))\\\Im(z)-\mathrm{Mean}(\Im(z))\end{bmatrix},$$
where $$$x-\mathbb{E}(x)$$$ zero centers real and imaginary parts separately, $$$V$$$ is the covariance matrix and $$${\epsilon}I$$$ is added to guarantee the existence of the inverse square root.
Implementation detail
The general workflows are shown in Figure 2. We apply AFT-Net to the normal-field and high-field MRI of humans. Multiple network architectures are evaluated for accelerated and denoised reconstruction to verify the effectiveness of both AFT and CUNet in different domains. We refer to each of them as AFT, AFT-Net (I), AFT-Net (K), and AFT-Net (KI).Results
The reconstruction results using fully-sampled k-space data are shown in Figure 3. It can be seen that the ground truth image obtained from FT is identical to the AFT prediction, which human observers can not distinguish. The residual map (pixel-wise difference between the ground truth image and the AFT prediction) shows that no brain structural information is presented. The grid-like remaining error is mainly caused by precision loss during floating-point calculations in matrix multiplication.
In Figure 4, we show the results of accelerated reconstruction using under-sampled k-space data. In the first row, we see the reconstructions from 1D 4x equal-spaced sampling. Different AFT-Net structures were compared with zero-filling method. AFT-Net (KI) performs outstanding reconstruction, where less structural difference can be seen from the residual map in the second row. The third row shows zoomed-in areas of both images and residual maps. AFT-Net (I) produces more blurry reconstruction that loses the structural details. Reconstruction through AFT-Net (K) induces foggy artifacts, which is reflected in terms of SSIM.
Next, we illustrate the results of denoised reconstruction using k-space data with added Gaussian noise in Figure 5. AFT-Net (I) performs the best across all structures. The second row shows the pixel-wise difference between AFT-Net output and noiseless ground truth. It can be indicated that the noise in the background is attenuated significantly. Although the brain structure can be seen from the residual map, the zoomed-in version of the image shows that the AFT-Net reconstruction preserves the anatomy.Discussion and Conclusion
A novel artificial Fourier transform framework is proposed that determines the mapping between k-space and image domain as conventional DFT while having the ability to be fine-tuned/optimized with further training. The flexibility of AFT allows it to be easily incorporated into existing deep learning networks as learnable or static blocks. AFT is then utilized to design our AFT-Net, which implements complex-valued UNet to extract higher features in k-space and/or image domain. We aim to combine reconstruction and acceleration/denoising tasks into a unified network that enhances the image quality by removing artifacts directly from the k-space. Our AFT-Net achieves competitive results and proves to be more robust to noise and contrast differences. One remaining limitation is that we only implement a complex-valued network with linear layers and CNNs, which are less effective than some advanced architectures. In our future work, we will aim to replace multi-layer perceptrons and CNNs with transformer-based and diffusion-based models while extending the concept of AFT to more medical imaging tasks.Code Availability
The code is available at https://github.com/yangyanting233/AFT-Net.Acknowledgements
This study was supported by the Zuckerman Mind Brain Behavior Institute at Columbia University and Columbia MR Research Center site.References
[1] Georgiou GM, Koutsougeras C. Complex domain backpropagation. IEEE transactions on Circuits and systems II: analog and digital signal processing. 1992 May;39(5):330-4.
[2] Guberman N. On complex valued convolutional neural networks. arXiv preprint arXiv:1602.09046. 2016 Feb 29.
[3] Trabelsi C, Bilaniuk O, Zhang Y, Serdyuk D, Subramanian S, Santos JF, et al. Deep Complex Networks. 2018 ICLR (Poster)
[4] Zhu B, Liu JZ, Cauley SF, Rosen BR, Rosen MS. Image reconstruction by domain-transform manifold learning. Nature. 2018 Mar 22;555(7697):487-92.
[5] Ioffe S, Szegedy C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. InInternational conference on machine learning 2015 Jun 1 (pp. 448-456). pmlr.