4549

Score-based Diffusion Models with Self-supervised Learning for Accelerated 3D Multi-contrast Cardiac MR Imaging

Yuanyuan Liu¹, Zhuo-Xu Cui¹, Congcong Liu¹, Qingyong Zhu¹, Jing Cheng¹, Dong Liang¹, and Yanjie Zhu¹
¹Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China

Synopsis

Keywords: Quantitative Imaging, Machine Learning/Artificial Intelligence, 3D cardiac magnetic resonance imaging, self-supervised, diffusion models, multi-contrast

Motivation: Long scan time significantly hinders the widespread applications of three-dimensional multi-contrast cardiac magnetic resonance (3D-MC-CMR) imaging.

Goal(s): This study aims to accelerate 3D-MC-CMR acquisition by a novel method based on score-based diffusion models with self-supervised learning.

Approach: We first establish a mapping between the undersampled k-space measurements and the MR images, utilizing a self-supervised Bayesian reconstruction network. Secondly, we develop a joint score-based diffusion model on 3D-MC-CMR images to capture their inherent distribution. The 3D-MC-CMR images are finally reconstructed using the conditioned Langenvin Markov chain Monte Carlo (MCMC) sampling.

Results: This approach enables accurate reconstruction with high acceleration rates up to 14.

Impact: The proposed method is trained in a self-supervised manner and therefore particularly suited for 3D CMR imaging that lacks fully sampled data.

INTRODUCTION

Myocardial parameter mapping is an important technique in cardiovascular MR imaging (CMR) (1). There is a growing emphasis on characterizing multiple relaxation times by acquiring multi-contrast (MC) images (2,3). 3D-CMR offers comprehensive information about the entire heart compared to 2D imaging with limited coverage, but it suffers from the long scan time. Deep learning (DL) has emerged as a successful tool in MR reconstruction. However, most DL-based reconstruction methods rely on supervised learning that requires extensive fully sampled datasets for training. Acquiring fully sampled k-space data is challenging for 3D-MC-CMR since the scan time is extremely lengthy. This study endeavors to develop a self-supervised reconstruction method, referred to as SSJDM, for 3D-MC-CMR. It employs a score-based model with a Bayesian convolutional neural network, obviating the need for fully sampled training data.

METHODS

MRI reconstruction model: Let $$$\boldsymbol Y$$$ and $$$ \boldsymbol X$$$ denote the undersampled k-space data and the corresponding MR image, respectively. The reconstruction model to recover $$$\boldsymbol X$$$ is:
\begin{equation} \begin{gathered}\underset{\boldsymbol{X}}{\operatorname{\arg \min}} \frac{1}{2}\left \| \boldsymbol{AX}-\boldsymbol{Y} \right\|_{F}^{2} +\lambda R(\boldsymbol{X}) \end{gathered} \end{equation}
where $$$\boldsymbol A$$$ represents the encoding operator given by $$$\boldsymbol A=\boldsymbol{MFS}$$$, $$$\boldsymbol M$$$ is the undersampling operator, $$$ \boldsymbol F$$$ is the Fourier transform operator, and $$$\boldsymbol S$$$ denotes the coil sensitivity map, $$$R(\boldsymbol{X})$$$ denotes a combination of regularizations. From a Bayesian perspective, $$$R(\boldsymbol{X})$$$ can be considered as the prior model of the data, denoted as $$$p(\boldsymbol{X})$$$ (4).

Self-supervised Bayesian reconstruction network (BCNN): A secondary sub-sampling mask is applied to the measurements, resulting in $$${\boldsymbol{Y}^{\prime}} = {\boldsymbol{M}^{\prime}} \boldsymbol{Y}$$$. A BCNN can be trained using the sub-sampled data $$${\boldsymbol{Y}^{\prime}}$$$ and the zero-filling image $$${\boldsymbol{X}^{\prime}}$$$ of $$$\boldsymbol{Y}$$$with:
\begin{equation}{\boldsymbol{X}^{\prime}} = \boldsymbol{f}_{\boldsymbol{\theta}}( {\boldsymbol{Y}^{\prime}} ) +\boldsymbol{n}_1 \end{equation}
where $$$\boldsymbol{f_{\theta}}$$$ represents a mapping parameterized by $$$\boldsymbol{\theta}$$$ which follows a distribution $$$p(\boldsymbol{\theta})$$$. Then $$$\boldsymbol{f_{\theta}}$$$ is utilized to convert the measured $$$\boldsymbol Y$$$ to its corresponding image $$$\boldsymbol X$$$:
\begin{equation}\boldsymbol{X}= \boldsymbol{f}_{\boldsymbol{\theta}}(\boldsymbol{Y}) +\boldsymbol{n}_2 \end{equation}
where $$$\boldsymbol{n}_1$$$ and $$$\boldsymbol{n}_2$$$ represent Gaussian noise with scales $$$\gamma_1$$$ and $$$\gamma_2$$$. Assume that $$$p(\boldsymbol{n}_1)$$$ $$$\sim$$$ $$$\prod_{t } \exp \left(\frac{-{n}_t^2}{2 {\gamma_1}^2}\right)$$$, and the prior $$$p(\boldsymbol{\theta})$$$ $$$ \sim$$$ $$$\prod_s \exp \left(\frac{-{\theta}_s^2}{2 {\bar{\sigma}}^2}\right)$$$, an approximate distribution $$$q(\boldsymbol{\theta} )$$$ can be obtained by minimizing the following the Kullback-Leibler divergence:
\begin{equation}\begin{aligned}& \min _{\boldsymbol{\mu_{\theta}}, \boldsymbol{\sigma_{\theta}}} \frac{1}{2{\gamma_1}^2}\mathbb{E}_{q(\boldsymbol{\theta} \mid \boldsymbol{\mu_{\theta}}, \boldsymbol{\sigma_{\theta}})} \sum_{i=1}^N \left\|\boldsymbol{f_{\theta}}({\boldsymbol{X}^{\prime}}_i) -\boldsymbol{Y}_i\right\|_2^2\\&+\frac{1}{2\bar{\sigma}^2} \left(\|\boldsymbol{\mu_{\theta}}\|_2^2+\|\boldsymbol{\sigma_{\theta}}\|_2^2\right)- \sum_s \log \frac{\sigma_{s}}{\bar \sigma} +\text {const.}\end{aligned}\end{equation}
MC image reconstruction via score-based model: Utilizing the above estimated $$$q(\boldsymbol{\theta} )$$$, the score matching technique (5) is employed to approximate $$$\nabla \log_{\boldsymbol X} p(\boldsymbol{X})$$$ using a score-matching network parameterized by $$$\boldsymbol{\phi}$$$ in a way that $$$\boldsymbol{s_\phi}(\boldsymbol{X};\boldsymbol \theta)=\nabla \log q(\boldsymbol{X};\boldsymbol \theta)$$$ by training the joint score function with:
\begin{equation}\frac{1}{2 L} \sum_{i=1}^L \mathbb{E}_{p(\boldsymbol{Y}) q(\boldsymbol{\theta})} \mathbb{E}_{p_{\varepsilon_i}(\tilde{\boldsymbol{X}} | \boldsymbol{Y},\boldsymbol{\theta})}\left [\left\| \varepsilon_i \boldsymbol{s}_{\boldsymbol{\phi}}(\tilde{\boldsymbol{X}}, \varepsilon_i)+\frac{\tilde{\boldsymbol{X}}-\boldsymbol{f}_{\boldsymbol{\theta}}(\boldsymbol{Y})}{\varepsilon_i}\right\| ^2\right]\end{equation}
After the score function is estimated, the Langevin MCMC sampling can be applied to reconstruct MC images via the following formula:
\begin{equation}\begin{aligned}\tilde{\boldsymbol{X}}_{i+1} & =\tilde{\boldsymbol{X}}_i+\frac{\eta_i}{2} \nabla \log p(\boldsymbol{\tilde{X}}_i | \boldsymbol{Y})+\sqrt{\eta_i} \boldsymbol{z}_i \\& =\tilde{\boldsymbol{X}}_i+\frac{\eta_i}{2}(\boldsymbol{s}_\phi(\tilde{{\boldsymbol X}}_i, \varepsilon_i)+\frac{\boldsymbol{A}^H(\boldsymbol{A} \tilde {\boldsymbol{X}}_i-\boldsymbol{Y})}{{\gamma_2}^2+\varepsilon_i^2})+\sqrt{\eta_i} \boldsymbol{z}_i\end{aligned}\end{equation}
where $$$\eta_i$$$ serves as the step size.

Figure 1 shows the proposed SSJDM framework. Figure 2 shows the structure of BCNN. The NCSNv2 framework in (6) was used as the score matching network. We utilized a 3D simultaneous cardiac T1 and T1ρ mapping sequence (Figure 1(c)) on a 3T MR scanner (uMR 790, United Imaging Healthcare, Shanghai, China). The 3D-MC-CMR dataset using this sequence was prospectively undersampled with an acceleration rate of R = 6. The training set consisted of 5796 slices from 46 volunteers, while the test set included 1512 slices from 12 volunteers. To evaluate SSJDM, the test data were retrospectively undersampled using a 2D Poisson-disc sampling pattern with R = 11 and 14. The SSJDM was used to reconstruct the MC images, and T1 and T1ρ maps were estimated using the dictionary matching method (7). The results were compared with those obtained using traditional regularized reconstruction (PROST (8)), a self-supervised DL method (SSDU (9)), and BCNN.

RESULTS and DISCUSSION

Figure 3 shows the reconstructed MC images using the four methods at R = 11. Blurring artifacts of PROST become noticeable in scenarios with higher acceleration rates. BCNN preserves more detailed information than SSDU (indicated by the yellow arrows). Images of SSJDM still exhibit sharp boundaries and high texture fidelity. At an even higher acceleration rate of R = 14 (shown in Figure 4), the image quality of SSJDM degrades a little, while other methods exhibit severe blurring artifacts. Figure 5 shows the T1 and T1ρ maps estimated from reconstructions using different methods at R = 11. Similar conclusions can be drawn.

CONCLUSION

This study shows the feasibility of score-based models for reconstructing 3D-MC-CMR images from highly undersampled data in simultaneous whole-heart T1 and T1ρ mapping.

Acknowledgements

This work is supported in part by the National Natural Science Foundation of China under grant no. 62201561, National Key R&D Program of China under grant no. 2021YFF0501402, and the Guangdong Basic and Applied Basic Research Foundation under grant no. 2021A1515110540.

References

1. Rao S, Tseng SY, Pednekar A, Siddiqui S, Kocaoglu M, Fares M, Lang SM, Kutty S, Christopher AB, Olivieri LJ, Taylor MD, Alsaied T. Myocardial Parametric Mapping by Cardiac Magnetic Resonance Imaging in Pediatric Cardiology and Congenital Heart Disease. Circ Cardiovasc Imaging 2022;15(1):e012242.

2. Christodoulou AG, Shaw JL, Nguyen C, Yang Q, Xie Y, Wang N, Li D. Magnetic resonance multitasking for motion-resolved quantitative cardiovascular imaging. Nat Biomed Eng 2018;2(4):215-226.

3. Velasco C, Cruz G, Lavin B, Hua A, Fotaki A, Botnar RM, Prieto C. Simultaneous T1 , T2 , and T1ρ cardiac magnetic resonance fingerprinting for contrast agent-free myocardial tissue characterization. Magn Reson Med 2022;87(4):1992-2002.

4. Chung H, Ye JC. Score-based diffusion models for accelerated MRI. Medical Image Analysis 2022;80.

5. Song Y, and Stefano E. Generative modeling by estimating gradients of the data distribution. Advances in neural information processing systems 32 (2019).

6. Song Y, and Stefano E. Improved techniques for training score-based generative models. Advances in neural information processing systems 33 (2020): 12438-12448.

7. Ma D, Gulani V, Seiberlich N, Liu K, Sunshine JL, Duerk JL, Griswold MA. Magnetic resonance fingerprinting. Nature 2013;495(7440):187-192.

8. Bustin A, Lima da Cruz G, Jaubert O, Lopez K, Botnar RM, Prieto C. High-dimensionality undersampled patch-based reconstruction (HD-PROST) for accelerated multi-contrast MRI. Magn Reson Med 2019;81(6):3705-3719.

9. Yaman B, Hosseini SAH, Moeller S, Ellermann J, Ugurbil K, Akcakaya M. Self-supervised learning of physics-guided reconstruction neural networks without fully sampled reference data. Magn Reson Med 2020;84(6):3172-3191.

Figures

Figure 1. Illustration of the proposed approach. (a) Flowchart illustrating the self-supervised BCNN for modeling the parameter distribution. BCNN takes undersampled k-space data pairs {Y,Y^'} as input, and produces q(θ|μ_θ,σ_θ) as an approximation. (b) Forward and reverse diffusion processes of the score-based model. (c) Diagram of the pulse sequence and post-processing steps used to estimate T1 and T1ρ maps via the dictionary matching technique.

Figure 2.The network structure of one block of BCNN used in the proposed method. The BCNN network comprises 10 blocks.The operators FFT(·) and IFFT(·) represent the Fourier and inverse Fourier transforms, respectively. P(·)=(I-M^')(·)+ Y^' denotes projection onto Y'.

Figure 3.Reconstructed images using SSJDM, and the zero-filling, PROST, SSDU, BCNN methods with an acceleration rate R = 11. Blurring artifacts of PROST become evident. BCNN preserves more detailed information than SSDU (i.e., the papillary muscle area indicated by the yellow arrows). Images of SSJDM still exhibit sharp boundaries and high texture fidelity.

Figure 4.Reconstructed images using SSJDM, and the zero-filling, PROST, SSDU, BCNN methods with an acceleration rate R = 14. The yellow arrows show the papillary muscle areas in the reconstructed images. Even at a high acceleration rate of R = 14, the image quality of SSJDM degrades a little, while other methods exhibit severe blurring artifacts.

Figure 5.T1 and T1ρ maps estimatd from reconstructed images using the proposed method, and the zero-filling, PROST, SSDU, BCNN methods with acceleration rate R = 11. The maps of SSJDM exhibit sharp boundaries and high texture fidelity.

Proc. Intl. Soc. Mag. Reson. Med. 32 (2024)

4549

DOI: https://doi.org/10.58530/2024/4549