1747

Patch-based Conditioned Denoising Diffusion Probabilistic (PC-DDPM) for Magnetic Resonance Imaging Reconstruction

Mengting Huang^1,2, Thanh Nguyen-Duc^1,3, Martin Soellradl^1,3, Daniel Schmidt², and Roland Bammer^1,3
¹Department of Radiology, Monash University, Melbourne, Australia, ²Department of Data Science and Artificial Intelligence, Faculty of IT, Monash University, Melbourne, Australia, ³Department of Diagnostic Imaging, Monash Health, Melbourne, Australia

Synopsis

Keywords: AI Diffusion Models, Image Reconstruction

Motivation: Although MRI is a powerful medical imaging technique, its utility is limited by its prolonged scan time. Since deep learning methods for reconstructing undersampled MRI haven't achieved rapid and reliable high-resolution results, we investigated diffusion models as a potential solution.

Goal(s): Improve MRI reconstruction while accelerating the inference process with diffusion model.

Approach: Proposed Patch-based Conditioned Denoising Diffusion Probabilistic Model (PC-DDPM) that predicts the distribution of clean images from noisy input by conditioning on noisy patches.

Results: Experiment result shows PC-DDPM outperforms U-Net, Vision Transformer, and conditioned DDPM by better reconstruction performance and shorter inference time.

Impact: The implementation of patch-based conditioned DDPM for MRI reconstruction can speed up reconstruction and image acquisition while preserving the image quality. Patients could benefit from shorter scan time, and medical facilities could increase the patient throughput.

Introduction

MRI serves as a crucial test in diagnostic imaging. Nonetheless, the prolonged scan time of MRI compels a compromise between the speed of acquisition and the quality of the reconstructed images. Deep learning models, such as U-nets^1–3 and Vision Transformers^4,5, have been proposed to improve MR image reconstruction. Recently, Denoising Diffusion Probabilistic Models (DDPM)⁶ have demonstrated remarkable performance for high-quality image reconstruction. However, the original DDPM method is computationally intensive, especially with high-resolution inputs, and fraught by slow inference times. In this work, we propose a novel patch-based conditioned Denoising Diffusion Probabilistic model that aims at optimizing computational efficiency and accelerating the inference process.

Methods

Patch-based Conditioned Denoising Diffusion Probabilistic (PC-DDPM) enables faster reverse processing than standard DDPM. This patch-based model also uses a guided denoising process for size-independent reconstruction during inference.

Denoising Diffusion Probabilistic Model (DDPM):
Given a sample $$$\mathbf{x}_\mathbf{0}$$$ from the real data distribution $$$q(\mathbf{x})$$$, the forward process simulates the diffusion process by gradually adding noise to the data until pure Gaussian noise is reached: $$q\left(\mathbf{x}_{\mathbf{1}:\mathbf{T}}\middle|\mathbf{x}_\mathbf{0}\right)=\prod_{t=1}^{T}{q\left(\mathbf{x}_\mathbf{t}\middle|\mathbf{x}_{\mathbf{t}-\mathbf{1}}\right)},$$ where $$$T$$$ is the total number of steps.
The inference process (e.g., reverse process) approximates the reverse distribution $$$q\left(\mathbf{x}_{t-1}\middle|\mathbf{x}_t\right)$$$ with the parameterized model $$$p_\theta$$$ through each time step $$$t$$$: $$p_\theta\left(\mathbf{x}_{0:T}\right)=p\left(\mathbf{x}_T\right)\prod_{t=1}^{T}\, p_\theta\left(\mathbf{x}_{t-1}\mid\mathbf{x}_t\right).$$

Conditioned DDPM:
To accurately represent complex distributions, conditioned diffusion models⁷ incorporate auxiliary conditions into the generating process. Different to DDPM, the reverse process of a Conditioned DDPM becomes: $$p_\theta\left(\mathbf{x}_0,\mathbf{x}_1,\ldots,\mathbf{x}_T\mid\mathbf{x}_c\right):=p\left(\mathbf{x}_T\right)\prod_{t=1}^{T}\, p_\theta\left(\mathbf{x}_{t-1}\mid\mathbf{x}_t,\mathbf{x}_c\ \right),$$ where $$$\mathbf{x}_c$$$ is the auxiliary condition, and the noise distribution estimate becomes: $${\epsilon_\theta\left(\mathbf{x}_t,t\right)\ \rightarrow\epsilon}_\theta\left(\mathbf{x}_t,\mathbf{x}_c,t\right).$$
However, the reverse diffusion process of generating a high-resolution sample from DDPM through the Markov chain can be considerably slow, given that $$$T$$$ can range from one to many thousands.

Patched-based Conditioned DDPM (PC-DDPM):
Inspired by the patched-based Denoising Diffusion model⁸, we propose a class of non-Markovian diffusion processes built from the seminal work of Song et al⁹.
For high-resolution MRI image reconstruction, our patch-based technique allows size-independent image reconstruction and faster reverse processing process than typical DDPM. A guided denoising approach that applies smoothed noise estimates across overlapping patches during inference can improve image restoration quality and efficiency.
Given a patch is sampled from the fully sampled data distribution $$$q(\mathbf{x}_\mathbf{0}^i)$$$ and with corresponding undersampled MRI as condition $$$\mathbf{x}_\mathbf{c}^i$$$. The transition probability for the patch-based conditional diffusion can be formulated as: $$p_\theta\left(\mathbf{x}_{t-1}^i\mid\mathbf{x}_t^i,\ \mathbf{x}_c^i\right)=\mathcal{N}\left(\mathbf{x}_{t-1}^i;\mathbf{\mu}_\theta\left(\mathbf{x}_t^i,\ \mathbf{x}_c^i,t\right),\mathbf{\Sigma}_\theta\left(\mathbf{x}_t^i,\ \mathbf{x}_c^i,t\right)\right).$$
The objective of the model is to minimize the difference between the expected noise during the diffusion process and the one generated by the model parameterized by $$$\theta$$$: $$\mathbf{E}_{x_0,t,\epsilon_t\sim N(0,I)}\left[\left|\epsilon_t-\epsilon_\theta\left(\mathbf{x}_t^i,\ \mathbf{x}_c^i,t\right)\right|^2\right].$$
At each time step, the model progressively refines the image patch: $$\mathbf{x}_{t-1}^i=\sqrt{{\bar{\alpha}}_{t-1}}\left(\frac{\mathbf{x}_t^i-\sqrt{1-{\bar{\alpha}}_t}\epsilon_\theta\left(\mathbf{x}_t^i,\ \mathbf{x}_c^i,t\right)}{\sqrt{{\bar{\alpha}}_t}}\right) +\sqrt{1-\alpha_{t-1}}\epsilon_\theta\left(\mathbf{x}_t^i,\ \mathbf{x}_c^i,t\right).$$
Implicit sampling allows accelerating the sampling process by only requiring a subset of the timesteps $$$\tau_i$$$ from complete sequence $$${1,...,T}$$$ where $$$i<T$$$.

Experiments:
We performed experiments using fastMRI¹⁰ brain MRI dataset from 30 patients (1,000 training, 200 testing), undersampled at 4x and 8x speeds with center acquisition rates of 0.08 and 0.04, respectively. Multi-coil images were combined using sum of squares. Then, for each subject, the combined images (320x320 pixels) were divided into 16 patches of 64x64 pixels each. Our model was trained with the principal diffusion loss, followed the original implementation of DDPM¹¹ that consists of ResNet blocks, group normalization and self-attention.
To evaluate the quality of our approach, we measured reconstructed images using the PSNR and SSIM metrics. The proposed method was compared against several baseline reconstruction methods, including zero-filled, standard U-Net, Vision Transformer, and conditioned DDPM.

Result

Table 1 shows that the proposed patch-based conditioned DDPM (PC-DDPM) outperforms other comparison methods on PSNR and SSIM. For brain MRI, PC-DDPM demonstrates superior reconstruction performance and outperforms other methods, achieving top SSIM (0.817 and 0.796) and PSNR (31.04 and 30.85) scores at both x4 and x8 acceleration rates, respectively.

Figure 1 indicates the difference maps between the reconstructed image and the fully-sampled image. Our proposed PC-DDPM shows fewer errors than other reconstruction models. The PC-DDPM model demonstrated five times better efficiency, reconstructing MRI images in just 2 seconds, compared to the 10 seconds in the conditioned DDPM.

Discussion

We proposed a PC-DDPM model which outperforms other reconstruction methods, such as U-Net, Vision Transformer, and conditioned DDPM. PC-DDPM produces high-resolution reconstruction results at different acceleration rates (x 4 and x8). This is due to the inherent properties of diffusion models that seek to sample from the true distribution and refine the image over time. Together with patch-based input and non-Markovian diffusion processes, our PC-DDPM approach demonstrates the potential of reconstructing high-fidelity MRI images in a short time.

Acknowledgements

No acknowledgement found.

References

1. Lee D, Yoo J, Ye JC. Deep residual learning for compressed sensing MRI. In: 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017). IEEE; 2017:15-18. doi:10.1109/ISBI.2017.7950457

2. Hyun CM, Kim HP, Lee SM, Lee S, Seo JK. Deep learning for undersampled MRI reconstruction. Phys Med Biol. 2018;63(13):135007. doi:10.1088/1361-6560/aac71a

3. Han YS, Yoo J, Ye JC. Deep Learning with Domain Adaptation for Accelerated Projection-Reconstruction MR. Published online January 8, 2018. Accessed August 13, 2023. http://arxiv.org/abs/1703.01135

4. Feng CM, Yan Y, Fu H, Chen L, Xu Y. Task Transformer Network for Joint MRI Reconstruction and Super-Resolution. In: de Bruijne M, Cattin PC, Cotin S, et al., eds. Medical Image Computing and Computer Assisted Intervention – MICCAI 2021. Lecture Notes in Computer Science. Springer International Publishing; 2021:307-317. doi:10.1007/978-3-030-87231-1_30

5. Lin K, Heckel R. Vision Transformers Enable Fast and Robust Accelerated MRI. In: Proceedings of The 5th International Conference on Medical Imaging with Deep Learning. PMLR; 2022:774-795. Accessed November 8, 2023. https://proceedings.mlr.press/v172/lin22a.html

6. Ho J, Jain A, Abbeel P. Denoising Diffusion Probabilistic Models. In: Advances in Neural Information Processing Systems. Vol 33. Curran Associates, Inc.; 2020:6840-6851. Accessed November 8, 2023. https://proceedings.neurips.cc/paper/2020/hash/4c5bcfec8584af0d967f1ab10179ca4b-Abstract.html

7. Choi J, Kim S, Jeong Y, Gwon Y, Yoon S. ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models. Published online September 15, 2021. Accessed November 8, 2023. http://arxiv.org/abs/2108.02938

8. Özdenizci O, Legenstein R. Restoring Vision in Adverse Weather Conditions with Patch-Based Denoising Diffusion Models. Published online November 21, 2022. Accessed August 22, 2023. http://arxiv.org/abs/2207.14626

9. Song J, Meng C, Ermon S. Denoising Diffusion Implicit Models. Published online October 5, 2022. Accessed November 8, 2023. http://arxiv.org/abs/2010.02502

10. Zbontar J, Knoll F, Sriram A, et al. fastMRI: An Open Dataset and Benchmarks for Accelerated MRI. Published online December 11, 2019. Accessed August 13, 2023. http://arxiv.org/abs/1811.08839

11. Ho J, Jain A, Abbeel P. Denoising Diffusion Probabilistic Models. Published online December 16, 2020. Accessed November 8, 2023. http://arxiv.org/abs/2006.11239

Figures

Figure 1: Performance comparison of different reconstruction methods at acceleration rates of 4 (first row) and 8 (third row) on a T2-weighted FLAIR MRI brain scan: (a) ground-truth; (b) zero-filled reconstruction; (c) U-Net; (d) Vision Transformer; (e) proposed PC-DDPM. The second and fourth row shows the undersampled k-space and the corresponding enhanced difference maps for each reconstruction method, emphasizing the deviations from the ground truth. The red box is the region of interest.

Figure 2: Illustration of the proposed image reconstruction methods: (a) Forward and reverse diffusion processes in the patch-based conditioned diffusion model: The forward process perturbs the MRI images towards a noise distribution by adding Gaussian noise. The reverse process removes noise at the patch level and reconstructing the image. (b) Image reconstruction pipeline: From fully sampled MRI data, a zero-filled solution is computed by applying the adjoint operator $$$A^*$$$, which performs Fourier transformations followed by the pre-defined sampling pattern.

Table 1: The table compares the reconstruction performance on brain MRI reconstructions of our patch-based conditioned DDPM (PC-DDPM) against U-Net, Vision Transformer, and conditioned diffusion models.

Proc. Intl. Soc. Mag. Reson. Med. 32 (2024)

1747

DOI: https://doi.org/10.58530/2024/1747