1748

Generating missing M0 in ADNI ASL dataset with latent diffusion models

Qinyang Shou¹, Nan-kuei Chen², Hosung Kim³, and Danny JJ Wang¹
¹Laboratory of Functional MRI Technology (LOFT), Stevens Neuroimaging and Informatics Institute, University of Southern California, Los Angeles, CA, United States, ²Department of Biomedical Engineering, University of Arizona, Phoenix, AZ, United States, ³Laboratory of Neuro Imaging (LONI), Stevens Neuroimaging and Informatics Institute, University of Southern California, Los Angeles, CA, United States

Synopsis

Keywords: AI Diffusion Models, Machine Learning/Artificial Intelligence

Motivation: Alzheimer’s Disease Neuroimaging Initiative (ADNI) ASL dataset acquired on Siemens scanners missed M0 images, which prevents CBF quantification for further analysis.

Goal(s): Our goal is to generate the missing M0 for the ADNI ASL dataset using latent diffusion model (LDM).

Approach: A separate training dataset was acquired with the ADNI ASL protocol but with manually disabled background suppression to be used as the M0. A conditional LDM was trained to use acquired control images as the condition to generate M0 images.

Results: The generated M0 with the conditional LDM shows a high fidelity compared to the experimentally acquired M0.

Impact: With generated M0 images, more than 500 ADNI ASL datasets can be further analyzed for CBF to investigate AD progression.

Introduction

Alzheimer's Disease (AD) is a progressive and devastating neurological condition that affects millions of individuals. Cerebral blood flow (CBF) may provide a biomarker for early detection of AD¹. Arterial spin labeling (ASL) scan can measure CBF without contrast agents or exposure to radiation. ASL has been part of the imaging protocol in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (https://adni.loni.usc.edu/). Although ADNI has collected over 1000 ASL scans, more than 500 scans acquired on Siemens scanners missed M0 images, which is necessary for CBF quantification, making it impossible for investigating CBF changes during AD progression. The diffusion model² is a type of generative model that can generate images from random noise. Recently latent diffusion model (LDM)³ has been proposed to enable a flexible conditional generation while saving computational costs by applying diffusion models in the latent space instead of operating directly in the image space. In this study, we developed LDMs to generate M0 images, which may potentially enable CBF analysis using ADNI dataset.

Methods

Data acquisition of the control and M0 images
Figure 1 shows the sequence diagram of the ADNI pulsed ASL (PASL) and M0 acquisition in our experiment. Control/label images of PASL were acquired using the ADNI ASL protocol. For M0, background suppression pulses were manually disabled, and TI was extended to 5 seconds for full magnetization recovery. ASL and M0 from 36 subjects (70±11 years, 11 males) were collected using these 2 sequences on Siemens 3T Prisma scanners at two sites.

Conditional latent diffusion model
Latent diffusion model uses two steps for image generation. The images were first compressed into the latent space and the diffusion process was done in the latent space. Then the compressed image went through a decoder to generate the final image. The framework of this study is shown in Figure 2. In the forward process, M0 images went through 1000 diffusion steps, and in the reverse process, the corresponding control image was used as a condition to the network in each denoising step to generate the final M0 image.
We trained three models in total. First, a conditional LDM that generates M0 image conditioned on control images (‘M0 gen’); Second, a discriminative Resnet⁴ model that maps from control images to M0 images (‘direct mapping’); Third, a conditional LDM that generates CBF maps conditioned on both control and perfusion images (‘CBF gen’). The models were trained on 2D slices of 31 scans and tested on 5 scans from both sites. For generated M0 images, CBF maps were calculated with the acquired perfusion images (control - label) along with the generated M0 images.

Performance evaluation
To quantitatively compare the performance, we calculated the similarity (NMSE, PSNR and SSIM) between the generated CBF maps and the ground truth. We also calculated the bias in CBF values in whole brain, GM and WM masks.

Results

Figure 3 displays the generated M0 and CBF images with M0 gen and ground truth images. It shows that the generated M0 and CBF images by LDM has a high fidelity. Figure 4 shows the quantitative comparison of different models. It shows that the CBF maps generated from the M0 gen method has the highest similarity, with a comparably small bias as the CBF gen model. Figure 5 shows the uncertainty estimation of M0 reproduced by M0 gen. Since the generation varies for each sampling process, we generated 100 samples and calculated mean and standard deviation of these samples. The left panel shows that most variance occurs in CSF and is relatively stable elsewhere. The right panel shows that with more samples, the average generation becomes closer to the ground truth, though 20 samples already produce reasonable outcome.

Discussion

In this study, we developed conditional latent diffusion models to generate M0 and subsequently quantitative CBF maps from the acquired control and/or perfusion images. Compared to the M0 gen method, CBF gen has a larger variance, likely because perfusion images have higher variability, making the model less stable. Compared to direct mapping, generative model produces a better result, probably because control images have low intensity and may include residual fat signals that may reduce the performance more in discriminative models than generative models. Future study and analysis are warranted.

Conclusion

In this study, we developed conditional LDMs to generate M0 and subsequently CBF from control and/or perfusion images. This can be applied to compensate for missing M0 images in many ADNI ASL datasets, enabling quantitative CBF analyses during the progression of AD.

Acknowledgements

This work was supported by US NIH grants R01-EB032169 and R01-EB028297.

References

[1] H. Zhang et al., “Cerebral blood flow in mild cognitive impairment and Alzheimer’s disease: A systematic review and meta-analysis,” Ageing Research Reviews, vol. 71, p. 101450, Nov. 2021, doi: 10.1016/j.arr.2021.101450.

[2] J. Ho, A. Jain, and P. Abbeel, “Denoising Diffusion Probabilistic Models,” arXiv.org. Accessed: May 16, 2023. [Online]. Available: https://arxiv.org/abs/2006.11239v2

[3] R. Rombach, A. Blattmann, D. Lorenz, P. Esser, and B. Ommer, “High-Resolution Image Synthesis with Latent Diffusion Models.” arXiv, Apr. 13, 2022. doi: 10.48550/arXiv.2112.10752.

[4] B. Lim, S. Son, H. Kim, S. Nah, and K. M. Lee, “Enhanced Deep Residual Networks for Single Image Super-Resolution.” arXiv, Jul. 10, 2017. Accessed: Mar. 19, 2023. [Online]. Available: http://arxiv.org/abs/1707.02921

Figures

Figure 1. Sequence diagram for the ADNI ASL protocol. (a) The protocol of ADNI uses a pulsed ASL sequence with 2 background suppression pulses. (b) Surrogate M0 data are acquired after manually disabling the two background suppression pulses and extending the TI to 5s.

Figure 2. The framework of the conditional generation of the M0 image with the control image as the condition through concatenation with the inputs.

Figure 3. A representative case of ground truth M0 image, generated reproduced M0 image by LDM (M0 Gen), the ground truth CBF map and the calculated CBF map from the generated reproduced M0 image and the actual perfusion images.

Figure 4. Uncertainty analysis of the generated M0. Left panel: Ground truth M0, average of 100 generated M0 samples, and the standard deviation of the 100 generated M0 samples. Right panel: SSIM and PSNR with number of average of samples with the ground truth.

Figure 5. Similarity metrics (NMSE, SSIM and PSNR) of the generated CBF maps and the ground truth, and the difference of the whole brain, gray matter and white matter CBF difference between the generated CBF map and the ground truth.

Proc. Intl. Soc. Mag. Reson. Med. 32 (2024)

1748

DOI: https://doi.org/10.58530/2024/1748