0161

MRFlow: Flow-based neural network for MR image harmonization

Hwihun Jeong¹, Dong Un Kang¹, Jiye Kim¹, and Jongho Lee¹
¹Department of electrical and computer engineering, Seoul national university, Seoul, Korea, Republic of

Synopsis

Keywords: Machine Learning/Artificial Intelligence, Machine Learning/Artificial Intelligence

We propose MRFlow, which is a normalizing flow-based neural network for the MRI harmonization framework. With the normalizing flow trained only with the target domain (e.g., 3T image) data, we harmonize the image from the source domain (e.g., 1.5T image) to the target domain by alternately reducing the norm of the latent variable and increasing similarity between the source domain and harmonized images. When MRFlow is applied to synthesized source domain images, the harmonized images showed lower errors than the source domain images. In the prospective study, the harmonized images became more similar to the target domain images after MRFlow.

Introduction

Although deep learning-based MR image methodologies have been widely developed,^1,2 variations of MR images across vendors, scanners, and scan parameters have raised a generalization issue of deep learning.³ To overcome such generalization challenges, harmonization methods that match the image of the source domain (e.g., 1.5T image) to the target domain (e.g., 3T image) have been developed by utilizing end-to-end UNet⁴ or CycleGAN-based style transfer.^5,6 Despite noticeable performances, however, these methods have a limitation in that they have limited accuracy when applied to an unseen domain. Recently, a neural network named the normalizing flow was introduced and showed great performance in diverse computer vision tasks.^7,8 Normalizing flow can learn the prior of image distribution with the invertible network. In this study, we utilized this normalizing flow to harmonize MR images from a domain that has not been trained in the model.

Methods

[Normalizing flow] The goal of the normalizing flow is to learn an invertible mapping $$$f$$$ between latent variable $$$z$$$ and image $$$x$$$ parameterized with $$$\theta$$$. Here, the latent variable $$$z$$$ follows the simple probability distribution $$$p_Z$$$ (i.e., Gaussian distribution):$$z=f_\theta(x),{z}\sim{p_Z}\qquad(Eq. 1)$$Then the distribution of image can be formulated as:$$p_X(x)=p_Z(f_\theta(x))\left|det(\frac{\partial f_\theta(x)}{\partial x})\right|\qquad(Eq.2)$$Training can be conducted to minimize negative log-likelihood: $$$\mathcal{L}(\theta;x)=-log(p_X(x))=-log(p_Z(f_\theta(x))-log(\left|det(\frac{\partial f_\theta(x)}{\partial x})\right|)$$$. Image $$$x$$$ can be sampled by applying the inverse network: $$$x=f_\theta^{-1}(z),{z}\sim{p_Z}$$$.
[MRFlow] If the normalizing flow is trained only with the target domain, (Fig. 1; Step 1) the normalizing flow has learned a prior distribution of the target domain. The main idea for harmonization is to optimize the latent variable $$$z$$$ so that the optimized latent variable $$$\hat{z}$$$ is around the center of the Gaussian, and at the same time, the sampled image is not far from the image of the source domain. Because the normalizing flow has a prior of the target domain, the sampled image of the optimized latent variable $$$f_\theta^{-1}(z)$$$ would be the image harmonized from source to target. It can be formulated as an optimization equation:$${\hat{z}=\underset{z}{\mathrm{argmin}}\,\left|z\right|^2+\beta_1M\left|Gf_\theta^{-1}(z)\right|+\beta_2NCC(f_\theta^{-1}(z),x_s),}\\x_{harmonized}=f_\theta^{-1}(\hat{z}),\qquad(Eq. 3)$$where $$$\beta_1$$$ and $$$\beta_2$$$ are the regularization parameter, $$$x_s$$$ is the source domain image, $$$M$$$ is a mask where $$$x_s$$$ does not have any edges, $$$G$$$ is the gradient operator, and $$$NCC$$$ is normalized cross-correlation. The first term is to force the latent variable $$$z$$$ to be around the center of the Gaussian. The second term is for the edge coincidence of source and harmonized images, and the third term is for the similar trend of voxel values of source and harmonized images. The optimization of Eq. 3 is obviously not analytic, so we introduce an iterative method by taking advantage of the invertibility of the normalizing flow (Fig. 1; Step 2). $$\begin{align}&1.\;x_0=x_s,\;\alpha<<1\\&for\;n=0:N\;repeat\;2.\;and\;3.\\&2.\;z_n=(1-\alpha)f_\theta(x_n)\\&3.\;x_{n+1}=f(z_n)-\beta_1\triangledown_xM\left|Gx\right|_{x=f_\theta^{-1}(z_n)}-\beta_2\triangledown_xNCC(x,x_s)_{x=f_\theta^{-1}(z_n)}\end{align}$$
[Train and validation details] The SRFlow model⁶ was adopted as a network structure of the normalizing flow, with the substitution of the RRDB module to UNet. In order to improve the performance of the normalizing flow model, edge information of the input image was given as a condition, assuming the edge should be consistent across domains. The ADNI 2 dataset (3T image)⁹ was utilized as a target domain for the training normalizing flow. For a retrospective evaluation, source domain images were synthesized by manipulating the target domain image (e.g., gamma transformation and multiplication/summation with the low-frequency term; Fig. 2a). MRFlow was applied to the images of the generated source domain, and peak signal-to-noise ratio (PSNR), structural similarity (SSIM), and mean absolute error (MAE) were calculated. A prospective evaluation was also conducted using the ADNI 1 dataset (1.5T image) as a source domain. Histogram analysis was done for measuring the similarity between domains.

Results

The results of retrospective experiments with synthesized source domains are illustrated in Figure 2b and Figure 3. For all synthesized source domains, the harmonized images using MRFlow showed lower overall errors than source domain images with the reference of target domain images (PSNR: 25.28 ± 3.13 dB → 26.04 ± 2.96 dB), but also demonstrated slightly decreased SSIM in some source domain cases. In Figure 4, the MRFlow harmonization results from the source domain (ADNI 1) to the target domain (ADNI 2) are displayed. The harmonized images show similar contrast and noise level to the target domain images, despite blurring and small artifacts. The histogram of harmonized images is closer to that of the target domain when compared to the source domain histogram (Fig 4b).

Conclusion and Discussion

In this study, we propose MRFlow, which is a preliminary study of the normalizing flow-based MR image harmonization method. Compared to other deep learning-based harmonization methods, our method has the advantage that it can be applied to the domain unseen during training. Still, MRFlow cannot be applied to a situation with large domain gaps, such as harmonization between multi-contrast, but this study demonstrates that harmonization is possible with a generative network trained only with the target domain, without requiring source domain information during training.

Acknowledgements

This work was supported by the BK21 FOUR program of the Education and Research Program for Future ICT Pioneers, Seoul National University and the Korea Agency for Infrastructure Technology Advancement(KAIA) grant funded by the Ministry of Land, Infrastructure and Transport (Grant 21NPSS-C163415-01)

References

1. Lundervold, Alexander Selvikvåg, and Arvid Lundervold. "An overview of deep learning in medical imaging focusing on MRI." Zeitschrift für Medizinische Physik 29.2 (2019): 102-127.

2. Liu, Jin, et al. "Applications of deep learning to MRI images: A survey." Big Data Mining and Analytics 1.1 (2018): 1-18.

3. Jung, Woojin, Steffen Bollmann, and Jongho Lee. "Overview of quantitative susceptibility mapping using deep learning: Current status, challenges and opportunities." NMR in Biomedicine 35.4 (2022): e4292

4. Dewey, Blake E., et al. "DeepHarmony: A deep learning approach to contrast harmonization across scanner changes." Magnetic resonance imaging 64 (2019): 160-170.

5. Liu, Mengting, et al. "Style transfer using generative adversarial networks for multi-site mri harmonization." International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, Cham, 2021.

6. Modanwal, Gourav, et al. "MRI image harmonization using cycle-consistent generative adversarial network." Medical Imaging 2020: Computer-Aided Diagnosis. Vol. 11314. SPIE, 2020.

7. Lugmayr, Andreas, et al. "Srflow: Learning the super-resolution space with normalizing flow." European conference on computer vision. Springer, Cham, 2020.

8. Horvat, Christian, and Jean-Pascal Pfister. "Denoising normalizing flow." Advances in Neural Information Processing Systems 34 (2021): 9099-9111

9. Jack Jr, Clifford R., et al. "The Alzheimer's disease neuroimaging initiative (ADNI): MRI methods." Journal of Magnetic Resonance Imaging: An Official Journal of the International Society for Magnetic Resonance in Medicine 27.4 (2008): 685-691.

Figures

Figure 1 The overall description of the MRFlow. At first, the normalizing flow is trained with target domain data only, giving the prior of the target domain. Using a trained normalizing flow, harmonization from the source domain to the target domain is done with iterative optimization. In the latent variable z, we reduce the length of z by (1-α), which makes z closer to the center of the Gaussian. For the sampled image, gradient descent with respect to the similarity with the source domain image is conducted to regularize that the image is not too far from the source domain image.

Figure 2 The description and metrics of retrospective experiments. (a) Source domain images were synthesized by manipulating images such as Gamma transform with γ of 2 (Domain A) and 1.5 (Domain B), summation with a low-frequency term (Domain C), and multiplication with the low-frequency term (Domain D). (b) A table of PSNR, SSIM, and MAE between the source domain image and target domain image (Before) as well as harmonized image and target domain image (After) is shown. The PSNR values became higher after MRFlow, but the SSIM values of Domain C and D became lower after harmonization.

Figure 3 Results of MRFlow application to synthetic source domains. (a) Source domain images, (b) target domain images, (c) harmonized images, difference map between source domain images and target domain images (1st column), and difference map between harmonized image and target domain images (5th column) are illustrated. The PSNR values are denoted at the bottom of the difference maps.

Figure 4 The results of the application of MRFlow to the ADNI 1 dataset as a target domain of the ADNI 2 dataset. (a) The harmonized images show consistency with the ADNI 2 dataset image in terms of noise and contrast. (b) A histogram of the source domain (ADNI 1), target domain (ADNI 2), and MRFlow harmonized image are shown. The histogram became more similar to the target domain after the MRFlow harmonization.

Proc. Intl. Soc. Mag. Reson. Med. 31 (2023)

0161

DOI: https://doi.org/10.58530/2023/0161