1203

BlindHarmony: Blind harmonization for multi-site MR image processing via unconditional flow model

Hwihun Jeong¹, Heejoon Byun¹, and Jongho Lee¹
¹Department of electrical and computer engineering, Seoul national university, Seoul, Korea, Republic of

Synopsis

Keywords: Analysis/Processing, Reproductive

Motivation: Conventional deep learning-based harmonization cannot handle the unseen source domain image when there is no large-size data.

Goal(s): We propose blind harmonization, which requires only target domain data during training and generalizes well on unseen source domain data.

Approach: BlindHarmony utilizes an unconditional flow model to measure the probability of the target domain image and find a harmonized image that is structurally close to this source domain image but has a high probability in the target domain.

Results: BlindHarmony successfully harmonized the source domain image to the target domain and improved the performance of downstream tasks for the data with a domain gap.

Impact: Deep learning-based harmonization typically necessitates both source and target domain data, limiting its widespread applicability. This study eliminates the need for source domain data and exhibits robust generalization to new source domain data, thereby expanding the utility of harmonization.

Introduction

The variations in MR images across different hardware, sequences, or scan parameters create a domain gap that needs to be bridged by a step called image harmonization. This is essential to facilitate the effective utilization of conventional or deep learning-based image analysis techniques, such as segmentation. Multiple methods, including a deep learning-based approach, have been proposed for achieving this image harmonization.^1-4 Nevertheless, these approaches frequently require multiple datasets for deep learning training and may still encounter challenges when applied to images from previously unseen domains. To mitigate this limitation, we introduced an approach known as "Blind Harmonization".⁵ This approach relies solely on the target domain data for training but retains the ability to harmonize images from previously unseen domains. As an implementation of this concept, we have developed BlindHarmony, leveraging an unconditional flow model.⁶

Methods

[BlindHarmony] The main idea of blind harmonization is to generate an image that is structurally close to the source domain image but has a high probability in the target domain. Firstly, we define image distance for quantifying structural closeness. Concerning the source domain image and corresponding harmonized image, we can see the two relationships: two images are highly correlated and the edges of the two images coincide. Based on this fact, we can define a distance between the source domain image ($$$x_s$$$) and its harmonized image ($$$x_h$$$):$$D\left(x_h,x_s\right)=\beta_1\{1 - NCC\left(x_h,x_s\right)\}+\beta_2 \| M G x_h \|_1,\qquad[Eq.1]$$ where $$$NCC$$$ denotes normalized crosscorrelation and $$$\| \|$$$ denotes the L1 norm. $$$M$$$ is a mask obtained by thresholding the gradient value of $$$x_s$$$, which retains the non-edge regions, and $$$G$$$ represents the gradient operator. $$$\beta$$$s are hyperparameters. Secondly, in order to measure the probability of the target domain, we utilize an unconditional flow model. If a flow model $$$f_\theta$$$ is trained only with the target domain images, the probability of the target domain image can be parameterized by a simple Gaussian latent vector ($$$z=f_\theta(x)$$$). Using these settings, we can formulate an optimization equation for blind harmonization as follows: $$\widehat{x_h}= \arg\min_x{D\left(x,x_s\right) + \alpha |f_\theta(x)|^2}.\qquad[Eq.2]$$ Direct optimization of Eq. 2 requires the calculation of the gradient of $$$f_\theta$$$ which is computationally demanding. Instead, we employ iterative optimization both in image and latent vector spaces. In a latent vector space, $$$z$$$ is updated so that it does not deviate from the center of the Gaussian. In an image space, the gradient descent of $$$D(x,x_s)$$$ is calculated and updated. After an N iteration, the resultant image would be a harmonized image. [Details and evaluation] T1-weighted images in the OASIS3⁷ dataset were used for train and evaluation. OASIS3 contains multisite data and images acquired with the Siemens TrioTim scanner were used as the target domain. The neural spline flow (NSF) model⁸ was used for flow model training. To evaluate the performance of the BlindHarmony, twenty traveling subjects with four source domains were utilized. We applied the BlindHarmony to the source domain images and calculated the PSNR and SSIM between the harmonized image and target domain image. Moreover, to measure the functional utility of the proposed method, we tackle the downstream task of white matter segmentation. The white matter segmentation network is trained on the target domain data, so it can describe the effectiveness of harmonization from source to target domain. As a comparison, histogram matching (HM), low-frequency replacing (SSIMH),⁹ CycleGAN-based style transfer, and supervised training with U-net are adopted. It should be noted that the CycleGAN and U-net are trained on each source-target pair.

Results

Figure 3 displays the results of harmonizing source domain images (column 1) to target domain images (column 2) using BlindHarmony (column 3). BlindHarmony effectively reduced inter-scanner variability and aligned the images with the target domain. Quantitative evaluation (Fig. 3b) via PSNR and SSIM metrics demonstrated improved values (averaged PSNR: 21.5 dB to 22.2 dB) compared to the source images. It is worth noting that U-net outperformed BlindHarmony, but they were trained separately for each source domain. Figure 4 presents white matter segmentation results for each harmonization method. Remarkably, BlindHarmony enhanced segmentation performance, harmonizing source domain images with the target domain effectively. The IoU values (Fig. 4b) were also higher for BlindHarmony, further confirming its superior harmonization performance.

Conclusion and Discussion

In this study, we propose BlindHarmony, a flow-based blind harmonization method for MR images. Unlike other existing harmonization methods, our network is trained exclusively on the target domain dataset and can be applied to previously unseen domain images. Our study demonstrates the feasibility of blind harmonization, providing an advantage in scenarios where access to source domain data is limited or unavailable.

Acknowledgements

This work was supported by the Korea Agency for Infrastructure Technology Advancement(KAIA) grant funded by the Ministry of Land, Infrastructure and Transport (Grant 21NPSS-C163415-01), Samsung Electronics Co., Ltd (IO201216-08215-01), and Institute of New Media and Communications (INMC), SNU.

References

1. Dewey, Blake E., et al. "DeepHarmony: A deep learning approach to contrast harmonization across scanner changes." Magnetic resonance imaging 64 (2019): 160-170.

2. Liu, Mengting, et al. "Style transfer using generative adversarial networks for multi-site mri harmonization." Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part III 24. Springer International Publishing, 2021.

3. Modanwal, Gourav, et al. "MRI image harmonization using cycle-consistent generative adversarial network." Medical Imaging 2020: Computer-Aided Diagnosis. Vol. 11314. SPIE, 2020.

4. Zuo, Lianrui, et al. "Unsupervised MR harmonization by learning disentangled representations using information bottleneck theory." NeuroImage 243 (2021): 118569.

5. Jeong, Hwihun, et al. "BlindHarmony:" Blind" Harmonization for MR Images via Flow Model." Proceedings of the IEEE/CVF International Conference on Computer Vision. 2023.

6. Dinh, Laurent, David Krueger, and Yoshua Bengio. "Nice: Non-linear independent components estimation." arXiv preprint arXiv:1410.8516 (2014).

7. LaMontagne, Pamela J., et al. "OASIS-3: longitudinal neuroimaging, clinical, and cognitive dataset for normal aging and Alzheimer disease." MedRxiv (2019): 2019-12.

8. Durkan, Conor, et al. "Neural spline flows." Advances in neural information processing systems 32 (2019).

9. Guan, Hao, et al. "Fast image-level MRI harmonization via spectrum analysis." International Workshop on Machine Learning in Medical Imaging. Cham: Springer Nature Switzerland, 2022.

Figures

Figure 1 Blind harmonization presents an advantage over conventional harmonization models. While traditional models often necessitate multiple datasets during training or show reduced performance on unseen domains, blind harmonization can be trained solely with target domain data and generalized to previously unseen source domains.

Figure 2 The BlindHarmony framework functions through the following process: initially, a flow model undergoes training solely on target domain data. Subsequently, harmonization is executed iteratively, encompassing both latent variable and image domains, utilizing the trained flow model.

Figure 3 (a) An illustrative image exemplifying the application of BlindHarmony to traveling subjects is presented alongside a comparative analysis with other harmonization techniques. (b) Quantitative metrics are showcased, with PSNR and SSIM values computed with reference to the target domain image, while using signal regions as masks. Notably, the utilization of BlindHarmony demonstrates enhanced consistency with the target domain image. Additionally, CycleGAN and U-net results represent the network outputs trained individually for each source domain.

Figure 4 (a) The outcomes of the white matter segmentation network are displayed, with each column representing the application of the respective harmonization method to the input image when the white matter segmentation network is trained on the target domain data. (b) Intersection over Union (IoU) values, depicting the degree of overlap between the segmentation network results and the label mask, are reported. Notably, the use of BlindHarmony shows superior results when compared to other harmonization methods, demonstrating its effectiveness.

Proc. Intl. Soc. Mag. Reson. Med. 32 (2024)

1203

DOI: https://doi.org/10.58530/2024/1203