3459

Unsupervised Domain Adaptation for Neural Network Enhanced Turbo Spin Echo Imaging

Zechen Zhou¹, Peter Börnert², and Chun Yuan³
¹Philips Research North America, Seattle, WA, United States, ²Philips Research Hamburg, Hamburg, Germany, ³Vascular Imaging Lab, University of Washington, Seattle, WA, United States

Synopsis

Supervised learning is widely used for deep learning based image quality enhancement for improved clinical diagnosis. However, the difficulties to acquire a large number of high-quality reference image for different MR applications can limit its generalization performance. An unsupervised domain adaptation (DA) approach is proposed and incorporated into the deep learning based image enhancement framework, which improves the performance of trained network on new datasets. Preliminary evaluation on point spread function enhanced turbo spin echo imaging has showed that the unsupervised DA approach can provide more stabilized image sharpness improvement without severe amplified noise.

Introduction

Deep learning based image enhancement techniques can retrospectively improve the image quality for better clinical diagnosis¹. Typically, supervised learning with paired low- and high-quality images is used to train such neural network (NN) for image enhancement. Due to inter-scan motion, pixel-level misalignment between the two separate low-quality fast scan and high-quality scan raises challenges for this supervised learning framework. Given the high-quality image, simulation based image synthesis approach can be used to generate the co-registered low-quality image. However, the simulated noise pattern can be difficult to align with the realistic noise statistics particularly when fast imaging is applied, leading to deteriorated generalization performance of the trained NN. In addition, acquisition of a large number of high-quality reference images for the specific MR scans can be challenging due to the occurrence of motion in a prolonged scan. In this work, we leverage a domain adaptation (DA) approach^2,3 to train a NN for enhanced 3D turbo spin echo (TSE) imaging⁴ by using a few sample low-quality 3D TSE images and some publicly available high-quality images, and we investigated whether DA can synthesize more realistic noise patterns and whether the trained NN with DA has more stabilized point spread function (PSF) enhancement performance for 3D TSE imaging in comparison to the trained NN without DA.

Methods

Network Architectures
Figure 1 shows the overall DA network structure for PSF enhanced 3D TSE image. The image synthesis network aims to generate noise patterns similar to the sample TSE image, where the similarity is measured by a discriminator network. To mitigate synthesis of new image structures, we enforced the output of image synthesis network to be unbiased or zero-mean. Likewise, the discriminator network mainly evaluates whether the synthesized noise pattern is similar to the distribution in the sample TSE images. Therefore, mean removal was applied to the input image patches of the discriminator network before measuring the similarity. The synthesized low-quality image and high-quality image were paired to train the task specific network for PSF enhanced TSE image (i.e. improve image sharpness without noise amplification). Since the high-quality 3D TSE images are not required to train the PSF enhancement NN, this DA approach is considered as unsupervised training.
Training and Loss Functions
All three networks were trained simultaneously in an end-to-end manner, where the adversarial loss was applied to train the image synthesis and discriminator networks. To further control the relative magnitude of synthesized noise pattern, a L2 loss on the output of noise generator network was also applied and empirically weighted against the adversarial loss. The pixel-level L1 loss and feature-level perceptual VGG loss were applied to train the task specific network.
MR Datasets
A publicly available high-resolution brain dataset⁵ was downloaded to train the DA network and evaluate its performance for PSF enhanced TSE image, which includes 2D T2* gradient echo (0.12x0.12x0.6mm³), 3D MPRAGE (isotropic 0.44mm) and 3D TOF (isotropic 0.2mm) images acquired at Siemens 7T from one subject. 3D TSE images were acquired on a Philips 3T scanner with 6-fold acceleration on 3 healthy subjects, where 2 cases were used to generate sample low-quality image patches during training and the other 1 was used for testing.

Results

Comparison of synthesized images
Figure 2 shows the comparison of synthesized image patches using the high-quality images with two different approaches. Comparing to the sample patches from the 3D TSE image (Fig.2 B), the proposed DA approach can synthesize more similar noise patterns compared to the traditional approach by simply adding Rician noise onto the input image. Therefore, the DA approach allows synthesis of a training dataset with spatially variant noise pattern which is much closer to the acquired low-quality 3D TSE image with imaging acceleration.
Comparison of PSF enhanced images
Figure 3 (B)-(D) compare the performance of PSF enhanced images from the acquired 3D TSE image with T1/T2 relaxometry induced blurring (A). NN trained on the Rician noise corrupted blurring images (without DA approach) can result in severe noise amplification during PSF enhancement (B). Fine-tuning on such training dataset can help to reduce such noise amplification (C), but the sharp boundary restoration (e.g. small structure in cerebellum region) and background noise suppression might be difficult to be well balanced. The proposed DA approach (D) provides more stable performance for PSF enhanced results without severe noise amplification while demonstrating improved noise suppression in relatively uniform regions (dotted circle in red). This DA approach provides a more automatic fine-tuning strategy to better adapt the NN performance to the new test data.

Discussion and Conclusion

Given a few low-quality images acquired with the targeted fast MR scan, the proposed DA approach provides an unsupervised learning framework to leverage publicly available high-quality data for synthesis of training dataset that can approximate the noise/artifact pattern in the targeted scan for improved image enhancement. This framework can be useful for image enhancement tasks when the high-quality images are difficult to acquire for supervised learning. Initial evaluation on PSF enhanced TSE imaging has showed that the DA approach can adaptively stabilize the image sharpness improvement without severe amplified noise.

Acknowledgements

No acknowledgement found.

References

1. Litjens G, Kooi T, Bejnordi BE, Setio AAA, Ciompi F, Ghafoorian M, van der Laak JAWM, van Ginneken B, Sánchez CI. A survey on deep learning in medical image analysis. Med Image Anal. 2017;42:60-88.

2. Ganin Y, Lempitsky V. Unsupervised domain adaptation by backpropagation. In International conference on machine learning 2015, pp. 1180-1189.

3. Bousmalis K, Silberman N, Dohan D, Erhan D, Krishnan D. Unsupervised pixel-level domain adaptation with generative adversarial networks. In Proceedings of the IEEE conference on computer vision and pattern recognition 2017, pp. 3722-3731.

4. Zhou Z, Chen S, Balu N, Chu B, Zhao X, Sun J, Mossa-Basha M, Hatsukami T, Börnert P, Yuan C. Neural network enhanced 3D turbo spin echo for MR intracranial vessel wall imaging. Magn Reson Imaging. 2021 May;78:7-17. doi: 10.1016/j.mri.2021.01.004. Epub 2021 Feb 4. PMID: 33548457; PMCID: PMC7979503.

5. Stucht D, Danishad K, Schulze P, Godenschweger F, Zaitsev M, Speck O. Highest Resolution In Vivo Human Brain MRI Using Prospective Motion Correction. PLoS One. 2015;10(7):e0133921.

Figures

Figure 1：Architecture of domain adaptation (DA) network for point spread function (PSF) enhanced MR turbo spin echo (TSE) imaging. The high-quality image from a public dataset is used to train the task specific network, and also used to simulate the blurred image based on extended phase graph (same scan parameters used for sample TSE imaging) as the input for the image synthesis network. The image synthesis and discriminator network are optimized by an adversarial loss with L2-norm regularization. The PSF enhancer network is optimized by the L1 and perceptual loss.

Figure 2: Comparison of example synthesized image patches by different approaches from the training dataset (A) and example image patches from the sample images (B). Different columns in (A) from left to right corresponds to the input image to the image synthesis network, traditional approach to add Rician noise to the input image, output image from trained network with domain adaptation, and the high-quality ground truth image. Different rows show different examples of image patches. (B) 12 example reference images acquired by 3D TSE MR imaging.

Figure 3：Comparison of point spread function (PSF) enhanced images by different approaches. (A) Acquired MR turbo spin echo (TSE) image. (B) & (C) PSF enhanced images from trained networks without domain adaption (DA): (B) network trained with 40 iterations; (C) network trained with 20 iterations to prevent overfitting for noise. (D) PSF enhanced image using a trained network with DA. Zoom-in comparison of the cerebellum region (blue square in (A)) is shown on each panel. Note the difference of denoising performance for different approaches in the red dotted circle region.

Proc. Intl. Soc. Mag. Reson. Med. 30 (2022)

3459

DOI: https://doi.org/10.58530/2022/3459