3980

Exploring domain adaption for deep neural network trained QSM

JUAN LIU¹ and Kevin Koch²
¹Yale University, New Haven, CT, United States, ²Medical College of Wisconsin, Milwaukee, WI, United States

Synopsis

We aim to address the domain adaption problem of neural networks for QSM reconstruction which are learned from synthetic data while applied on real data. To address the unsupervised domain adaption, we apply domain-specific batch normalization layers in convolutional neural networks while allowing them to share all other model parameters. The proposed method is evaluated on multiple orientation datasets and single-orientation QSM datasets. Compared withTKD, MEDI, and DL-based method first training on synthetic datasets then model-based fine-tuning on real datasets, the proposed method achieved better performance.

Introduction

Quantitative susceptibility mapping (QSM) is a MRI technique that estimates tissue magnetic susceptibility. The generation of QSM requires solving a challenging ill-posed field-to-source inversion problem. Recently, several deep learning (DL) QSM techniques[1,2,3,4] have been proposed and demonstrated impressive performance. Due to the inherent non-existent ground-truth QSM references, these techniques used either COSMOS [5] maps or synthetic data for network training. Synthetic data is easy to generate but often adapt poorly to domain shifts. Here, we introduce an easy domain adaption technique using domain-specific batch normalization[6] to improve the domain adaption.

Method

[Neural Network] A modified U-net for 3D data processing was utilized for the dipole inversion, with inputs of local field and brain mask and output of QSM. For the training of the network, whole simulated QSM maps were generated using single brain COSMOS map from 2016 QSM reconstruction challenge and data augmentation techniques (elastic transform, contrast change, and adding lesions). The local field were computed using the dipole kernel convolution. We apply domain-specific batch normalization layers in networks while allowing them to share all other model parameters. For synthetic data, L2 loss was used $$$L_{S} = \left \| \chi _{s} - \chi _{ground truth} \right \|^{2}$$$, where $$$\chi _{s}$$$ is the network QSM output of synthetic data, and $$$\chi _{ground truth}$$$ is the QSM label of synthetic data. For real data, data consistency loss was used $$$L_{T} = \left \| W_{t} (e^{j d\ast \chi _{t}} - e^{j f_{t}}) \right \|^{2}$$$, where $$$W_{t}$$$ is the data weighting term to incorporate the noise weighting term and brain mask, $$$d$$$ is dipole kernel, $$$\chi _{t}$$$ is the network QSM outputs of real data, $$$f_{t}$$$ is local field of real data. Total loss incorporates these two losses $$$L_{Total} = L_{S} + \lambda L_{T}$$$, where $$$\lambda$$$ is loss weight which increases during training. The proposed method is denoted as QSMInvNet+.

[Data] 9 multi-orientation datasets were acquired with 5 head orientations and a 3D single-echo GRE scan with isotropic voxel size 1.0x1.0x1.0mm³ on 3T MRI scanners [1]. The data processing was the same as [9]. For network training, 500 synthetic data were generated then cropped to image patches with patch size 96x96x96 and training data 3000. During training, the RESHARP local field, magnitude image and brain mask of 36 scans (9*4 tilted head positions) were randomly cropped with patch size 96x96x96. QSMInvNet+ was trained using patch-based network with patch size 96x96x96. Adam was used as optimizer with the learning rate of 10-4. The network was implemented using TensorFlow and was trained on one NVIDIA K80 GPU. TKD[7], MEDI[8], COSMOS were performed to get QSM results of normal head position. In addition, DL-based techniques QSMInvNet (which first trained on synthetic data, then fine-tuned the trained model on real data according to the physical model), QSMInvNet+, and uQSM[9] were performed to compare the QSM results. Quantitative metrics such as NRMSE, HFEN, SSIM were calculated using COSMOS result as a reference.

100 clinical data were acquired using susceptibility-weighted angiography on a 3T MRI scanner (GE Healthcare MR750) with data acquisition parameters: in-plane data matrix 288x224, FOV 22 cm, slice thickness 3 mm, first TE 12.6 ms, echo spacing 4.1 ms, 7 echoes, TR 39.7 ms, pixel bandwidth 244 Hz, and total acquisition time of about 2 minutes. Complex multi-echo images were reconstructed from raw k-space data. The brain masks were obtained using the SPM tool. RESHARP method with spherical mean radius 6mm were used for background field removal. For network training, simulated data were generated with voxel size 0.76x0.76x0.76mm³. The network was trained with patch size 128x128x64. TKD, MEDI, QSMInvNet, and QSMInvNet+ were performed to compare the QSM results.

Result

Fig. 2 displays the quantitative metrics of QSM results with COSMOS map as an reference. QSMInvNet+ achieved the best score in PSNR, NRMSE, and HFEN.
Fig. 3 compared QSM results of a multi-orientation dataset. Compared with conventional methods TKD and MEDI, DL-based methods display QSM with clearer details and less artifacts. Among DL-based methods, QSMInvNet+ results shows less shading artifacts. Fig. 4 and Fig. 5 compared QSM results of two clinical datasets with brain hemorrhage. TKD and MEDI results display sever streaking artifact, while QSMInvNet and QSMInvNet+ outperformed with clear details and invisible artifacts. Compared with QSMInvNet, QSMInvNet+ images showed improved performance with less shading artifacts around the lesions.

Discussion and Conclusion

We have proposed to apply domain-specific batch normalization to improve domain adaption of neural networks learning from synthetic data. This method outperforms QSMInvNet - first learning from synthetic data then fine-tuning on real data. Since the method requires simulated data and minor changes on network architecture, it can be easily utilized to facilitate QSM researches.

Acknowledgements

We thank Professor Jongho Lee for sharing the multi-orientation QSM datasets.

References

[1] Yoon, Jaeyeon, et al. "Quantitative susceptibility mapping using deep neural network: QSMnet." Neuroimage 179 (2018): 199-206.

[2] Bollmann, Steffen, et al. "DeepQSM-using deep learning to solve the dipole inversion for quantitative susceptibility mapping." Neuroimage 195 (2019): 373-383.

[3] Chen, Yicheng, et al. "QSMGAN: improved quantitative susceptibility mapping using 3d generative adversarial networks with increased receptive field." NeuroImage 207 (2020): 116389.

[4] Jung, Woojin, et al. "Exploring linearity of deep neural network trained QSM: QSMnet+." NeuroImage 211 (2020): 116619.

[5] Liu, Tian, et al. "Calculation of susceptibility through multiple orientation sampling (COSMOS): a method for conditioning the inverse problem from measured magnetic field map to susceptibility source image in MRI." Magnetic Resonance in Medicine: 61.1 (2009): 196-204.4.

[6] Chang, Woong-Gi, et al. "Domain-specific batch normalization for unsupervised domain adaptation." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2019.

[7] Shmueli, Karin, et al. "Magnetic susceptibility mapping of brain tissue in vivo using MRI phase data." Magnetic Resonance in Medicine: 62.6 (2009): 1510-1522.7.

[8] Liu, Tian, et al. "Morphology enabled dipole inversion (MEDI) from a single angle acquisition: comparison with COSMOS in human brain imaging." Magnetic resonance in medicine 66.3 (2011): 777-783.

[9] Liu, Juan, and Kevin M. Koch. "Model-Based Learning for Quantitative Susceptibility Mapping." International Workshop on Machine Learning for Medical Image Reconstruction. Springer, Cham, 2020.

Figures

Fig 1. Neural network architecture of QSMInvNet+. It has an encoder-decoder structure with 9 convolutional layers (kernel size 3x3x3, same padding), 9x2 domain-specific batch normalization layers, 9 ReLU layers, 4 max pooling layers (pooling size 2x2x2, strides 2x2x2), 4 nearest-neighbor upsampling layers (size 2x2x2), 4 feature concatenations, and 1 convolutional layer (kernel size 1x1x1, linear activation). To address the unsupervised domain adaption on real data, domain-specific batch normalization layers are utilized while sharing all other model parameters.

Fig 2. Means and standard deviations of quantitative performance metrics of 5 reconstructed QSM images with COSMOS as a reference on 9 multi-orientation datasets. QSMInvNet+ achieved the best in PSNR, NRMSE, and HFEN.

Fig 3. Comparison of QSM of a multi-orientation dataset. TDK (a), MEDI (b) and uQSM (e) results show black shading artifacts in the axial plane and streaking artifacts in the sagittal plane. QSMInvNet (c) displays better quality then TKD and MEDI, but shows subtle artifacts. QSMInvNet+(d) results have high-quality with clear details and invisible artifacts.

Fig 4. SM images (a-d) and SWI images (e) from a 37-year-old pa-tient with surgical planning. TKD (a), and MEDI (b) maps showed oversmoothing and/or streaking artifacts. Compared with TKD and MEDI, QSMInvNet (c) and QSMInvNet+ (d) maps better preserve image details and show subtle artifacts. QSMInvNet+ outperformed QSMInvNet with less shading artifacts around the hemorrhage regions (c, black arrows).

Fig 5. QSM images (a-d) and SWI images (e) from a 56-year-old subject withhemorrhagic intracranial metastas. TKD (a), and MEDI (b) maps showed oversmoothing and/or streaking artifacts. Compared with TKD and MEDI, QSMInvNet (c) and QSMInvNet+ (d) maps show less image artifacts and clearer details. QSMInvNet+ outperformed QSMInvNet with less artifacts especially around the hemorrhage regions (c, black arrows).

Proc. Intl. Soc. Mag. Reson. Med. 29 (2021)

3980