Keywords: Analysis/Processing, Data Processing, magnetic resonance imaging, diffusion tensor imaging, self-supervised learning, transfer learning
Motivation: MRI with high resolution and/or acceleration factor suffers from intrinsic low signal-to-noise ratio. Supervised learning-based denoising significantly improves image quality, but requires high-SNR data as training targets.
Goal(s): To denoise images using noisy image repetitions without additional acquisition.
Approach: Noise2Average trains CNN to map each noisy image to its residual compared to the average of all noisy images at iteration 1 and all denoised images from iteration k-1 at iteration k. The images from opposite phase-encoding directions of EPI or different echo times of ME-MPRAGE are noisy repetitions.
Results: Noise2Average outperforms BM4D, AONLM and Noise2Noise in terms of image quality and DTI metrics.
Impact: By reducing the requirement for training data and time, Noise2Average substantially increases the feasibility and accessibility of deep learning based denoising methods for MRI and potentially benefits a wider range of clinical and neuroscientific studies.
1. Zhang K, Zuo W, Chen Y, Meng D, Zhang L. Beyond a gaussian denoiser: Residual learning of deep cnn for image denoising. IEEE transactions on image processing. 2017;26(7):3142-3155.
2. Tian Q, Zaretskaya N, Fan Q, et al. Improved cortical surface reconstruction using sub-millimeter resolution MPRAGE by image denoising. NeuroImage. 2021;233:117946.
3. Li Z, Tian Q, Ngamsombat C, et al. High‐fidelity fast volumetric brain MRI using synergistic wave‐controlled aliasing in parallel imaging and a hybrid denoising generative adversarial network (HDnGAN). Medical physics. 2021;
4. Lehtinen J, Munkberg J, Hasselgren J, et al. Noise2Noise: Learning Image Restoration without Clean Data. The International Conference on Machine Learning. 2018:2965-2974.
5. Ledig C, Theis L, Huszár F, et al. Photo-realistic single image super-resolution using a generative adversarial network. Proceedings of the IEEE conference on computer vision and pattern recognition. 2017:4681-4690.
6. Li Z, Bilgic B, Li Z, Ying K, Polimeni J, Huang S, Tian Q. Noise2Average: deep learning based denoising without high-SNR training data using iterative residual learning. Proceedings of the 31st Annual Meeting of the International Society for Magnetic Resonance in Medicine (ISMRM) 2022. 2022;
7. Smith SM, Jenkinson M, Woolrich MW, et al. Advances in functional and structural MR image analysis and implementation as FSL. Neuroimage. 2004;23:S208-S219.
8. Jenkinson M, Beckmann CF, Behrens TEJ, Woolrich MW, Smith SM. FSL. NeuroImage. 8/15/ 2012;62(2):782-790.
9. Andersson JL, Skare S, Ashburner J. How to correct susceptibility distortions in spin-echo echo-planar images: application to diffusion tensor imaging. Neuroimage. 2003;20(2):870-888.
10. Andersson JL, Sotiropoulos SN. An integrated approach to correction for off-resonance effects and subject movement in diffusion MR imaging. NeuroImage. 2016;125:1063-1078.
11. van der Kouwe AJ, Benner T, Salat DH, Fischl B. Brain morphometry with multiecho MPRAGE. NeuroImage. 2008;40(2):559-569.
12. Fan Q, Witzel T, Nummenmaa A, et al. MGH–USC Human Connectome Project datasets with ultra-high b-value diffusion MRI. NeuroImage. 2016;124:1108-1114.
13. Maggioni M, Katkovnik V, Egiazarian K, Foi A. Nonlocal transform-domain filter for volumetric data denoising and reconstruction. IEEE transactions on image processing. 2012;22(1):119-133.
14. Maggioni M, Foi A. Nonlocal transform-domain denoising of volumetric data with groupwise adaptive variance estimation. Computational Imaging X. 2012;8296:82960O
15. Manjón JV, Coupé P, Martí‐Bonmatí L, Collins DL, Robles M. Adaptive non‐local means denoising of MR images with spatially varying noise levels. Journal of Magnetic Resonance Imaging. 2010;31(1):192-203.
Figure 1. Learning strategies. Supervised denoising trains a CNN to map the average noisy images to its residual image compared to the ground-truth image (a). Noise2Noise trains a CNN to map one noisy image to other noisy images (b). Noise2Average trains a CNN to map one noisy image to image with increased SNR which is the average of the noisy images for iteration 1, and the average of all denoised images from iteration k-1 for iteration k (k>1, …) (c). 3D modified U-Nets with receptive view of 23 and 39 are used to adapt to accommodate different image resolutions (d: 1.5 mm, e: 0.6 mm).
Figure 2. DWI with opposite phase-encoding directions. Exemplary axial image slices acquired with opposite phase-encoding (PE) directions (i.e., anterior–posterior (AP) and posterior–anterior (PA)) (i, ii) before (a, c) and after (b, d) geometric distortion and signal intensity correction using the “topup” function of FSL through the middle brain (a, b) and brain regions near air-tissue boundaries (c, d, green and magenta arrowheads) of a subject from the HCP-A data are shown. Residual maps between corrected images with opposite PE directions are shown (b, iii and d, iii).
Figure 3. Image results of HCP-A diffusion data. Axial image slices of the DWI synthesized from ground-truth tensors (a, i), raw acquired anterior-posterior (AP) volume (a, ii), average volume of AP and PA volumes (a, iii), BM4D-denoised, AONLM-denoised, supervised learning-denoised averaged volume (a, iv-vi), Noise2Noise-denoised data (c, i), Noise2Average-denoised data from iteration 1 to 5 (c, ii-vi) of a subject of HCP-A diffusion data are displayed. The group means (± group standard deviations) of image metrics over 10 subjects are listed (e).
Figure 4. DTI metrics of HCP-A diffusion data. The group means (± group standard deviations) of the mean absolute errors between DTI metrics derived from one b = 0 and six DWI volumes acquired along anterior-posterior (AP) phase-encoding direction (a), AP and PA average volume (b), BM4D-denoised, AONLM-denoised, supervised learning-denoised and Noise2Noise-denoised data (c-f) and Noise2Average (N2A)-denoised data from iteration 1 to 5 (g-k) and the ground truth over 20 subjects. Red and green texts indicate the best and the second-best performance from different methods.
Figure 5. Image results of MGH T1w ME-MPRAGE data. Exemplary axial image slices from the 6-repetition averaged image volume (ground truth, a, i), single noisy image volume (the root mean square of the two images with different echo times of one repetition) (a, ii), BM4D-denoised, AONLM-denoised, supervised learning-denoised data (a, iii-v) , Noise2Noise-denoised data (c, i) and Noise2Average-denoised data from iteration 1 to 4 (d, ii-v) of a subject from MGH T1w ME-MPRAGE data are shown. The group means (± group standard deviations) of image metrics over five subjects are listed (e).