0984

Structure preserving noise removal in Hilbert space from ultra-high resolution diffusion MRI data
Gabriel Ramos-Llordén1, Gonzalo Vegas-Sanchez-Ferrero1, Congyu Liao2, Carl-Fredrik Westin1, Kawin Setsompop2, and Yogesh Rathi1
1Brigham and Women's Hospital, Harvard Medical School, Boston, MA, United States, 2Massachusetts General Hospital, Harvard Medical School, Charlestown, MA, United States

Synopsis

In-vivo submillimeter resolution diffusion MRI suffers from limited signal-to-noise ratio (SNR) due to the small voxel size. Denoising techniques can improve the SNR and facilitate further dMRI analysis. Among them, perhaps PCA-based (e.g, Marchenko-Pastur PCA) have shown the best performance. In this work, we introduce kernel PCA, a powerful nonlinear generalization of linear PCA to Hilbert spaces that is shown to suppress a substantial amount of noise (which MP-PCA is incapable of) and still reliably preserve dMRI signal. We showcase K-PCA noise removal with 660 micrometer gSlider data, where we compared it qualitatively and qualitatively with MP-PCA.

Introduction

Ultra-high resolution diffusion MRI holds the promise of being an essential tool to explore human brain at an unprecedented level of detail. However, at that resolution, severe noise often obscures fine anatomical details and hampers further dMRI analysis. We introduce kernel PCA (principal component analysis), a non-linear generalization of the traditional linear PCA to Hilbert spaces, to suppress noise while preserving the fine details in the diffusion-weighted images. The proposed method holds promise for denoising at very low SNR levels outperforming standard PCA denoising methods.

Methods

Kernel PCA
Kernel PCA (K-PCA) is a nonlinear extension of the traditional linear PCA where principal component analysis is performed not in the native space, but in a high dimensional reproducing kernel Hilbert space (RKHS) where the data lies on a flat manifold1. Let the input data set be $$$\{ {\bf{x}}_k \in \mathbb{R}^D, k=1,...,N\}$$$ and the mapped data be $$$\psi({\bf{x}}_k ) \in F$$$, where $$$\psi(\cdot)$$$ is a non-linear mapping from the original space to the RKHS1,2. As in PCA, the directions of maximum variance in the RKHS feature space can be obtained using singular value decomposition in the feature space $$$F$$$1,2. The power of K-PCA lies in its ability to capture non-linear dependencies/redundancies that may exist in the data, i.e., diffusion signal, and that are not possible to model with linear PCA. In terms of denoising, given an input vector (dMRI signal), $$$ \bf x $$$, a denoised version $$$\hat{\bf x }$$$ is obtained as a pre-image of the projection $$$P\psi( \bf x )$$$ of $$$ \bf x $$$ in the feature space. To accomplish the whole process, there is no need to know the mapping $$$\psi(\cdot)$$$ explicitly but just to define a kernel function, $$$K( {\bf{x}}_k , {\bf{x}}_{k'})$$$, which measures the similarity between two vectors $$${\bf{x}}_k$$$ and $$${\bf{x}}_{k'}$$$,3 allowing to define a metric on $$$F$$$1,2.

Experiments
We demonstrate that K-PCA denoising outperforms state-of-the-art linear PCA based denoising methods, e.g. Marchenko-Pastur PCA, MP-PCA4,5, in removing noise while preserving the fine structure in ultra-high resolution dMRI data. We implement K-PCA in a voxel-wise fashion with a patch size of [5x5x5]. A Gaussian kernel is a typical choice for KPCA with bandwith $$$\sigma$$$ defined automatically and adaptively in a data-driven manner as given in6-8. To obtain the denoised signal $$$\hat{\bf x }$$$ in closed-form, we used the method presented in8. We note that no truncation of the eigenvalues is required in this method, unlike MP-PCA. Finally, MP-PCA was implemented in a similar voxel-wise fashion with the same patch size for comparative analysis.
Dataset:
Whole-brain gSlider-SMS dMRI data (D = 64 directions, $$$b = 1500 s/{mm}^2$$$, 7b0 volumes) was acquired (sagittally), with three repetitions, and reconstructed with conventional gSlider processing9 to create 660 $$${\mu}m$$$ isotropic resolution data( gslider-1av). K-PCA and MP-PCA were then used to denoise the 64 images of one of the three DWI datasets (gslider-1av) and compared to a reference: gSlider reconstruction averaged three times(gSlider-3av).
Validation:
We used the normalized mean square error (NMSE) of the DWI signals and the relative absolute error of the fractional anisotropy (FA), and mean diffusivity (MD) estimated after applying DTI (dtifit FSL) to compare the denoised DWI datasets. Reference FA and MD values were those derived from gSlider-3av: $$$FA_{3AV}$$$, $$$MD_{3AV}$$$ (estimated with the same DTI algorithm).

Results

An axial slice of the denoised DWI image with MP-PCA and K-PCA is shown in Fig.1c-d, respectively, whereas the original (noisy) DWI set (gSlider-1av) is shown in Fig.1b. Clearly, K-PCA achieves substantially more noise reduction than MP-PCA. With three-times shorter acquisition time, visual quality seems superior to the gSlider-3av case (Fig.1a). This is also evident quantitatively from the NMSE results (Fig.2). Further, K-PCA has much lower NMSE in several regions compared to MP-MPCA. Residual map of K-PCA in Fig.3 does not present any structure, demonstrating that no signal information is removed. Superior noise removal can also be observed in the color-encoded FA maps that are presented in Fig 4. Fine details (see cerebellum area) remain obscured by noise in gSlider-1av and in MP-PCA. However, much better delineation of cerebellum structures, due to noise suppression, are seen with K-PCA (Fig.4d). Finally, we report (Fig.5) whole-brain errors in FA and MD. While comparable for MD, K-PCA achieves significant reduction in NMSE as well as in relative error in FA.

Discussion

K-PCA can achieve a higher noise suppression than MP-PCA while preserving intricate anatomical details. An important benefit of K-PCA is that it does not require a large number of samples to perform at its best, as MP-PCA does for the MP assumption to hold. K-PCA does not need eigenvalue truncation and can work with any dMRI protocol.

Conclusion

We have introduced K-PCA, a powerful and nonlinear generalization of linear PCA, that can reliably reduce noise in dMRI substantially more than state-of-the art PCA methods (MP-PCA) while preserving the underlying true signal. Showcased with 660 isotropic micrometer resolution gSlider dMRI data, we demonstrate that fine details that are obscured by noise with MP-PCA, are visible by K-PCA due to its superior noise removal performance. We believe K-PCA denoising can be an essential tool in any dMRI analysis pipeline, especially for highly noisy datasets.

Acknowledgements

R01MH116173 (PIs: Setsompop, Rathi)

References

1. Schölkopf, B., Smola, A., & Müller, K. R. (1997, October). Kernel principal component analysis. In International conference on artificial neural networks (pp. 583-588). Springer, Berlin, Heidelberg.

2. Mika, S., Schölkopf, B., Smola, A. J., Müller, K. R., Scholz, M., & Rätsch, G. (1999). Kernel PCA and de-noising in feature spaces. In Advances in neural information processing systems (pp. 536-542).

3. Kwok, J. Y., & Tsang, I. H. (2004). The pre-image problem in kernel methods. IEEE transactions on neural networks, 15(6), 1517-1525.

4. Veraart, J., Fieremans, E., & Novikov, D. S. (2016). Diffusion MRI noise mapping using random matrix theory. Magnetic resonance in medicine, 76(5), 1582-1593.

5. Veraart, J., Novikov, D. S., Christiaens, D., Ades-Aron, B., Sijbers, J., & Fieremans, E. (2016). Denoising of diffusion MRI using random matrix theory. NeuroImage, 142, 394-406.

6. Weinberger, K. Q., Sha, F., & Saul, L. K. (2004, July). Learning a kernel matrix for nonlinear dimensionality reduction. In Proceedings of the twenty-first international conference on Machine learning (p. 106). ACM.

7. Jorgensen, K. W., & Hansen, L. K. (2011). Model selection for Gaussian kernel PCA denoising. IEEE transactions on neural networks and learning systems, 23(1), 163-168.

8. Rathi, Y., Dambreville, S., & Tannenbaum, A. (2006, February). Statistical shape analysis using kernel PCA. In Image processing: algorithms and systems, neural networks, and machine learning (Vol. 6064, p. 60641B). International Society for Optics and Photonics.

9. Setsompop, K., Fan, Q., Stockmann, J., Bilgic, B., Huang, S., Cauley, S. F., ... & Wald, L. L. (2018). High‐resolution in vivo diffusion imaging of the human brain with generalized slice dithered enhanced resolution: Simultaneous multislice (g Slider‐SMS). Magnetic resonance in medicine, 79(1), 141-151.

Figures

Axial slice for a given diffusion direction of the 660 $$${\mu}m$$$ DWI sets are shown: (b) the gSlider-1av DWI set is denoised with (c) MP-PCA and (d) K-PCA (d). For comparison, the reference gSlider-3av (a) is also shown. Observe that K-PCA denoising achieves a superior noise reduction compared to MP-PCA (center of the slice), while preserving fine details (see zoomed-in area). With a reduction of time of 3x, K-PCA denoised DWI set presents even higher noise suppression than gSlider-3av.

NMSE maps of the (b) gSlider-1av set and (c) MP-PCA and (d) K-PCA denoised sets compared to the (a) gSlider-3av DWI sets. Axial, coronal and sagittal slices are shown for the same diffusion direction. Note that with K-PCA, there is a substantial reduction in NMSE, especially in areas with high noise level (center of sagittal slice).

Residual map (Noisy image - Denoised image) for the mid-axial slice of Fig.1 for (a) MP-PCA and (b) K-PCA denoising.

Axial slice of color-encoded FA maps estimated from the (a) gSlider-3av, (b) gSlider-1av sets and from the (c) MP-PCA and (d) K-PCA denoised sets are shown. The superior denoising capability of K-PCA becomes evident in this example. It is clear in regions such as the cerebellum or crossing-fibers (zoomed-in picture).

Quantitative validation in terms of DWI signal quality as well as DTI-based metrics. The spatially averaged (brain-tissue only, excluding CSF) relative error of FA and MD, defined as $$$| FA_{3AV} - \hat{FA}| / FA_{3AV}$$$ and $$$| MD_{3AV} - \hat{MD}| / MD_{3AV}$$$, is reported.

Proc. Intl. Soc. Mag. Reson. Med. 28 (2020)
0984