1981

dtiRIM: A recurrent inference machine for diffusion tensor estimation

Emanoel R. Sabidussi¹, Stefan Klein¹, Ben Jeurissen², and Dirk H. J. Poot¹
¹Radiology, Erasmus Medical Center, Rotterdam, Netherlands, ²Department of Physics, Imec-Vision Lab, University of Antwerp, Antwerp, Belgium

Synopsis

In Diffusion MRI, the large variability of acquisition schemes limits broader use of Deep Learning for parameter estimation, with data-specific models needed for high-quality predictions. To reduce dependency on training data, we present dtiRIM, a recurrent neural network that learns a regularized solution to a model-based inverse problem. Using the diffusion model allows independent parameters (e.g. gradient directions) to also influence the estimation. We show that a single dtiRIM model predicts diffusion parameters for multiple datasets with lower error than the current state-of-the-art. Our study suggests that dtiRIM has the potential to be the first general learning method for DTI.

Introduction

An outstanding challenge when using Deep Learning (DL) for Diffusion Tensor Imaging (DTI) is the large variability of acquisition settings (e.g. number of q-space samples, gradient directions and strength) between datasets. Existing works^1-3 applied DL to estimate diffusion properties (e.g fractional anisotropy (FA), mean diffusivity (MD)) from diffusion-weighted (DW) images, but different acquisition schemes required training new models.

In this study, we propose to use the Recurrent Inference Machine (RIM) framework⁴ to estimate the diffusion tensor parameters with low dependency on scan settings. RIMs learn to minimize a model-based likelihood function, allowing for greater variability of fixed model parameters (e.g. gradient directions). We evaluated our model in simulation and in-vivo experiments.

Methods

Our RIM for DTI (

$\textit{dti}\textbf{RIM}$ ) is shown in Figure 1. The RIM is a recurrent neural network

$\hat{\boldsymbol{\theta}}=H_{\boldsymbol{\gamma}}(\tilde{\boldsymbol{S}};\boldsymbol{b},\boldsymbol{g})$ that estimates the diffusion tensor parameters

$\boldsymbol{\theta}=[S_0,D_{11},D_{12},D_{22},D_{13},D_{23},D_{33}]$ using the Rice likelihood

$L(\boldsymbol{S}(\boldsymbol{\theta};\boldsymbol{b},\boldsymbol{g})|\tilde{\boldsymbol{S}})$ w.r.t. the acquired DW images

$\tilde{\boldsymbol{S}}$ , where

$\boldsymbol{b}$ is the gradient strength and

$\boldsymbol{g}$ the direction. The network weights

$\boldsymbol{\gamma}$ are trained on simulation data, by minimizing the differences between ground-truth noiseless DW images

$\boldsymbol{S}^{GT}$ and predicted images

$\boldsymbol{S}(\hat{\boldsymbol{\theta}};\boldsymbol{b},\boldsymbol{g})$ :

$\textstyle{\hat{\boldsymbol{\gamma}}=\arg\min_{\boldsymbol{\gamma}}\frac{1}{N}\sum\nolimits_{n=0}^{N-1}(\boldsymbol{S}^{GT}_{n}-\boldsymbol{S}_n(\hat{\boldsymbol{\theta}};\boldsymbol{b},\boldsymbol{g}))^{2}}$ , where

$N$ is the number of images in the training set.

Training samples

$T^F_i=\{\tilde{\boldsymbol{S}},\boldsymbol{S}^{GT},\boldsymbol{b},\boldsymbol{g}\}$ were generated with the Numerical Fiber Generator⁵ (NFG). For each sample

$i$ ,

$\boldsymbol{S}^{GT}$ was simulated with uniformly sampled FA

$\sim\mathcal{U}(0.05,1)$ and MD

$\sim\mathcal{U}(0.0003,0.003)$ ,

$b=1000s/mm^2$ , and

$\boldsymbol{g}$ and

$N$ randomly sampled from a pre-determined set

$\boldsymbol{G}$ . To ensure approximately uniform sets of gradient directions of size

$N\in\{7,13,...,55,61\}$ (in steps of 6), we selected 10 unique subsets

$\boldsymbol{g}\in\boldsymbol{G}$ with the lowest electrostatic repulsion⁶ among 1000 random subsets of size

$N$ . Finally, Rician noise was added. In total,

$400.000$ noisy training samples were created.

Two validation sets were used to tune the RIM's hyperparameters. The first (

$\textbf{NFG}_{val}$ ) was also created with NFG, but with a different configuration of fibers,

$\boldsymbol{g}\in\boldsymbol{G},N=31$ and SNR 25. The second (

$\textbf{Diffantom}$ ), was created using the Diffantom simulator⁷, with gradients

$\boldsymbol{G},N=129,b=2000 s/mm^2$ and SNR 30.

In contrast to voxel-wise methods, RIMs also learn a spatial regularization⁸, which might provide robustness to small deviations between signals and the diffusion model. To evaluate this, we created additional

$\textbf{NFG}_{val}$ and

$\textbf{Diffantom}$ datasets with (slightly) misaligned images. Random rigid transformations were applied when generating

$\tilde{\boldsymbol{S}}$ from

$\boldsymbol{S}^{GT}$ , with translations and rotations independently sampled from

$\mathcal{N}(0.0,\sigma_d)$ , with small

$\sigma_d=[0.25,0.5]$ to simulate sub-voxel misalignment. Fifty realizations of motion and noise were created per dataset. Bias, standard deviation and RMSE of the FA and MD maps were evaluated. Ground-truth maps were defined as the least-squares solution of

$\boldsymbol{\theta}$ on

$\boldsymbol{S}^{GT}$ .

The testing data (

$\textbf{AOMIC}$ ) is composed of 1 subject from the AOMIC-ID1000 dataset⁹, acquired with gradient directions

$\boldsymbol{G_t}\notin\boldsymbol{G}, N=33, b = 1000 s/mm^2$ . To evaluate our method's precision, we emulated repeated scans by creating 4 datasets from the original data with varying number of images (

$N=\{13,19,25,31\},\boldsymbol{g_t}\in\boldsymbol{G_t}$ ). Each contained 5 unique DW sets, with

$\boldsymbol{g_t}$ selected by minimizing the electrostatic repulsion among 100 random subsets of

$\boldsymbol{G_t}$ . Ground-truth maps were obtained from the full set (

$\boldsymbol{G_t}, N=33$ ) with the state-of-the-art

$\textbf{IWLLS}$ ¹⁰ from MRTrix3¹¹, also used here as baseline in all experiments. Statistical significance was evaluated with an unpaired t-test (

$p<0.001$ ).

Results

Figure 2 shows the results of FA for all datasets. On

$\textbf{NFG}_{val}$ datasets,

$\textit{dti}\textbf{RIM}$ presented lower bias, STD and RMSE than

$\textbf{IWLLS}$ . On

$\textbf{Diffantom}$ datasets,

$\textit{dti}\textbf{RIM}$ had lower STD and RMSE, but higher bias. Additionally,

$\textit{dti}\textbf{RIM}$ errors increased slower than

$\textbf{IWLLS}$ for larger

$\sigma_d$ . For

$\textbf{AOMIC}$ ,

$\textit{dti}\textbf{RIM}$ had lower STD (

$\forall N$ ), but larger bias and RMSE for

$N\leq18$ .

Figure 3 shows the results of MD for all datasets.

$\textit{dti}\textbf{RIM}$ had lower bias, STD and RMSE than

$\textbf{IWLLS}$ in all validation datasets. In

$\textbf{AOMIC}$ ,

$\textit{dti}\textbf{RIM}$ had lower STD, but higher RMSE compared to

$\textbf{IWLLS}$ , while no significant differences were found for bias.

Figures 4 and 5 show FA and MD maps from

$\textbf{IWLLS}$ and

$\textit{dti}\textbf{RIM}$ (data

$\boldsymbol{G_t}, N=33$ ) alongside RMSE maps computed for different

$N$ .

$\textit{dti}\textbf{RIM}$ FA estimates are less dependent on

$N$ than

$\textbf{IWLLS}$ 's, while MD maps are smoother, and with lower accuracy in high diffusivity regions (e.g. ventricles).

Discussion

We showed that a single RIM model can be used with DW images acquired with different number of q-space samples, gradient directions and strength. The

$\textit{dti}\textbf{RIM}$ reduced FA and MD errors compared to the state-of-the-art

$\textbf{IWLLS}$ for most datasets. The negative FA bias on in-vivo data can be (mainly) attributed to lower FA estimates in gray matter and CSF, while smaller differences were observed on white matter. MD maps from

$\textit{dti}\textbf{RIM}$ are slightly blurred compared to

$\textbf{IWLLS}$ 's, promoting higher estimation precision, at the cost of higher total error. Additionally, (slight) robustness to misaligned images was shown.

Conclusion

This work presented a novel approach to estimate DTI parameters. Our Recurrent Inference Machine method provides estimates with similar or lower error that IWLLS. It showed robustness to anatomy and variation in scan protocol, including q-space sampling.. Although more validation is needed (e.g. larger in-vivo datasets, pathology), our study suggests that the RIM can provide diffusion parameters that are robust and of higher quality than the current state-of-the-art.

Acknowledgements

This project was funded by the European Union’s Horizon 2020 programme, under the grant agreement No 764513.

References

1. H. Li, Z. Liang, C. Zhang, R. Liu, and et al.,“SuperDTI: Ultrafast dti and fiber tractography with deep learning,” MRM,vol. 86, p. 3334–3347, Jul 2021.

2. V. Golkov, A. Dosovitskiy, J. I. Sperl, and et al., “q-Space Deep Learning: Twelve-Fold Shorter and Model-Free Diffusion MRI Scans,” IEEE-TMI,vol. 35, pp. 1344–1351,May 2016.

3. E. Aliotta, H. Nourzadeh, and S. H. Patel, “Extracting diffusion tensor fractional anisotropy and mean diffusivity from 3-direction DWI scans using deep learning,” MRM,vol. 85, pp. 845–854, Aug. 2020.

4. P. Putzky and W. Welling, “Recurrent inference machines for solving inverse problems,” 2017. arXiv:1706.04008

5. T. G. Close, J.-D. Tournier, F. Calamante, L. A. Johnston, I. Mareels, and A. Connelly, “A software tool to generate simulated white matter structures for the assessment of fibre-tracking algorithms,” NeuroImage, vol. 47, pp. 1288–1300, Oct. 2009.

6. P. A. Cook, M. Symms, P. A. Boulby, and D. C. Alexander, “Optimal acquisition orders of diffusion-weighted MRI measurements,” JMRI, vol. 25, no. 5, pp. 1051–1058,2007.

7. O. Esteban, E. Caruyer, A. Daducci, M. Bach-Cuadra, M. J. Ledesma-Carbayo, and A. Santos, “Diffantom: Whole-Brain Diffusion MRI Phantoms Derived from Real Datasets of the Human Connectome Project,” Frontiers in Neuroinformatics, vol. 10, Feb. 2016.

8. E. Sabidussi, S. Klein, M. Caan, S. Bazrafkan, A. den Dekker, J. Sijbers, W. Niessen, and D. H. J. Poot,“Recurrent inference machines as inverse problem solvers for MR relaxometry,” Medical Image Analysis,vol. 74, p. 102220, Dec.2021.

9. L. Snoek, M. van der Miesen, A. van der Leij, T. Beemsterboer, A. Eigenhuis, and S. Scholte, “”aomic-id1000”,” OpenNeuro 2021. DOI: 10.18112/openneuro.ds003097.v1.2.1

10. J. Veraart, J. Sijbers, S. Sunaert, A. Leemans, and B. Jeurissen, “Weighted linear least squares estimation of diffusion MRI parameters: Strengths, limitations, and pitfalls,” NeuroImage, vol. 81, pp. 335–346,Nov. 2013.

11. J.D. Tournier, R. Smith, D. Raffelt, R. Tabbara, T. Dhollander, M. Pietsch, D. Christiaens, B. Jeuris-sen, C.H. Yeh, and A. Connelly, “Mrtrix3: A fast, flexible and open software framework for medical image processing and visualisation,” NeuroImage, vol. 202, p. 116137, 2019.

Figures

Figure 1. RIM framework (

$\hat{\boldsymbol{\theta}}=H_\gamma(\tilde{\boldsymbol{S}};\boldsymbol{b},\boldsymbol{g})$ ). At a given step

$j \in \{1,...,J\}$ , the neural network

$g_{\gamma}$ receives the current parameter estimates,

$\hat{\boldsymbol{\theta}}_{j-1}$ , the gradient of a likelihood,

$\nabla_{\boldsymbol{\theta}}$ , and a vector of memory states

$\boldsymbol{h}_{j-1}$ the RIM can use to keep track of optimization progress. The network outputs an update to the current estimate (

$\Delta \boldsymbol{\theta}_j$ ).

Figure 2. Error measurement in Fractional Anisotropy estimation for different motion severity (

$\sigma_d$ ) with simulation (

$\textbf{NFG}_{val}$ ,

$\textbf{Diffantom}$ ) and in-vivo datasets (

$\textbf{AOMIC}$ ). Measures of bias, standard deviation (STD) and root-mean-square error (RMSE). Whiskers represent the 95-percentile and outliers are not shown. All mean differences between

$\textit{dti}\textbf{RIM}$ and

$\textbf{IWLLS}$ are significant (unpaired t-test,

$p<0.001$ ).

Figure 3. Error measurement in Mean Diffusivity estimation for different motion severity (

$\sigma_d$ ) with simulation (

$\textbf{NFG}_{val}$ ,

$\textbf{Diffantom}$ ) and in-vivo datasets (

$\textbf{AOMIC}$ ). Measures of bias, standard deviation (STD) and root-mean-square error (RMSE). Whiskers represent the 95-percentile and outliers are not shown.

$\textbf{n.s.}$ denotes non-significant mean differences (unpaired t-test,

$p<0.001$ ).

Figure 4. Left column shows estimated fractional anisotropy (FA) maps for

$dti\textbf{RIM}$ and

$\textbf{IWLLS}$ . RMSE maps for different number of DW images (

$N$ ) are shown on the right. RMSE was defined, per pixel, as

$\sqrt{\frac{1}{C} \sum_{c=1}^{C} \left(\boldsymbol{\hat{\phi}}^{c} - \boldsymbol{\phi}\right)^2}$ , where

$c=[1, ..., C]$ is the repetition index,

$\hat{\boldsymbol{\phi}}$ is the estimated FA map and

$\boldsymbol{\phi}$ is the

$\textbf{IWLLS}$ FA estimate using fully-sampled set (

$N=33$ ).

Figure 5. Left column show estimated mean diffusivity (MD) maps for

$dti\textbf{RIM}$ and

$\textbf{IWLLS}$ . RMSE maps for different number of DW images (

$N$ ) are shown on the right. RMSE was defined, per pixel, as

$\sqrt{\frac{1}{C} \sum_{c=1}^{C} \left(\boldsymbol{\hat{\phi}}^{c} - \boldsymbol{\phi}\right)^2}$ , where

$c=[1, ..., C]$ is the repetition index,

$\hat{\boldsymbol{\phi}}$ is the estimated MD map and

$\boldsymbol{\phi}$ is the

$\textbf{IWLLS}$ MD estimate using fully-sampled set (

$N=33$ ).

Proc. Intl. Soc. Mag. Reson. Med. 30 (2022)

1981

DOI: https://doi.org/10.58530/2022/1981