2390

Explainable machine learning for microstructural imaging of neonatal brain

Yihan Wu^1,2, Hamza Kebiri^1,3, Ali Gholipour¹, and Davood Karimi¹
¹Harvard Medical School & Boston Children's Hospital, Boston, MA, United States, ²Zhejiang University, Hangzhou, China, ³Center for Biomedical Imaging & Lausanne University Hospital, Lausanne, Switzerland

Synopsis

Keywords: Machine Learning/Artificial Intelligence, Diffusion/other diffusion imaging techniques, NODDI

Deep learning has a great potential for estimating brain tissue micro-structure from diffusion MRI measurements. However, it is hard to understand and interpret how these models work. Therefore, until now these deep learning models have been designed using ad-hoc approaches. In this work, we propose a method for determining the contribution of different measurements to the prediction produced by these deep learning models. We apply this method for estimating the parameters of the NODDI model for the neonatal brain. Results show that this method is highly effective in determining the subsets of the measurements that result in lower estimation error.

Introduction

Deep learning (DL) models have shown great promise in estimating brain tissue micro-structure from under-sampled diffusion MRI (dMRI) measurements. However, given the black-box nature of these models and the complexity of the relation between tissue micro-structure and the dMRI signal, understanding the inner workings of these models is a challenging task that has not been attempted before. No prior work has analyzed the dependence of these models on the dMRI measurements in a systematic way. As a result, these DL models have been designed using ad-hoc approaches that may result in highly sub-optimal models. A technique that can determine the contribution of different measurements to the prediction would be of high practical value. Such a technique can identify the measurements that are most critical for the prediction task and those that have insignificant contribution, thereby informing us in designing more accurate models as well as in acquiring the most useful measurements in the limited scan time.

Methods

Model
In this work, we focus on two brain micro-structure parameters from the NODDI model¹: Orientation Dispersion Index (ODI) and Neurite Density Index (NDI). These are powerful biomarkers of brain development and degeneration. Figure 1 describes the DL model that we use to estimate these parameters from multi-shell dMRI data.

Methods
Consider a function $$$y= f(X)$$$ that estimates $$$y$$$ from measurements $$$X \in \mathbb{R}^m$$$. It is often highly desirable to know the contribution of each measurement, or subset of measurements, to the estimation of $$$y$$$. The brute-force approach is to examine all possible measurement combinations, but even when m=30 there will be more than a billion combinations.
We propose a method based on Quantitative Input Influence (QII) measures². Using QII, the marginal contribution of measurement $$$i$$$ over a given measurement set $$$S$$$ can be estimated as²:
$$\text{QII}(i, S)= \mathop{\mathbb{E}} ( f(X_{-S} U_S) - f(X_{-S \cup i} U_{S \cup i}) )$$
where $$$-S$$$ denotes inputs that are not in $$$S$$$ and $$$U$$$ is an intervention distribution. In this work, we choose $$$U$$$ to be the distribution of the mean of the measurement that is to be intervened (i.e., shell and gradient direction) computed over the training data. Based on QII, we propose a new measure for quantifying the power of a subset of the measurements for computing the parameter of interest. We call this new measure the Measurement Subset Prediction Power (MSPP), and define it as follows:
$$\text{MSPP}(S)= \mathop{{}\mathbb{E}}_{S^* \subseteq S} ( \text{QII}(S^*, S \setminus S^*) - \mathop{{}\mathbb{E}}_{S^* \subseteq \{1, ..., m \} \setminus S } ( \text{QII}(S^*, S) ) )$$
Highly accurate approximations of QII and MSPP can be efficiently computed using Monte-Carlo sampling methods. MSPP allows us to compare the prediction power of different measurement subsets for estimating the parameters of interest.

Data and experiments
We use dMRI scans of 90 subjects from the dHCP dataset³; 60 subjects are used for training and 30 subjects for test. Each dMRI scan has 300 measurements in three shells: b=0 (n=20), b=400 (n=64), b=1000 (n=88), b=2600 (n=128). In order to compute the reference ground-truth ODI and NDI images for training and evaluation, we apply an optimization-based method implemented in Dmipy⁴ on all 300 measurements. Highly under-sampled subsets of the measurements are used with the proposed deep learning method and Dmipy to reconstruct ODI and NDI images, which are compared with the ground truth to compute the error.

Results

Figure 2 shows plots of MSPP versus the root mean square of estimation error (RMSE) for ODI and NDI. To compute the RMSE we trained a new deep learning model with the selected measurement subset from scratch, whereas MSPP values were computed using a single model trained with all measurements using Equation 2. The proposed MSPP has a mostly monotonic relation with estimation error. This suggests that MSPP can be used as a reliable surrogate for RMSE in choosing the measurements that result in lower estimation error, thereby removing the need to train a new model for every subset of measurements.
Figures 3 and 4 show example ODI and NDI images reconstructed from 18 measurements with the proposed deep learning method and a classical optimization-based method implemented in Dmipy. Dmipy was also used to compute the reference ground truth using 300 measurements. These figures show the results with two different subsets of 18 measurements: (i) Equal number of six measurements from each shell, and (ii) Measurements in each shell selected based on MSPP. The results show lower RMSE for the deep learning method than for Dmipy. They also show lower RMSE with the measurements selected based on MSPP. For both ODI and NDI, statistical analysis on test subjects showed that the RMSE for the deep learning method was significantly ($$$p<0.001$$$) lower than for Dmipy, and that the RMSE achieved with the measurements selected based on MSPP was significantly ($$$p<0.001$$$) lower than when equal number of measurements in each shell were selected.

Conclusion

Model explainability techniques based on QII can be used to determine the most informative diffusion MRI measurements for estimating brain microstructure. Such techniques can be of high practical value as they can 1) help us design more accurate models, and 2) determine the measurements that should be acquired to achieve lower estimation error.

Acknowledgements

This research was partly supported by the US National Institutes of Health (NIH) under awards R01NS106030 and R01EB032366; by the Office of the Director of the NIH under award S10OD0250111; and by NVIDIA Corporation; and utilized an NVIDIA RTX A6000 GPU and RTX A5000 GPUs.. This work was partly supported by a Mobility grant from the Swiss National Science Foundation (project 205321-182602). We acknowledge the CIBM Center for Biomedical Imaging, a Swiss research center of excellence founded and supported by CHUV, UNIL, EPFL, UNIGE, HUG and the Leenaards and Jeantet Foundations. Newborn data were provided by the developing Human Connectome Project, KCL-Imperial-Oxford Consortium funded by the European Research Council under the European Union Seventh Framework Programme (FP/2007-2013) / ERC Grant Agreement no. [319456]. We are grateful to the families who generously supported this trial.

References

1. Gibbons, E.K., Hodgson, K.K., Chaudhari, A.S., Richards, L.G., Majersik, J.J., Adluru, G. and DiBella, E.V., 2019. Simultaneous NODDI and GFA parameter map generation from subsampled q‐space imaging using deep learning. Magnetic resonance in medicine, 81(4), pp.2399-2411.

2. Datta, A., Sen, S. and Zick, Y., 2016, May. Algorithmic transparency via quantitative input influence: Theory and experiments with learning systems. In 2016 IEEE symposium on security and privacy (SP) (pp. 598-617). IEEE.

3. Bastiani, M., Andersson, J.L., Cordero-Grande, L., Murgasova, M., Hutter, J., Price, A.N., Makropoulos, A., Fitzgibbon, S.P., Hughes, E., Rueckert, D. and Victor, S., 2019. Automated processing pipeline for neonatal diffusion MRI in the developing Human Connectome Project. Neuroimage, 185, pp.750-763.

4. Fick, R.H., Wassermann, D. and Deriche, R., 2019. The dmipy toolbox: Diffusion mri multi-compartment modeling and microstructure recovery made easy. Frontiers in neuroinformatics, 13, p.64.

5. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł. and Polosukhin, I., 2017. Attention is all you need. Advances in neural information processing systems, 30.

Figures

Figure 1. The DL model to estimate NDI and ODI. To enable a single model to work with any number of measurements, the measurements are first projected onto a spherical harmonics (SH) basis, separately for each shell. We use an SH order of 4. The SH-represented measurements are used as input to the DL model, which consists of 4 consecutive encoder modules from a transformer network⁵. The model follows the design of the standard transformer network with a learnable positional encoding.

Figure 2. Each dot represents an experiment with a unique set of 18 measurements. In each plot, the black square is for the experiment with an equal number of (six) measurements in each shell. The red star is for the measurement set that resulted in the highest MSPP among the explored sets. For ODI, the red star corresponds to the set with all 18 measurements in b=1000 shell, whereas for NDI it corresponds to the set with 12 and 6 measurements in b=1000 and b=2600 shells, respectively.

Figure 3. Reconstructions of ODI and the associated error maps. The reference image was computed with Dmipy⁴ using 300 measurements in three shells. The proposed deep learning method and Dmipy were applied on subsets of 18 measurements and compared with the reference to compute the estimation error maps. These 18 measurements were either: (i) selected following a pre-determined scheme with equal number of six measurements in each shell, or (ii) the measurement subset that resulted in highest MSPP.

Figure 4. Reconstructions of NDI and the associated error maps. The reference image was computed with Dmipy using 300 measurements in three shells. The proposed deep learning method and Dmipy were applied on subsets of 18 measurements and compared with the reference to compute the estimation error maps. These 18 measurements were either: (i) selected following a pre-determined scheme with equal number of six measurements in each shell, or (ii) the measurement subset that resulted in highest MSPP.

Proc. Intl. Soc. Mag. Reson. Med. 31 (2023)

2390

DOI: https://doi.org/10.58530/2023/2390