3109

DTI-Net: Unsupervised Diffusion Tensor Reconstruction Using Implicit Neural Representation
Yuting Shi1, Yuyao Zhang2, and Hongjiang Wei1
1Shanghai Jiao Tong University, Shanghai, China, 2ShanghaiTech University, Shanghai, China

Synopsis

Keywords: Image Reconstruction, Machine Learning/Artificial Intelligence

Diffusion tensor imaging (DTI) needs a large number of diffusion-weighted images (DWIs) to reliably reconstruct the diffusion measurements of the brain white matter, making the data acquisition time-consuming. Deep learning has emerged as a powerful technique to reduce the number of acquired DWIs. While most existing deep learning methods are supervised and need high-quality ground truth data as the training labels. Here, we proposed an unsupervised and subject-specific DTI reconstruction method called DTI-Net to significantly reduce the required number of DWIs, while also can simultaneously conduct the super-resolution reconstruction of the tensors.

Introduction

Diffusion Tensor Imaging (DTI)1 is a powerful technique to map tissue microstructure using a second-order tensor model. Although the tensor model theoretically requires only six diffusion-weighted images (DWIs) and one non-diffusion encoded image for tensor reconstruction, a large number of DWIs are always required in practice because the tensor model is sensitive to noise contamination. It would significantly increase the scan time and thus increase the patients’ discomfort and is vulnerable to motion.
Deep learning has emerged as a promising technique to reduce the number of required DWIs due to its strong learning capacity. However, traditional convolutional neural network (CNN)-based reconstruction methods, such as DeepDTI2 and SuperDTI3, require the corresponding high-quality ground truth data and have generalization issues when testing on the data acquired with different acquisition parameters. Implicit neural representation (INR)4 is a recently developed technique that parameterizes the signal as a continuous function and learns this continuous representation by regressing the spatial coordinate to the objective volume intensity using multi-layer perception (MLP). It is unsupervised and training databases free. Inspired by INR, we proposed an unsupervised and subject-specific DTI reconstruction method, DTI-Net, to simultaneously reconstruct DTI and perform super-resolution reconstruction with a significantly reduced number of DWIs.

Methods

DTI-Net Pipeline
The overview structure of the proposed DTI-Net is displayed in figure 1. DTI-Net uses MLP to map the 2D coordinates $$$(x,y)$$$ to the corresponding tensor field, with MLP weights adjusted for each reconstructed sample. Before the first layer of MLP, 2D spatial coordinate was projected to a higher dimension using position encoding5 to better represent high-frequency signals. The MLP contains 9 fully connected (FC) layers and there is a residual connection between the first layer and the fourth layer, with each layer followed by ReLU activation except the last layer. The output layer predicted 6 tensor elements ($$$\mathbf{D} = {\left[ {{D_{xx}},{D_{xy}},{D_{xz}},{D_{yy}},{D_{yz}},{D_{zz}}} \right]^T}$$$) by 6 channels, separately. Then the predicted tensor was used to calculate as equation (1) that is derived from DTI model.
$$ \mathbf{X} = - {{ln({S_i}/{S_0})} \over b} = \mathbf{g_i}^TD\mathbf{g_i} = \mathbf{AD}\tag{1}$$
$$$\mathbf{g_i} = {[{g_{ix}},{g_{iy}},{g_{iz}}]^T}$$$ is the unit vector of diffusion-encoding direction. $$${{S_i}}$$$ is the DWI signal and $$${{S_0}}$$$ is the non-diffusion image, $$$b$$$ is the b-value. $$$\mathbf{A} = [\mathbf{a_1},\mathbf{a_2},...,\mathbf{a_n}]$$$ is the diffusion tensor transformation matrix that depends on the diffusion-encoding directions with $$$\mathbf{a_i} = [g_{ix}^2,2{g_{ix}}{g_{iy}},2{g_{ix}}{g_{iz}},g_{iy}^2,2{g_{iy}}{g_{iz}},g_{iz}^2]$$$ and $$$n$$$ is the number of DWIs. Finally, the loss function is defined as equation (2).
$$L = {\left\| {\mathbf{A\widehat D} - \mathbf{AD}} \right\|_1} + \lambda {\left\| {\nabla \mathbf{\widehat D}} \right\|_1}\tag{2}$$
$$$\lambda $$$ is the weight of total variation (TV) loss which we set as 0.5 in this study.
We compared the proposed DTI-Net to the conventional DTI reconstruction method using 6, 9 and 12 DWIs. In addition, we also tested the DTI-Net and traditional bilinear and cubic interpolation methods for super-resolution reconstruction with 15-fold upsampling.

Data Acquisition
DTI data were acquired using a spin-echo echo planar imaging (SE-EPI) sequence: FOV = 210×210×156 mm3, matrix size = 140×140×104, flip angle = 90°, TR = 4500 ms, TE = 72 ms, spatial resolution = 1.5×1.5×1.5 mm3. The DTI data sets were acquired with 5 b0 images and 1 b-values of 1000 s/mm2 with 64 diffusion encoding directions. For experiments, we used only one slice for 2D reconstruction.

Results

Figure 2 shows the reconstruction results using 6, 9, and 12 DWIs when using the DTI results derived from 64 DWIs for reference. DTI-Net could effectively reduce noise contamination when the number of DWI was reduced to 9 or 12. The FA and PEV results from DTI-Net using 12 DWIs are very similar to the reference results. For super-resolution reconstruction, as shown in figure 3, there exist many high-intensity points in FA images upsampled by bilinear and cubic interpolation methods, while DTI-Net’s FA shows more continuous anatomical structures. For upsampled PEV, DTI-Net shows a more reasonable color change in white matter fibers.

Discussion

In this study, we proposed a deep learning-based method for DTI reconstruction using implicit neural representation, DTI-Net, to reduce the number of DWIs for DTI reconstruction. DTI-Net is subject-specific and training databases free. The proposed DTI-Net shows the powerful capacity for reconstructing high-quality DTI results using fewer DWIs, which effectively reduces the acquisition time. Additionally, DTI-Net learns continuous representation from discrete measurements, leading to better super-resolution reconstruction results than traditional interpolation methods.

Conclusion

In conclusion, our preliminary results indicated that the proposed DTI-Net has the potential to effectively reduce the number of acquired DWIs for high-quality DTI reconstruction and is able to simultaneously upsample the DTI results to arbitrary resolution.

Acknowledgements

No acknowledgement found.

References

1. Le Bihan D, Mangin J-F, Poupon C, et al. Diffusion tensor imaging: Concepts and applications. Journal of Magnetic Resonance Imaging. 2001;13(4):534-46.

2. Tian Q, Bilgic B, Fan Q, et al. DeepDTI: High-fidelity six-direction diffusion tensor imaging using deep learning. NeuroImage. 2020;219:117017.

3. Li H, Liang Z, Zhang C, et al. SuperDTI: Ultrafast DTI and fiber tractography with deep learning. Magn Reson Med. 2021;86(6):3334-47.

4. Sitzmann V, Zollhöfer M, Wetzstein G. Scene representation networks: Continuous 3d-structure-aware neural scene representations. Advances in Neural Information Processing Systems. 2019;32.

5. Mildenhall B, Srinivasan PP, Tancik M, et al. Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM. 2021;65(1):99-106.

Figures

Figure 1. The overview structure of the DTI-Net.

Figure 2. FA and PEV images reconstructed using 6, 9 and 12 DWIs from the traditional DTI model and DTI-Net.

Figure 3. Super-resolution results using different interpolation methods.

Proc. Intl. Soc. Mag. Reson. Med. 31 (2023)
3109
DOI: https://doi.org/10.58530/2023/3109