2771

Efficient Non-Uniform Fourier Transform with embedded low-rank projection for fast model-based MRI reconstruction
Matteo Cencini1,2, Luca Peretti2,3, and Michela Tosetti1,2
1IRCCS Stella Maris, Pisa, Italy, 2Imago7 Foundation, Pisa, Italy, 3Università di Pisa, Pisa, Italy

Synopsis

Modern approaches to iterative imaging, such as model-based reconstruction, requires efficient implementations of Non-Uniform Fourier Transform to reach feasible reconstruction times. In addition, low-rank subspace projection is often used to reduce computational burden. While many implementations of NUFFT currently exists, they are not optimized for this kind of problems. Here, we propose a fast and memory-efficient NUFFT operator with embedded low-rank subspace projection. We demonstrate an order of magnitude of speed-up with comparable image quality compared to other high-level implementations.

Introduction

Since its introduction in the seminal work from Lustig et al.1, compressed sensing and iterative MR reconstruction enabled unprecedented acceleration in MR imaging without harming image quality, significantly improving patient comfort. In the last few years, the widespread diffusion of open-source image reconstruction frameworks, such as BART2, Gadgetron3 and SigPy4, allowed researchers to further develop novel advanced reconstruction strategies. Among these, model-based reconstruction5 represents a promising research area, enabling fast and artefact-free parametric mapping of the tissues by incorporating knowledge of the physics in the encoding model. This approach has a significant computational burden, which can be alleviated by approximating MR signal with a linear combination of a basis function representing the space of possible signal evolutions, i.e. constraining the solution on a low-rank subspace5,6. Many iterative and model-based reconstruction requires the use of Non-Cartesian k-space sampling trajectories. While several efficient open-source implementations of Non-Uniform Fourier Transform (NUFFT) exist, to the best of our knowledge there are no implementation specifically optimized for low-rank model-based reconstructions. Here, we introduce a fast and memory-efficient open-source NUFFT implementation embedding the low-rank projection operation. The library will be made publicly available under https://github.com/FiRMLAB-Pisa/lr-nufft-torch to interested researchers.

Methods

Theory
Non-Uniform Fourier transform with embedded low-rank projection and its adjoint can be represented by the following equations7:
$$ y = (\phi G)\cdot F \cdot P \cdot A \cdot x (NUFFT) $$
$$ x = A \cdot P^H \cdot F^H \cdot (G^H \phi^H) D y (NUFFT adjoint) $$
Where A, P, F, G, $$$\phi$$$ represent apodization, signal padding, Cartesian Fourier Transform, interpolation to non-Cartesian location and projection from low-rank basis to time domain; here, H represents adjoint operation and D is the density compensation function. Cartesian to Non-Cartesian interpolation and its adjoint (gridding) represents the most intensive computational steps; here, they are fused with the subspace projection operator to allow for compiler optimization and reduce memory usage. Pseudo-code for fused operation is shown in Figure 1.

Implementation details
The library is written in Python 3.7. Core operators (padding, apodization, Cartesian Fourier transform) are based on Pytorch 1.10, while fused projection-interpolation/-gridding are written in Numba 0.54.1 (an open-source Just-In-Time compiler for Python) to allow fast computation and easy deployment. Fused projection-interpolation/-gridding exploit Numba parallelization to speed-up computation. Parallelization is performed over frames and image batches (i.e. different receive channels and slices) to avoid writing conflicts. Interpolation was adapted from the Kaiser-Bessel convolution gridding included in SigPy4. To achieve short computation time while maintaining Validation and Benchmark To test our library, we simulated a Fast-Spin Echo acquisition for T2 mapping. A Shepp-Logan phantom was used as a virtual object. The phantom was segmented in three regions corresponding to White Matter (T2=70ms), Gray Matter(T2=83ms) and CSF (T2=329ms). K-space sampling was based on a 2D radial projection with golden-angle ordering of the spokes (Figure 2a). Low-rank subspace basis was computed by simulating an ensemble of 300 spin (T2=1:329ms, linearly spaced) with the following signal equation: where the echo time TE ranged from 1 to 300ms Figure 2b. In this experiment, we simulated a k-t acquisition with 1000 echo times (here called frames), each sampled with 100 spokes of the trajectory. Ground truth subspace coefficient for the virtual object were generated by projecting signal evolution corresponding to phantom tissues on the subspace basis. We assessed the performances of the library by comparing it with a recent open-source implementation (Torch-KB-NUFFT8). Zero-filled k-space data for all the subspace coefficients were pre-computed to be fed into Torch-KB-NUFFT routines, in order to fully exploit its capability. All the tests were performed on a laptop equipped with an Intel Xeon 12 Core CPU and 32GB of RAM.

Results and Discussion

Figure 3 shows the comparison between ground truth, Torch-KB-NUFFT NUFFT-adjoint results and our implementation. Both our implementation and Torch-KB-NUFFT achieve comparable results. Figure 4 shows the difference in terms of computation times. Our implementation outperforms Torch-KB-NUFFT by an order of magnitude both for forward and adjoint steps. Here, we demonstrated significant speed-up of NUFFT for low-rank constrained reconstruction compared to available open-source implementation. While the Echo train length (1000 echoes) is much longer than realistic implementations, modern approaches such as MR Fingerprinting9 easily reach these values. Similarly to Torch-KB-NUFFT, our NUFFT operators are wrapped as Pytorch Module operators and could be used in end-to-end Deep Learning reconstructions. Being based on pure Python, with acceleration via Numba, the proposed implementation is easy to extend. Further performance gain could be achieved at cost of readability by using lower level languages such as C++ in fused interpolation/subspace projection. Future work will be focused on test the library on massively parallel computational devices (such as GPUs). However, this should be relatively straightforward thanks to Numba and Pytorch backends. In addition, we will test the library in higher dimensional problems (3D+t) and fused self-adjoint operators to speed-up iterative algorithms.

Conclusion

The proposed implementation reaches one order of magnitude of speed-up in fused NUFFT and low rank projection while maintaining comparable image quality.

Acknowledgements

The results here presented have been developed in the framework of the 18HLT05 QUIERO Project. This project has received funding from the EMPIR Programme co-financed by the Participating States and from the European Union’s Horizon 2020 Research and Innovation Programme.

References

1. Lustig M, Donoho D, Pauly JM. Sparse MRI: The application of compressed sensing for rapid MR imaging. Magn Reson Med. 2007;58(6):1182-1195. doi:10.1002/mrm.21391

2. Uecker M, Ong F, Tamir JI, et al. Bart: Version 0.2.08. Zenodo; 2015. doi:10.5281/ZENODO.18041 3. Gadgetron: An open source framework for medical image reconstruction - Hansen - 2013 - Magnetic Resonance in Medicine - Wiley Online Library. Accessed November 11, 2021. https://onlinelibrary.wiley.com/doi/10.1002/mrm.24389

4. (ISMRM 2019) SigPy: A Python Package for High Performance Iterative Reconstruction. Accessed November 11, 2021. https://archive.ismrm.org/2019/4819.html

5. Wang X, Tan Z, Scholand N, Roeloffs V, Uecker M. Physics-based reconstruction methods for magnetic resonance imaging. Philos Trans R Soc Math Phys Eng Sci. 2021;379(2200):20200196. doi:10.1098/rsta.2020.0196

6. Tamir JI, Uecker M, Chen W, et al. T2 shuffling: Sharp, multicontrast, volumetric fast spin-echo imaging. Magn Reson Med. 2017;77(1):180-195. doi:10.1002/mrm.26102

7. Fessler JA, Sutton BP. Nonuniform Fast Fourier Transforms Using Min-Max Interpolation. :19.

8. Muckley, M.J., Stern, R., Murrell, T. and Knoll, F., 2020. TorchKbNufft: A high-level, hardware-agnostic non-uniform fast fourier transform. In ISMRM Workshop on Data Sampling & Image Reconstruction.

9. Ma D, Gulani V, Seiberlich N, et al. Magnetic resonance fingerprinting. Nature. 2013;495(7440):187-192. doi:10.1038/nature11971

Figures

Figure 1 Pseudo-code for forward and adjoint interpolation / gridding with embedded subspace projection operator.

Figure 2 (a) 2D golden-angle radial trajectory used for simulation. (b) ensemble of Fast Spin Echo signal evolution and corresponding subspace basis (4 coefficients).

Figure 3 Results of adjoint NUFFT for proposed and Torch-KB-NUFFT compared to ground truth. Both libraries achieve similar results.

Figure 4 Computation times for proposed and Torch-KB-NUFFT. Proposed library achieves one order of magnitude of speed-up for both forward and adjoint operations.

Proc. Intl. Soc. Mag. Reson. Med. 30 (2022)
2771
DOI: https://doi.org/10.58530/2022/2771