Model-Assisted
Deep Learning-Based Reconstruction: Does the Model Help?
Yue Guan1, Yudu Li2,3, Ruihao Liu1,2, Ziyu Meng1, Yao Li1, Yiping P. Du1, and Zhi-Pei Liang2,4 1School of Biomedical Engineering, Institute for Medical Imaging Technology, School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China, 2Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, United States, 3National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL, United States, 4Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, United States
Deep
learning-based image reconstruction has two known practical issues: (a)
sensitivity to data perturbations, and (b) poor generalization. An approach to
addressing these issues is to use classical signal models to assist/constrain
deep learning. This paper performs a systematic analysis of the role of signal
models in model-assisted deep learning-based reconstruction. Our results show
that signal models (e.g., subspace model or sparse model) can substantially reduce
the sensitivity of deep learning-based reconstruction to data perturbations;
they can also help improve generalization capability.
Introduction
Deep
learning (DL)-based image reconstruction methods have achieved impressive performance
in many image reconstruction applications1. However, DL-based image
reconstruction has two known practical issues2,3: (a) sensitivity to
data perturbations (e.g., change in data sampling scheme), and (b) poor
generalization (e.g., novel image features). An approach to addressing these
issues is to use classical signal models to assist/constrain DL4-7. A
number of relevant questions arise, which include: 1)
Does
the model improve the robustness to changes in data acquisition parameters? 2)
Does
the model help reduce dependence on the selection of training data?
3)
Does
the model improve the reconstruction performance on unseen subject-specific
novel features? 4)
Can
the model help assess the performance of DL-based reconstruction?
This paper performs a systematic
analysis of the role of signal models in model-assisted DL-based
reconstruction. We used two classical signal models: subspace model and sparse
model as examples to address the above questions. Our results show that signal
models especially the subspace model can substantially reduce the sensitivity
of DL-based reconstruction to data perturbations; they can also help improve
generalization capability especially with limited training data and/or novel
image features. We also demonstrated the potential of using signal models for
performance assessment.
Methods
Conventional
DL-based image reconstruction methods use training data to learn an end-to-end
nonlinear mapping from degraded images (or limited k-space data) to desired
images8,9. Such an approach is appropriate when large amount of
training data are available based on the universal approximation theorem10.
However, many imaging applications only have small amount of training data,
leading to two major problems for conventional DL-based image reconstruction:
(a) sensitivity to data perturbations, and (b) poor generalization.
One strategy to address the above
problems is combining DL with traditional signal models. There are two main approaches:
a) DL-assisted model-based reconstruction (such as ADMM net11 etc), and
b) model-assisted DL-based reconstruction (such as, subspace or sparse model
assisted DL-based reconstruction4-7). The former emphasizes the role
of the model and to a large extent, still belongs to the class of classical
model-based reconstruction methods except for the use of DL for model
optimization (e.g., with a learned regularization functional). The latter emphasizes
the role of DL with the model serving as an “assistant”. This work focused on
analyzing the role of the signal models in this class of DL-based
reconstruction methods. We chose two popular signal models: the sparse and subspace
model, and one popular end-to-end variational network (E2E-VN)12,
for our study. More specifically, the end-to-end network was trained to provide
the following mapping relationship: $$\hat{\rho}(x)=f_{\text{nn}}(\rho_{m}(x)),\ \ \ \ \ \ \ \ \ \ (1)$$ where $$$\rho_{m}(x)$$$ and $$$\hat{\rho}(x)$$$ represent the network input and output,
respectively. In conventional DL-based reconstruction, $$$\rho_{m}(x)$$$ is often the degraded image (say, zero-padded
Fourier reconstruction). In model-assisted DL-based reconstruction, $$$\rho_{m}(x)$$$ is the initial reconstruction from a model.
Specifically, in sparse model-assisted DL, $$$\rho_{m}(x)$$$ is obtained as: $$\hat{\rho}_{m}=\text{arg}\min_{\rho_{m}}||d-E\rho_{m}||_2^2+\lambda||W\rho_{m}||_1,\ \ \ \ \ \ \ (2)$$ where $$$d$$$, $$$E$$$, $$$W$$$ represent
the measurement, encoding operator and sparse transformation, respectively. In subspace model-assisted DL, $$$\rho_{m}(x)$$$ is obtained by4: $$\hat{\rho}_{m}=\sum_{r=1}^Ra_{r}\phi_{r}(x),\ \ \ \ (3)$$ with $$\left\{\hat{a}_{r}\right\}=\text{arg}\min_{\left\{a_{r}\right\}}||d-E(\sum_{r=1}^Ra_{r}\phi_{r}(x))||_2^2-\sigma^2\text{log}(p(\left\{a_{r}\right\})),\ \ \ \ \ \ \ (4)$$ where $$$\phi_{r}(x)$$$ is the pre-learned basis functions (obtained,
for example, by principal component analysis of the training images), $$$a_{r}\sim p(\left\{a_{r}\right\})$$$ the corresponding coefficients and $$$\sigma^2$$$ the variance of the measure noise.
Results and Discussion
We have
systematically investigated the role of signal models in model-assisted
DL-based image reconstruction in improving robustness and generalization capability using the experimental data from the fastMRI
dataset13. A couple of key points are summarized here.
First, signal
models can substantially
enhance
the stability of DL-based reconstructions with respect to changes in data acquisition
parameters and selection of training data. A couple of representative results are
shown in Figs. 1-2. The improved stability comes from the fact that model-based
reconstruction can naturally account for data acquisition changes through the
forward operator and incorporate image priors through constrained models, thus providing
a built-in “filtering” function to reduce “nuisance” signal variations induced
by changes in data acquisition parameters and selection of training data.
Second, signal
models can help
improve generalization capability. A couple of examples are shown in Figs. 3-4.
The
enhanced generalization capability is a result of both dimension reduction and
effective representation of novel features through the signal models. More
specifically, subspace model or sparse model represents the desired image
function in a much lower-dimensional space through low-rank or sparse
modelling. Therefore, they substantially reduce the dimensionality of the
underlying learning problem, thus reducing the dependence on training data and
helping recover novel features from limited data.
We also found that classical signal
models can help assess the DL reconstruction performance by examining the subspace
extrapolation effect of DL processing, illustrated in Fig. 5.
Conclusions
This
paper performs a systematic analysis of the role of signal models in
model-assisted DL-based reconstruction. Our results show that signal models can
improve both the robustness and generalization capability of DL-based
reconstruction. This work may provide useful insights into developing new
methods to more effectively integrate classical signal models with DL for image reconstruction from limited data.
Acknowledgements
This work was supported by Shanghai
Pilot Program for Basic Research—Shanghai Jiao Tong University (21TQ1400203);
the National Natural Science Foundation of China (81871083, 62001293); and Key
Program of Multidisciplinary Cross Research Foundation of Shanghai Jiao Tong
University (YG2021ZD28).
References
[1]
D. Liang, J. Cheng, Z. Ke, et al, Deep Magnetic Resonance Image Reconstruction:
Inverse Problems Meet Neural Networks, IEEE Signal Process. Mag., 2020; 37(1):141–151.
[2]
V. Antun, F. Renna, C. Poon, et al, On instabilities of deep learning in image
reconstruction and the potential costs of AI, Proc. Natl. Acad. Sci. U. S. A.,
2020;117(48):30088–30095.
[3]
Knoll F, Hammernik K, Kobler E, et al. Assessment of the generalization of
learned image reconstruction and the potential for transfer learning[J].
Magnetic resonance in medicine, 2019, 81(1): 116-128.
[4]
Guan, Y., Li, Y., Li, Y., et al. Image reconstruction with subspace-assisted
deep learning. In Proceedings of the 31th Annual Meeting of ISMRM, 2022
[5]
Ke, Z., Guan Y., Li, Y., et al. Learned Subspace Model Enables Ultrafast
Neonatal Brain MR Imaging. In Proceedings of the 31th Annual Meeting of ISMRM,
2022
[6]
Guan Y., Li, Y., Meng Z., et al. Highly Accelerated MPRAGE Imaging of the Brain
Incorporating Deep Learning Priors with Subject-Specific Novel Features. In
Proceedings of the 29th Annual Meeting of ISMRM, 2020.
[7]
Muckley M J, Riemenschneider B, Radmanesh A, et al. Results of the 2020 fastmri
challenge for machine learning mr image reconstruction[J]. IEEE transactions on
medical imaging, 2021, 40(9): 2306-2317.
[8]
Zhu B, Liu J Z, Cauley S F, et al. Image reconstruction by domain-transform
manifold learning[J]. Nature, 2018, 555(7697): 487-492.
[9]
Yang G, Yu S, Dong H, et al. DAGAN: deep de-aliasing generative adversarial
networks for fast compressed sensing MRI reconstruction[J]. IEEE transactions
on medical imaging, 2017, 37(6): 1310-1321.
[10]
Cybenko G. Approximation by superpositions of a sigmoidal function[J].
Mathematics of control, signals and systems, 1989, 2(4): 303-314.
[11]
Y. Yang, J. Sun, H. Li, and Z. Xu, Deep ADMM-Net for compressive sensing MRI,
in Advances in Neural Information Processing Systems, 2016:10–18.
[12]
Sriram A, Zbontar J, Murrell T, et al. End-to-end variational networks for
accelerated MRI reconstruction[C]//International Conference on Medical Image
Computing and Computer-Assisted Intervention. Springer, Cham, 2020: 64-73.
[13]
Knoll F, Zbontar J, Sriram A, et al. fastMRI: A publicly available raw k-space
and DICOM dataset of knee images for accelerated MR image reconstruction using
machine learning[J]. Radiology. Artificial intelligence, 2020, 2(1).
Figures
Figure
1. Illustration of the improved robustness of sparse/subspace model-assisted end-to-end
variational network (E2E-VN) for number of encoding changes. As can be seen,
with the assistance of signal models, networks trained under an acceleration
factor of 8 produced more stable results when tested under acceleration factors
not included in the training, especially for increased number of encodings.
Figure
2. Illustration of the reduced dependence on the selection of training data
number for model-assisted end-to-end variational network (E2E-VN) (R=8).
Figure
3. Illustration of the improved generalization capability of
sparse/subspace-model assisted end-to-end variational network (E2E-VN) for
normal test case (R=8). The role of these two models is comparable with similar
reconstruction error and quality metrics.
Figure
4. Illustration of the improved generalization capability of
sparse/subspace-model assisted end-to-end variational network (E2E-VN) for
lesions not included in the training data (R=8).
Figure
5. Illustration of subspace extrapolation by DL processing. The subspace
model-based reconstruction was obtained using 1500-dimensional subspace. The
reconstruction result showed that the DL processing effectively extrapolated
the data to higher-dimensional space through built-in nonlinear processing.