1317

Rapid Dealiasing of Undersampled, Radial First-Pass Cardiac Perfusion MR Images using 3D Residual U-Net

Lexiaozi Fan^1,2, Daming Shen^1,2, Hassan Haji-Valizadeh³, Nivedita K Naresh⁴, James C. Carr¹, Benjamin H. Freed⁵, Daniel C. Lee⁵, and Daniel Kim^1,2
¹Department of Radiology, Northwestern University Feinberg School of Medicine, Chicago, IL, United States, ²Department of Biomedical Engineering, Northwestern University, Evanston, IL, United States, ³Department of Medicine (Cardiovascular Division), Beth Israel Deaconess Medical Center & Harvard Medical School, Boston, MA, United States, ⁴Department of Radiology, University of Colorado Denver, Denver, CO, United States, ⁵Division of Cardiology, Internal Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL, United States

Synopsis

Compressed sensing (CS) is capable of accelerating cardiac perfusion MRI for achieving high spatial resolution (1.6 mm x 1.6 mm x 8 mm) and extensive spatial coverage (6+ slices per heartbeat), but the lengthy image reconstruction time (~8 min per slice with 64 frames using GPU) hinders its clinical translation. In this study, we sought to, for the first time, rapidly reconstruct accelerated cardiac perfusion data using a 3D residual U-net for clinical translation.

Introduction

Compressed sensing¹ can be used to accelerate 2D cardiac perfusion MRI and achieve clinically acceptable image quality with high spatial resolution (1.6mm x 1.6mm x 8mm) and extensive myocardial coverage (6-8 2D slices per heartbeat), as previously described2. While CS is a promising method from an imaging perspective, its relatively lengthy image reconstruction (e.g. ~8min per slice with 64 frames using GPU) hinders its clinical translation. In this study, we sought to implement an image reconstruction pipeline including a 3D residual U-Net and test whether it is capable of reconstructing undersampled, non-Cartesian 2D cardiac perfusion k-space data at least 10 times faster than GPU-accelerated CS reconstruction, without significant loss in data fidelity or image quality.

Methods

Human Subjects & Pulse Sequence: We prospectively enrolled 27 patients and combined the results together with existing raw data of 13 patients (in total 28 men and 12 women, mean age = 51 ± 14 years, see Table1 for details). Relevant image parameters included: field of view (FOV) = 300 mm x 300 mm, acquisition matrix = 192 x 192, spatial resolution = 1.6 mm x 1.6 mm, slice thickness =8 mm, TE/TR =1.5/2.6 ms, flip angle = 12°, receiver bandwidth =700 Hz/pixel, 30 rays per frame with the 7th Fibonacci sequence of golden angles (=23.628°)³, single-shot readout duration per frame = 78 ms, 75 repetitions, electrocardiogram triggering every heartbeat, and 6-8 slices per heartbeat, depending on heart rate. For more details on the pulse sequence, please see reference².
Image reconstruction: Our proposed image reconstruction pipeline includes three sequential steps: In step 1 (pre-processing), GPU-accelerated NUFFT⁴ was used to grid the radial k-space data onto the Cartesian space and SENSE⁵ was used to combine multi-coil zero-filled images, where coil sensitivities were self-calibrated by processing the time average image, as previously described⁶. In step 2 (dealiasing), the trained U-Net was used to dealiase coil-combined, zero-filled image. In step 3, residual aliasing artifacts were filtered using block-wise low rank (BWLR) with a single iteration with image block size = 8 x 8 and normalization threshold value = 10% of the absolute maximum value.
U-Net training and testing: We implemented a 3D residual U-net architecture (Figure 1) on a GPU workstation (P100 Tesla 12 GB memory, NVIDIA; Xeon E5-2650 v4 128 GB memory, Intel); GPU-accelerated CS reconstruction was performed on the same hardware. The detail of our 3D U-Net architecture included: 3 hidden layers, 16 channels, 3x3x3 (x-y-time) convolutional kernels size, and 2x2x2 max-pooling size. For training, we used 132 2D+time datasets with 64 frames per slice (or 8448 2D images in total) from 28 randomly selected patients, where the coil-combined, zero-filled images were used as input and the corresponding CS reconstructed images without BWLR filtering were used as output. For testing, we used 56 2D+time datasets with 64 frames per slice (i.e. 3584 2D images in total) from 12 remaining patients as input to our trained U-Net.
Image quality analysis: Images were graded using quantitative metrics of image quality (SSIM, NRMSE, and edge sharpness assessment). Two clinical readers independently graded the following 3 categories on a 5-point (1: worst, 3: clinically acceptable, 5: best) Likert scale: conspicuity of wall enhancement, noise, and artifact. For statistical analysis, we used the two-tailed, paired t-test for quantitative metrics, and the Wicoxon signed rank test for visual scores, where p < 0.05 was considered significant.

Results

The mean processing time per slice with 64 frames along the proposed pipeline was 30.8 ± 1.4 s for pre-processing (step 1), 0.7 ± 0.1 s for dealiasing (step 2), and 0.6 ± 0.1 s for post-processing (step 3). The mean processing time per slice with 64 frames along the GPU-accelerated CS pipeline was 30.8 ± 1.4 s for pre-processing (step 1), 429.2 ± 16.4 s for dealiasing (step 2), and 1.3 ± 0.1 s for post-processing (step 3). Including all three steps, the mean reconstruction time per slice was on average was 14.4 times shorter for U-Net (32.1 ± 1.4 s) than CS (461.3 ± 16.9 s)(p<0.001). Excluding the pre- and post-processing steps, the mean dealiasing time per slice was on average was 613.1 times shorter for U-Net (0.7 ± 0.1 s) than CS (429.2 ± 16.4 s)(p<0.001). Figure 2 shows representative images of three different patients reconstructed with zero-filling, CS, and U-Net, as well as difference images with respect to CS. Figure 3 shows 3 short-axis images of a patient with a perfusion defect in the septal wall reconstructed with CS and U-Net, as well as difference images with respect to CS. Compared with CS, our proposed method maintained high data fidelity (structural similarity index =0.914±0.023, normalized root mean square error=1.7±0.3%, identical mean edge sharpness of 1.2 mm). As summarized in Table 2, the median conspicuity and noise scores were not significantly different, whereas the artifact score was significantly different. Nonetheless, all scores were above the clinically acceptable (3.0) cut point.

Conclusion

This study demonstrates a reconstruction pipeline including a U-Net that is capable of reconstructing 6.4-fold accelerated, non-Cartesian cardiac perfusion k-space data 14.4 times faster than CS, without significant loss in data fidelity (SSIM>0.90, NRMSE<5%) or image quality.

Acknowledgements

This work is supported by National Institutes of Health (R01HL116895, R01HL138578, R21EB024315, R21AG055954) and the American Heart Association (19IPLOI34760317).

References

1. Lustig M, Donoho D, Pauly JM. Sparse MRI: The application of compressed sensing for rapid MR imaging. Magn Reson Med 2007;58(6):1182-1195.

2. Naresh NK, Haji-Valizadeh H, Aouad PJ, Barrett MJ, Chow K, Ragin AB, Collins JD, Carr JC, Lee DC, Kim D. Accelerated, first-pass cardiac perfusion pulse sequence with radial k-space sampling, compressed sensing, and k-space weighted image contrast reconstruction tailored for visual analysis and quantification of myocardial blood flow. Magn Reson Med 2019;81(4):2632-2643.

3. Wundrak S, Paul J, Ulrici J, Hell E, Geibel MA, Bernhardt P, Rottbauer W, Rasche V. Golden ratio sparse MRI using tiny golden angles. Magn Reson Med 2016;75(6):2372-2378.

4. Knoll F, Schwarzl A, Diwoky C, Sodickson DK. gpuNUFFT-an open source GPU library for 3D regridding with direct Matlab interface. In: Proceedings of the 22rd Annual Meeting of ISMRM, Melbourne, Australia 2014. Abstract No. 4297.

5. Pruessmann KP, Weiger M, Scheidegger MB, Boesiger P. SENSE: sensitivity encoding for fast MRI. Magn Reson Med 1999;42(5):952-962.

6. Walsh DO, Gmitro AF, Marcellin MW. Adaptive reconstruction of phased array MR imagery. Magn Reson Med 2000;43(5):682-690.

Figures

Table 1: Clinical characteristics of 40 patients enrolled for this study. Only one patient presented with LGE in the septal wall due to myocardial infarction. The remaining 39 patients did not show LGE during cardiac MRI. A detailed list of clinical profiles is not relevant for this study and is thus omitted due to space constraint.

Table 2: Summary of visual scores. The wall enhancement, artifact, and noise scores are presented as median (interquartile range [IQR]). Conspicuity of wall enhancement: 1 = nondiagnostic; 2 = poor; 3 = clinically acceptable; 4 = good; 5 =excellent. Noise and artifact levels: 1 = nondiagnostic; 2 = severe; 3 = moderate; 4 = mild; 5 = minimal. P < 0.05 was considered statistically significant.

Figure 1. A schematic architecture of residual 3D U-Net. The number above each block corresponds to the number of features for each layer. The number in the left of each block corresponds to the matrix size at every level. Each convolutional layer has a 3 x 3 x 3 kernel and a rectified linear unit, except for the last layer that produced the residual update. Between different layers, a 2 x 2 x 2 max pooling size was used.

Figure 2. Representative results from 3 different patients at peak blood enhancement (a) and wall enhancement (b): CS as reference (row 1), zero-filled (row 2), U-Net (row 3), difference between zero-filled and CS (row 4), and difference between U-Net and CS (row 5). Both CS and U-Net includes BWLR filtering. Difference images are displayed with 5 times narrower grayscale to bring out differences.

Figure 3. Examples short-axis images in base (top row), mid-ventricular (middle row), and apical (bottom row) planes of a patient with a perfusion defect in the septal wall: CS reference (left column), U-Net (middle column), and difference images (right column). Note, difference images are displayed with 5 times narrower grayscale to bring out differences.

Proc. Intl. Soc. Mag. Reson. Med. 28 (2020)

1317