3501

Test-retest Repeatability of Echo Planar Imaging Diffusion-weighted MRI on a 1.5T MR-linac for Head and Neck Cancers

Brigid McDonald¹, Dina El-Habashy¹, Renjie He¹, Abdallah S. R. Mohamed¹, Sam Mulder¹, Sara Ahmed², John Christodouleas³, and Clifton David Fuller¹
¹Radiation Oncology, UT MD Anderson Cancer Center, Houston, TX, United States, ²Dartmouth Hitchcock Medical Center, Lebanon, NH, United States, ³Elekta AB, Philadelphia, PA, United States

Synopsis

Keywords: Simulation/Validation, Diffusion/other diffusion imaging techniques, MR-linac, image-guided radiation therapy, repeatability

Motivation: In order to use DWI on MR-linacs to adapt head and neck cancer radiotherapy treatment plans based on response, the variability in ADC must be characterized.

Goal(s): To quantify the repeatability of ADC on a 1.5T MR-linac in a large cohort (37 head and neck cancer patients).

Approach: Patients were imaged with echo planar imaging-DWI twice before the start of radiotherapy. Mean ADC values of primary tumors and lymph nodes were compared across time points using repeatability metrics.

Results: Repeatability coefficients were 53.0%/35.5% for tumors/nodes, indicating that this DWI sequence is insufficient for detecting clinically significant ADC changes and must be further optimized.

Impact: Our DWI test-retest results demonstrate that the current widely implemented EPI-DWI sequence for head and neck cancers on 1.5T MR-linacs has substantial ADC variability across time points and needs to be further refined.

Introduction

Changes in diffusion-weighted imaging (DWI) apparent diffusion coefficient (ADC) values of head and neck cancers (HNC) during radiation therapy (RT) can predict response to RT¹, forming the basis for biological image-guided adaptive RT where changes in quantitative imaging biomarkers inform treatment plan adaptation². MRI-linear accelerators (MR-linacs) are well-suited for this purpose, but the variability in ADC must be characterized to understand when ADC changes are biologically meaningful. We aim to quantify ADC repeatability metrics in a large cohort of HNC patients on a 1.5T MR-linac.

Methods

This retrospective study included HNC patients that fit the following criteria: histologically confirmed HNC, consented to MOMENTUM observational trial³, and imaged with echo planar imaging (EPI)-DWI on the 1.5T MR-linac (Elekta Unity) at two time points between the pre-treatment MR simulation and first day of treatment (inclusive). Patients were imaged in RT immobilization masks with a T2-weighted sequence and EPI-DWI with b-values of 0, 150, and 500 s/mm². (Acquisition parameters are in Table 1.) ADC maps were reconstructed by the scanner using only b=150 and 500 s/mm² images to minimize perfusion effects⁴. Primary tumors (PTs) and pathological lymph nodes (LNs) were segmented in VelocityAI (v3.0.1) on the DWI b0 images—which were rigidly registered to the T2w images as guidance for better visualization—then were rigidly copied to the ADC maps. Mean ADC values and volumes of each structure were extracted in VelocityAI. Wilcoxon signed rank tests (α=0.05) were performed in GraphPad Prism (v10.0.3) to determine if there were significant differences in volume and mean ADC between time points 1 and 2 (TP1 and TP2) for PTs and LNs. The following repeatability metrics were calculated according to consensus guideline procedures⁵: within-subject deviation (wSD), repeatability coefficient (RC), within-subject coefficient of variation (wCV), and percent RC (%RC). Bland-Altman analysis was performed in GraphPad Prism for PT and LN mean ADC values across TP1 and TP2 to determine the mean bias and 95% limits of agreement.

Results

37 patients were included in the cohort with a total of 32 PTs and 52 LNs. Patient demographics are provided in Table 2. The median (range) number of days between imaging time points was 11 (1-15). Structure volumes, mean ADC values, and repeatability metrics for PTs and LNs are shown in Table 3. The differences in volume and mean ADC between time points were not statistically significant except for LN volume (p=0.0008). The wSD (95% confidence interval) values were 0.212 (0.179-0.280) x10^-3 mm²/s and 0.133 (0.112-0.165) x10^-3 mm²/s for PTs and LNs, respectively. The wCV (95% confidence interval) values were 19.1% (15.4%-25.3%) and 12.8% (10.8%-15.9%) for PTs and LNs, respectively. From Bland-Altman analysis, the mean bias (95% limits of agreement) between TP1 and TP2 was 0.033 (-0.560 – 0.626) x10^-3 mm²/s for PTs and -0.020 (-0.390 – 0.350) x10^-3 mm²/s for LNs.

Discussion/Conclusions

While a small number of studies have quantified ADC repeatability for HNC in both conventional MRIs⁶ and MR-linacs^7,8, cohorts have been small (≤11 patients). Our study has the largest cohort so far (32 PTs and 52 LNs) and comes closest to meeting the minimum recommended sample size of 35 lesions for both PTs and LNs⁹. Our wCV values are similar to the two previous reports using the same EPI sequence on a 1.5T MR-linac. Habrich et al. used b=150,500 ADC maps and reported RC values of 0.457 and 0.310 x10^-3 mm²/s for PTs and LNs and %RC values of 31.3% and 23.5% for PTs and LNs⁷. However, all test-retest scans were acquired during the same session with no repositioning, so the variability is expected to be lower. For repeat scans at least a day apart, McDonald et al. reported PT and LN %RC values of 52.5% and 39.5% for b=150,500 ADC maps and 28.5% and 27.6% for b=0,500 ADC maps⁸; these values are very similar to our current results (53.0% (PT) and 35.5% (LN)) but demonstrate that ADC variability is higher when using b=150,500 ADC maps, which was done to minimize perfusion effects⁴. Furthermore, repeatability was consistently worse in PTs than in LNs in all studies, which is likely due to 1) the greater delineation uncertainty for PTs than LNs and 2) the more heterogeneous composition of PTs. Finally, previous studies have demonstrated that a threshold mid-RT ΔADC as low as 7% can discriminate between complete response and non-responders in HNC¹. Our results indicate that the current MR-linac EPI-DWI sequence is not sufficiently sensitive to detect clinically significant ADC changes, so further sequence optimization is necessary to improve signal-to-noise ratio and ADC measurement robustness.

Acknowledgements

This project is supported by an academic-industrial partnership R01 grant from the National Institutes of Health (NIH)/National Institute of Dental and Craniofacial Research (NIDCR) (R01DE028290), NIH/NCI (National Cancer Institute) Image-Guided Cancer Therapy T32 Program (T32CA261856), and an AAPM-ASTRO Post-doc/Resident Seed Grant.

References

Mohamed ASR, Abusaif A, He R, et al. Prospective validation of diffusion-weighted MRI as a biomarker of tumor response and oncologic outcomes in head and neck cancer: Results from an observational biomarker pre-qualification study. Radiother Oncol. 2023;183. doi:10.1016/j.radonc.2023.109641
van Houdt PJ, Saeed H, Thorwarth D, et al. Integration of quantitative imaging biomarkers in clinical trials for MR-guided radiotherapy: Conceptual guidance for multicentre studies from the MR-Linac Consortium Imaging Biomarker Working Group. Eur J Cancer. 2021;153:64-71. doi:10.1016/j.ejca.2021.04.041
van Otterloo SR de M, Christodouleas JP, Blezer ELA, et al. The MOMENTUM Study: An International Registry for the Evidence-Based Introduction of MR-Guided Adaptive Therapy. Front Oncol. 2020;10(September):1328. doi:10.3389/fonc.2020.01328
Kooreman ES, van Houdt PJ, Keesman R, et al. ADC measurements on the Unity MR-linac – A recommendation on behalf of the Elekta Unity MR-linac consortium. Radiother Oncol. 2020;153:106-113. doi:10.1016/j.radonc.2020.09.046
Shukla-Dave A, Obuchowski NA, Chenevert TL, et al. Quantitative imaging biomarkers alliance (QIBA) recommendations for improved precision of DWI and DCE-MRI derived biomarkers in multicenter oncology trials. J Magn Reson Imaging. 2019;49(7):e101-e121. doi:10.1002/jmri.26518
Paudyal R, Konar AS, Obuchowski NA, et al. Repeatability of Quantitative Diffusion-Weighted Imaging Metrics in Phantoms, Head-and-Neck and Thyroid Cancers: Preliminary Findings. Tomography. 2019;5(1):15-25. doi:10.18383/j.tom.2018.00044
Habrich J, Boeke S, Nachbar M, et al. Repeatability of diffusion-weighted magnetic resonance imaging in head and neck cancer at a 1.5 T MR-Linac. Radiother Oncol. 2022;174:141-148. doi:10.1016/j.radonc.2022.07.020
McDonald BA, Salzillo T, Mulder S, et al. Prospective evaluation of in vivo and phantom repeatability and reproducibility of diffusion-weighted MRI sequences on 1.5 T MRI-linear accelerator (MR-Linac) and MR simulator devices for head and neck cancers. Radiother Oncol. 2023;185:109717. doi:10.1016/j.radonc.2023.109717
Obuchowski NA, Bullen J. Quantitative imaging biomarkers: Effect of sample size and bias on confidence interval coverage. Stat Methods Med Res. 2018;27(10):3139-3150. doi:10.1177/0962280217693662

Figures

Figure 1: Bland-Altman plots showing the differences in ADC values from TP1 and TP2 vs. the average ADC values for a) primary tumors and b) lymph nodes. The dashed lines represent the mean biases, and the dotted lines represent the 95% limits of agreement.

Table 1: MRI acquisition parameters for the T2-weighted and EPI-DWI sequences.

Table 2: Patient demographics

Table 3: Volume, mean ADC, and ADC repeatability results for primary tumors and lymph nodes. Volume and mean ADC are represented as mean ± standard deviation. Repeatability metrics are represented as value (95% confidence interval).

Proc. Intl. Soc. Mag. Reson. Med. 32 (2024)

3501

DOI: https://doi.org/10.58530/2024/3501