Hakmook Kang1,2, Allison Hainline1, Xia Li3, Lori R. Arlinghaus4, Vandana G. Abramson5,6, A. Bapsi Chakravarthy5,7, Brian Bingham8, and Thomas E. Yankeelov2,4,5,9
1Biostatistics, Vanderbilt University, Nashville, TN, United States, 2Center for Quantitative Science, Vanderbilt University, Nashville, TN, United States, 3GE Global Research, Niskayuna, NY, United States, 4Institute of Imaging Science, Vanderbilt University, Nashville, TN, United States, 5Ingram Cancer Center, Vanderbilt University, Nashville, TN, United States, 6Medical Oncology, Vanderbilt University, Nashville, TN, United States, 7Radiation Oncology, Vanderbilt University, Nashville, TN, United States, 8School of Medicine, Vanderbilt University, Nashville, TN, United States, 9Radiology and Radiological Sciences, Vanderbilt University, Nashville, TN, United States
Synopsis
Pathologic
complete response (pCR) following neoadjuvant chemotherapy is used as a short
term surrogate marker of ultimate outcome in patients with breast cancer.
Current imaging tools are suboptimal in predicting this response. Analyzing
voxel-level heterogeneity in multi-modal MRI maps in conjunction with receptor
status data, i.e., DCE- and DW-MRI, and ER/PR/HER2 status, allows us to improve
the predictive power after the first cycle of neoadjuvant chemotherapy (NAC). PURPOSE
The purpose of this study is to investigate the
ability of multivariate
penalized regression models with tumor grade, age, receptor status and DCE- and
DW-MRI parametric maps, obtained early in the course of therapy, to predict
pathologic complete response (pCR) status at the time of surgery.
METHODS
DCE-
and DW-MRI data were collected (3.0T MR scanner, Philips Healthcare, The
Netherlands) before and after the first cycle of NAC from thirty-three patients
with Stage II or III breast cancer who participated in an IRB-approved clinical
trial. Details on data acquisition and longitudinal registration of DCE- and
DW-MRI parametric maps can be found elsewhere
1. Pathological
complete responses were confirmed at the time of surgery. There were 12 patients
with pCR and 21 who had residual disease at surgery. Voxel-based information of
the efflux rate constant K
ep from the DCE-MRI and apparent diffusion
coefficient ADC from the DW-MRI was extracted via the redundancy analysis in
which the most uncorrelated percentiles of K
ep and ADC values were
chosen after fitting a series of multiple linear regression models for those
voxels with increase in K
ep and decrease in ADC, respectively, from
pre- to post-NAC
2. Logistic regression with ridge penalty was
employed to accommodate a large number of predictors (e.g., voxel-level K
ep
and ADC) compared to our sample size (n=33). The performance of the full model,
i.e., voxel-level K
ep & ADC, the receptor status, age, and tumor
grade (VRC model), was compared to the same model without the receptors’ status
(VC model) in terms of overfitting-corrected
3 AUC and Brier score
that measures the accuracy of prediction. For comparison, the prediction power
of the conventional measure RECIST (RC model) was also assessed. The
statistical significance of the difference in overfitting-corrected AUCs and
Brier scores was investigated by generating a bootstrap distribution of the
difference with 300 replicates.
RESULTS
In
Table 1, the overfitting-corrected AUCs and Brier scores resulted from the
three logistic ridge regression models are reported along with the
corresponding 95% CIs. The overfitting-corrected AUC resulted from VRC model
outperforms the other two models, i.e., VC and RC models. Moreover, VRC model
achieves the smallest overfitting-corrected Brier score, indicating that VRC
model outperforms the others. The differences in overfitting-corrected AUCs
between VRC and VC, and between VRC and RC model are illustrated in Figure 1.
The shaded area 0.057 and 0.05 in Figure 1 indicates how overfitting-corrected AUC
resulted from VRC is smaller than that from VC and RC, respectively. The shaded
area 0.097 and 0.053 in Figure 2 indicate how
the overfitting-corrected Brier score resulted from VCR is larger than
that from VC and RC, respectively. Although the 95% CI on each
overfitting-corrected statistics is not statistically significant, Figure 1 and
2 strongly support that the overfitting-corrected AUC and Brier score of VRC
are likely to outperform the corresponding statistics based on VC and RC model.
CONCLUSION/DISCUSSION
The
study supports incorporating voxel-level information along with hormone
receptor information, to improve the ability to predict treatment response for
breast cancer patients using early imaging time points.
Acknowledgements
NCI
1R01CA129961, NCI 1U01CA142565, NCI 1P50 098131, NCI P30 CA68485, NCRR/NIH UL1
RR024975-01.References
1.
Li X, et al. Invest Radiol 2014; 50(4):195-204.
2. Li X, et al. Transl Oncol
2014; 7(1): 14-22.
3. Harrell F, 2010; Ch.5 in Regression Modeling Strategies,
New York, NY: Springer.