3180

Applying Continuous-Time-Random-Walk (CTRW) diffusion model based Radiomics in Predicting HER-2 Expression in Breast Invasive Ductal Cancer

Siyao Du¹, Mengfan Wang¹, Shasha Liu¹, Xiaoqian Bian¹, Xinyue Chen¹, Liangcun Guo¹, Guoliang Huang¹, Ruimeng Zhao¹, Can Peng¹, Wenhong Jiang¹, Qinglei Shi², Xu Yan², Guang Yang³, and Lina Zhang¹
¹Department of Radiology, The First Affiliated Hospital of China Medical University, Shenyang, China, ²MR Scientific Marketing, Siemens Healthineers Ltd., Beijing, China, ³Shanghai Key Laboratory of Magnetic Resonance, East China Normal University, Shanghai, China

Synopsis

In this study, we built a support vector machine (SVM) model based on quantitative parameters of continuous-time random-walk (CTRW) diffusion model in predicting the human epidermal growth factor receptor-2 (HER-2) expression in breast invasive ductal carcinoma. An AUC of 0.753 was achieved, which may have a great potential in future clinical practice.

Backgrounds and Purpose

In recent years, diffusion weighted imaging plays a vital role in breast cancer differential diagnosis and therapeutic effect evaluation. However, the microenvironments of biological tissues in cancer, a heterogeneity tumor, are complicated. Therefore, more specific models were developed to reflect heterogeneity of the water motion in the human body¹. The continuous-time random-walk (CTRW) diffusion model reflects intravoxel diffusion heterogeneity in both time and spatial scale. Therefore, it may be a potential biomarker to reflect the changes of tissue complexity and microenvironment. In making therapy schedule of invasive ductal carcinoma (IDC) of the breast, the levels of human epidermal growth factor receptor-2 (HER-2) expression plays an important role. In this study, we will evaluate the value of radiomics based on continuous-time random-walk (CTRW) diffusion model combined with a support vector machine (SVM) in predicting the HER-2 expression in breast invasive ductal carcinoma.

Materials and Methods

A total of 131 females, diagnosed with breast invasive ductal carcinoma (IDC) with biopsy confirmed, were enrolled from May 2020 to October 2021. According to the HER-2 states confirmed with immunohistochemistry results, all patients were divided into HER2-positive and HER2-negative groups. All MR examinations were performed on a 3T scanner (MAGNETOM Skyra, Siemens, Erlangen, Germany). The parameter maps of the CTRW were calculated by an in-house developed software called BoDiLab, which is based on Python 3.7. Based on a high b value images (b= 800 s/mm2), the whole tumor volume was delineated using a software itk-snap (http://www.itksnap.org/), by referring to the corresponding T1 contrast-enhancement or T2WI images . Radiomics features were extracted using an open source tool named Pyradiomics (https://pyradiomics.readthedocs.io/). During data preprocessing and model establishing, considering the effective and robust of the classifier, a SVM classifier was used. To explore the potential of this classifier, data enhancement, data normalization, dimension reduction and feature screening schemes in model establishing were optimized, and the optimal number of features in the prediction efficiency was also explored. The performance of the model was evaluated using receiver operating characteristic (ROC) curve analysis. The area under the ROC curve (AUC) was calculated for quantification. The accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were also calculated. All above processes were implemented with FeAture Explorer (FAE,v0.2.5,https://github.com/salan668/FAE) on Python (3.6.8,https://www.python.org/).

Results

After comparing and optimizing in model establishing, to remove the unbalance of the training data set, a synthetic minority oversampling technique (SMOTE) was used to make positive/negative samples balance. In the normalization of the feature matrix, each feature vector was subtracted by the mean value of the vector, and was divided by the length of it. In the feature dimension reduction, a pearson correlation coefficient (PCC) method was used. Before building the model, a Relief method was used to select features. Finally, when 4 features of original_ngtdm_Contrast_CTRW_D, original_glszm_LargeAreaHighGrayLevelEmphasis_CTRW_beta, original_firstorder_Median_CTRW_beta, original_firstorder_90Percentile_CTRW_beta were adopted, the SVM demonstrated the highest diagnostic values in training, validation and test data (Table 2).

Discussions

Unlike the other DWI models reflecting Gaussian or non-Gaussian diffusion behavior, the CTRW model can reflect intravoxel diffusion heterogeneity in both time and space, thus could be a potential biomarker for changes of tissue complexity and microenvironment² . Radiomics, since first proposed by Philippe Lambin in 2012³, have gained remarkable achievements in clinical practice, especially together with machine learning models. However, radiomics, especially texture features are very sensitive to image quality, signal intensity, matrix, or the physiological and physical means. Therefore, a quantitative parameter map, which the signal intensity will not change with the difference of scanning parameters, is of great significance for radiomics in the clinical application. SVM was an effective and robust classifier to build the model. The kernel function has the ability to map the features into a higher dimension to search the hyper-plane for separating the cases with different labels. Here we used the linear kernel function because it was easier to explain the coefficients of the features for the final model. To determine the hyper-parameter (e.g. the number of features) of model, we applied cross validation with 5-fold on the training data set. Using the SVM, the radiomics based on CTRW derived maps in predicting the HER-2 expression in breast IDC showed a high diagnostic performance, and it may have a great potential in future clinical practice.

Conclusions

The support vector machine based on radiomics of CTRW derived maps demonstrated high diagnostic performance in predicting the HER-2 expression in invasive ductal carcinoma of the breast, which may have a great potential in future clinical practice.

Acknowledgements

no.

References

1. Iima M, Honda M, Sigmund EE, et al. Diffusion MRI of the breast: Current status and future directions. 2020;52(1):70-90.

2. Karaman MM, Zhang JX, Xie KL, et al. Quartile histogram assessment of glioma malignancy using high b-value diffusion MRI with a continuous-time random-walk model. 2021;34(4):e4485.

3. Lambin P, Rios-Velazquez E, Leijenaar R, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer, 2021;48(4):441-446.

Figures

Figure 1. A-D: In the training, a mean method, pearson coefficent correlation (PCC) and Relief method showed higher AUCs than any other methods; D: The relationship between feature numbers and the AUCs, from which we can see that when 4 features adopted, SVM demonstrated the highest values in training, validation and test data in the pre-experiment. CV Train: Result under the training data via a 5-fold cross validation method; CV Validation: Result under the validation data via a 5-fold cross validation method; Train: Result using all training data; Test: Result using all test data.

Figure 2. (A) The contributions of adopted parameter values to the model. (B) The AUC values of ROC on CV training, CV validation, training and testing data. CV Train: Result under the training data via a 5-fold cross validation method; CV Validation: Result under validation data via a 5-fold cross validation method; Train: Result using all training data; Test: Result using all test data.

Table 1. ROC analysis results on CV training, CV validation, training and testing data. CV Train: Result under the training data via a 5-fold cross validation method; CV Validation: Result under validation data via a 5-fold cross validation method; Train: Result using all training data; Test: result using all test data.

Figure 3. A: The decision curve analysis (DCA), the graph is showing that, based on the SVM model, when the threshold probability is less than 90% or higher than 30%, then more benefit can be obtained than a biopsy all, or biopsy none scheme; B: the interventions avoided analysis (INA) showed that at a probability threshold of 80%, the net reduction in interventions is about 40 per 100 patients. In other words, at this probability threshold, biopsying patients on the basis of the model is the equivalent of a strategy that reduced the biopsy rate by 40%, without missing any cancers.

Figure 4, A-C: CTRW derived parameters (D, α, β) of an example with HER2-positive; D-F: CTRW derived parameters (D, α, β) of an example with HER2-negative. We can see HER2-positive patient shows lower CTRW derived parameters than HER2-negative patient.

Proc. Intl. Soc. Mag. Reson. Med. 30 (2022)

3180

DOI: https://doi.org/10.58530/2022/3180