1117

Characterization of Breast Tumor using Machine Learning based upon Multi-parametric MRI Features.

Snekha Thakran¹, Rakesh Kumar Gupta², and Anup Singh^1,3
¹Centre for Biomedical Engineering, Indian Institute of Technology Delhi, New Delhi, India, Delhi, India, ²Department of Radiology, Fortis Memorial Research Institute, Haryana, Gurgaon, India, Delhi, India, ³Department of Biomedical Engineering, All India Institute of Medical Science, New Delhi, India, Delhi, India

Synopsis

Multi-parametric MRI(mp-MRI) has shown promising outcomes with high sensitivity and accuracy in the characterization of breast tumor. Quantitative analysis of mp-MRI and texture features with machine learning approach have also shown potential in improving accuracy of breast tumor classification. The objective of this study was to differentiate low-grade vs. high-grade breast tumor using machine learning with optimized feature vector obtained from mp-MRI data. The study included mp-MRI data of 35 patients with breast cancer. The combination of support-vector-machine(SVM) with Wrapper method using Adaptive-Boosting(AdaBoost) technique resulted in high sensitivity(0.94±0.07), specificity(0.80±0.05), and accuracy(0.90±5.48) in classification of low-grade vs. high-grade tumors.

Introduction:

MRI can be used to obtain multi-parametric(mp) information such as structural, hemodynamic, physiological, etc^1,2. Multiple studies have been reported focusing on the use of mp-MRI data for the characterization of breast tumor with varying sensitivity and specificity^3,4,5. Radjenovic et al. have reported pharmacokinetic parameters from DCE-MRI for grading of invasive breast tumors. The volume transfer coefficient(K^trans) and rate constant(Kep) were significantly higher in grade-III tumors than in grade-II tumors. None of the measured parameters varied significantly between grade-I and grade-II tumor³. Jiang et al. reported sensitivity of 57% and specificity of 78% with morphology features⁴. The quantitative analysis of mp-MRI data has shown potential in improving accuracy of breast tumor classification². In general, a large set of quantitative and texture features can be generated depending upon the type of methodology used. A suitable combination of selected quantitative and texture features can further improve accuracy of tumor classification. Machine learning(ML) classifiers based upon features derived from MRI data has shown potential in the classification of tumors^6,7. There is a need for further research studies on selection of appropriate combination of features as well to evaluate performance of different ML classifiers for accurate classification of breast tumor. In the current study, it was hypothesized that quantitative features(tracer kinetic, hemodynamic and diffusion) in combination with some texture features can improve accuracy of tumor classification using an optimized ML classifier. The objectives of the current study were to develop and optimize an machine learning framework for characterization of breast tumor(low-grade vs. high-grade tumor) using an optimized feature vector computed from mp-MRI data.

Methods:

All the MRI experiments were performed at 3T-whole body Ingenia MRI system(Philips Healthcare,The Netherlands) using a 7-channel biopsy compatible breast coil. This study included breast mp-MRI data of 35-female patients with histopathology results. Protocol included conventional-MRI(T₁-W,T₂-W, and PD-W), Diffusion-weighted-imaging(DWI) with different b-values(0,1000s/mm²) and T₁-perfusion MRI data with 40-dynamics, 5.4-seconds temporal-resolution).
Data processing: The mp-MRI data was processed for extraction of features followed by selection of features and evaluation of multiple ML classifiers for classification of breast tumors(Figure-1). Tumor tissue was used as a region of interest(ROI). Quantitative features and texture features were extracted from ROI:1) 4-Tracer kinetics features(K^trans , Kep, Vp, Ve) and 3-hemodynamic features(BBV, BBVcorr, and BBF), computed from T₁-perfusion MRI data⁵, 2) apparent-diffusion-coefficient(ADC) parameter, computed using DWI data⁸, and 3) 40-texture features^9,10 were computed from each 3 T₁-perfusion images(pre-, peak- and delayed-contrast) using Radiomics tool. A total 128 features(8-quantitative features and 120-texture features) per patient were used in this study. Finally, a 15-feature vectors(FVs) consists of different combinations of quantitative and texture features were used in this study as shown in Table-1. Wrapper-based-feature-selection(WBFS) method¹¹with Adaptive-Boosting technique was used for feature selection. 5-ML classifiers were evaluated for tumor classification using 10-fold cross-validation(CV) with 10-repetitions. The diagnostic performance for differentiating between low-grade vs. high-grade tumors using different classifiers without and with the feature selection was analyzed using accuracy, sensitivity, specificity, precision, F1-score, mean-absolute-error(MAE) and root-mean-squared-error(RMSE) with respect to histology result.

Results:

The performance of 5 different classifiers and 15 different FVs, without and with feature selection, were evaluated for classification of low-grade vs. high-grade tumor. Among all FVs, the FV-15 feature vector without any feature selection, with SVM classifier provided highest average accuracy(77%±4.31) and AUC(0.76±0.06) for the classification of low-grade vs. high-grade tumors with 10-fold CV over 10-repetitions. WBFS techniques provided reduced number of features in this feature vector (FV-15). WBFS method selected 5 features(BBV, BBVcorr, Correlation at peak-contrast, Gray-Level Variance(GLV) from GLRLM at delayed contrast and High Gray Level Zone Emphasis(HGZE) from GLSZM at delayed contrast to classify the low-grade vs. high-grade tumor. SVM with optimum features selected using Wrapper method with Adaptive-Boosting technique provided a highest average accuracy(90%±5.48) and AUC(0.93±0.10) as shown in Figure-2. Table-2 shows that the SVM classifier with the WBFS method provided the highest sensitivity(0.94±0.07), specificity (0.80±0.05), precision(0.92±0.05), and F1 score(0.92±0.06) in characterizing the tumor between low-grade vs. high-grade tumors among all other methods using 10-fold CV method over 10-repetitions. It also provided the lowest MAE(0.16±0.03) and RMSE(0.24±0.03).

Discussion and Conclusion:

The choice of features plays a crucial part in the accurate classification of tumors. In the current study, hemodynamic features were combined with other features and used for ML classifier development. Hemodynamic features are biomarkers for angiogenesis, which has been reported to be correlated with tumor growth and tumor type⁵. Inclusion of hemodynamic features in mp-MRI feature vector improved accuracy of tumor classification substantially. The AdaBoost method reduced the problem of imbalance data and mitigated the problem of overfitting. Using a combination of SVM and optimal mp-MRI features set with wrapper method provided the highest sensitivity, specificity and accuracy as compared to reported studies^3,4,5. These are preliminary results with a small number of patients. It is necessary to validate the classifier on a larger data size in future studies. In conclusion, hemodynamic features were included with tracer kinetic, ADC and texture features to create a feature vector in the current study, which improved the accuracy in the classification of low vs. high grade tumor. Our finding provided that the SVM outperformed other ML methods in the binary classification of breast tumor.

Acknowledgements

The authors thank Dr. Sunita Ahlawat for providing histopathology results; Rupsa Bhattacharjee for technical assistance. The Authors acknowledge an internal grant from the Indian Institute of Technology Delhi.

References

1. Rahbar H, Partridge SC. Multiparametric Breast MRI of Breast Cancer. Magn Reson Imaging Clin N Am. 2016;24(1):223-238.

2. Pinker K, Helbich TH, Morris EA. The potential of multiparametric MRI of the breast. Br Inst Radiol. 2017;90(20160715):1-17.

3. Radjennovic A, Dall BJ, Ridgway JP, Smith MA. Measurement of pharmacokinetic parameters in histologically graded invasive breast tumours using dynamic contrast-enhanced MRI. Br J Radiol. 2008;81(962):120-128.

4. Jiang X, Xie F, Liu L, Peng Y, Cai H, Li LI. Discrimination of malignant and benign breast masses using automatic segmentation and features extracted from dynamic contrast-enhanced and diffusion-weighted mri. Oncol Lett. 2018;16(2):1521-1528.

5. Thakran S, Chatterjee S, Singhal M, Gupta RK. Automatic outer and inner breast tissue segmentation using multi-parametric MRI images of breast tumor patients. Published online 2018:1-21.

6. Cai H, Peng Y, Ou C, Chen M, Li L. Diagnosis of breast masses from dynamic contrast-enhanced and diffusion-weighted MR: A machine learning approach. PLoS One. 2014;9(1).

7. Liu B, Li X, Li J, et al. Comparison of Machine Learning Classifiers for Breast Cancer Diagnosis Based on Feature Selection. Proc - 2018 IEEE Int Conf Syst Man, Cybern SMC 2018. Published online 2019:4399-4404.

8. Woodhams R, Matsunaga K, Iwabuchi K, Hata H, Kuranami M, Hayakawa K. Diffusion-Weighted Imaging of Malignant Breast Tumors The Usefulness of Apparent Diffusion Coefficient (ADC) Value and ADC Map for the Detection of Malignant Breast Tumors and Evaluation of Cancer Extension. J Comput Assist Tomogr. 2005;29(5):644-649.

9. Agner SC, Soman S, Libfeld E, et al. Textural kinetics: A novel dynamic contrast-enhanced (DCE)-MRI feature for breast lesion classification. J Digit Imaging. 2011;24(3):446-463.

10. Haralick R, Shanmugam K, and Dinstein I, Textural Features for Image Classication, IEEE Transactions on Systems, Man, and Cybernetics, vol. SMC-3, no. 6, pp. 610-621, Nov. 1973.

11. Darzi M, AsgharLiaei A, Hosseini M, HabibollahAsghari. Feature Selection for Breast Cancer Diagnosis: A Case-Based Wrapper Approach. World Acad Sci Eng Technol. 2011;77(5):1142-1145.

Figures

Figure-1: The flow chart of the proposed framework.

Table-1: List of different feature vectors and their description.

Figure-2: Performance (average accuracy using 10-fold CV over 10 repetitions) of different classifiers without (FV-15) and with feature selection (selected features from FV-15 using wrapper method) for low-grade vs. high-grade tumor classification. Error bars show standard deviation.

Table-2:Diagnostic performance for differentiating between low-grade vs. high-grade tumors using different classifiers without and with the feature selection using repeated 10-fold Cross Validation (CV).

Proc. Intl. Soc. Mag. Reson. Med. 29 (2021)

1117