The automation of the grading task for the knee MRI scoring is appealing. The goal of this study is to leverage recent developments in Deep Learning (DL) applied to medical imaging in order to (i) identify cartilage lesions and assess severity (ii) identify the presence of BMELs, (ii)combine the two models in a multi-task automated and scalable fashion. We were able to boost performance of our final classifiers by not simply focusing on what the fine tuning of a single purpose model could offer, but rather broadly considering related tasks that could bring additional information to our classification problem.
Introduction
Semi quantitative scoring systems, such as the Whole-Organ Magnetic Resonance Imaging Score (WORMS)1 have been developed in an attempt to standardize the Knee MRI reading. Despite grading systems being widely used in research setting the clinical application is hampered by the time and level of expertise needed to reliably perform the reading making the automation of this task appealing for a smoother and faster clinical translation. The goal of this study is to fill this void by capitalizing on recent developments in Deep Learning-DL applied to medical imaging. Specifically, we aim to (i)identify cartilage lesions and assess severity (ii)identify the presence of BMELs, (ii)combine the two models in a multi-task automated and scalable fashion.Results
The first step on the cartilage lesion classification was to automatically classify lesions severity only with 3D volumetric image data. Overall accuracy for that classifier was 79% on the holdout set. Based on three-channel MIPS, the accuracy of BMEL classifier was >80%. For the shallow classifier ensemble three class WORMS model, an overall accuracy of 82% was achieved when combining the 3D-CNN with demographics data. The count confusion matrix can be viewed in Figure 4, along with results for the combinations of the 3 classifiers used in our pipeline. A 4th option is also considered when using the real radiologists graded BMEL labels as input for the shallow classifier, where it boosted the performance to a 86% overall accuracy. This shows the potential of improving the BMEL classifier and combining it with cartilage lesion classification in a multi-task learning approach. In an attempt to interpreted better our results misclassified cases were further inspected by experts Figure 5 shows an example.Discussion and Conclusion
By combining different anatomical structures (distinct cartilage compartments) and lesion classification grading for both cartilage and BMEL, we are moving towards multitask machine learning for lesion detection. The proposed approach is weakly supervised in the sense that it learns features using only image level labels (i.e., all that is known is the presence or absence of a lesion somewhere in the 3D volume). With the proposed approach, we were able to boost performance of our final classifiers by not simply focusing on what the fine tuning of a single purpose model could offer, but rather broadly considering related tasks that could bring additional information to our classification problem.1. Peterfy CG, Guermazi A, Zaim S, et al. Whole-Organ Magnetic Resonance Imaging Score (WORMS) of the knee in osteoarthritis. Osteoarthritis Cartilage. 2004;12(3):177-90.
2. Fausto Milletari, Nassir Navab, Seyed-Ahmad Ahmadi. V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation. arXiv: 1606.04797. 2016
3. Norman B, Pedoia V, Majumdar S. Deep Learning Convolutional Neural Networks for Knee Multi-Tissue Automatic Morphometry and Relaxometry. Radiology. 2018 Jul;288(1):177-185
4. Tianqi Chen and Carlos Guestrin, "XGBoost: A Scalable Tree Boosting System", 22nd SIGKDD Conference on Knowledge Discovery and Data Mining, 2016, arXiv:1603.02754