Machine learning strongly enhances diffusion MRI in terms of acquisition speed and quality of results. Different machine learning tasks are applicable in different situations: labels for training might be available only for healthy data or only for common but not rare diseases; training labels might be available voxel-wise, or only scan-wise. This leads to various tasks beyond supervised learning. Here we examine whether it is possible to perform accurate voxel-wise MS lesion detection if only scan-wise training labels are used. We use negative-unlabeled learning (an equivalent of positive-unlabeled learning) and achieve an AUC of 0.77.
Data: 94 MS patients and 26 healthy controls, each with six b=0 images and 40 diffusion-weighted images (6+40 “channels”), bmax=1200s/mm², SE-EPI, voxel size 1.8mm×1.8mm×2.4mm, matrix 128×128, 57 slices, TE=94.5ms, TR=16s, motion/distortion-corrected6. Human raters marked MS lesions. To facilitate neural network training, we perform so-called feature scaling by dividing each channel by the corresponding channel mean taken over all scans. To prevent overfitting on intensity values, we also divide each scan by its mean intensity, and multiply it by a random scalar between 0.8 and 1.2 during each training epoch.
Negative-Unlabeled Learning: We examine whether it is possible to perform accurate voxel-wise MS lesion detection if only scan-wise training labels are used. We treat every voxel as a sample, with its features being the q-space measurements. We distinguish a set of negative samples (all voxels from healthy controls) and a set of unlabeled samples (patient scans consisting of lesions and healthy voxels without labels). Using such training data is called negative-unlabeled learning7, or equivalently (by renaming the classes) positive-unlabeled learning. When labeling the unlabeled set entirely as positive (hence introducing some “label-noise”) and optimizing the area under the ROC curve (AUC) is equivalent to supervised learning with AUC optimization8. Here we optimize mean-squared-error instead of AUC, which usually yields similar results. We expect a prediction around 1 for lesions and around 0.78 (due to class imbalance and neural networks averaging out label-noise) for healthy voxels.
Training: We used a convolutional network: four layers with 128, 256, 512, and 1 filter, respectively, all filter sizes 1×1×1, ReLU, mean-squared-error loss (and quality evaluation) only on segmented9,10 brain voxels. We used 60% of scans for training, 20% for early stopping, 20% for testing.
[1] Golkov et al.: “q-Space Deep Learning: Twelve-Fold Shorter and Model-Free Diffusion MRI Scans” In IEEE Transactions on Medical Imaging, volume 35, 2016
[2] Golkov et al.: “Model-Free Novelty-Based Diffusion MRI” In IEEE International Symposium on Biomedical Imaging (ISBI), 2016
[3] Golkov et al.: “q-Space Novelty Detection in Short Diffusion MRI Scans of Multiple Sclerosis” ISMRM Annual Meeting, 2018
[4] Vasilev et al.: “q-Space Novelty Detection with Variational Autoencoders”, In ArXiv preprint, 2018
[5] Golkov et al.: “q-Space Deep Learning for Alzheimer’s Disease Diagnosis: Global Prediction and Weakly Supervised Localization” ISMRM 2018
[6] Klein et al.: “elastix: A toolbox for intensity-based medical image registration” In IEEE Transactions on Medical Imaging 29(1):196–205, 2010
[7] G. Niu: “Recent Advances on Positive Unlabeled (PU) Learning” IBISML, 2017.
[8] Zhung, Lee: “Learning Classifiers without Negative Examples: A Reduction Approach” In Third International Conference on Digital Information Management, 2008
[9] P. Kalavathi: “Brain tissue segmentation in MR brain images using multiple Otsu's thresholding technique” 8th International Conference on Computer Science & Education, 2013
[10] Garyfallidis et al.: “DIPY, a library for the analysis of diffusion MRI data” Frontiers in Neuroinformatics, vol.8, no.8., 2014