2822

Deep Neural Network based Feature Selection in rs-fMRI Brain Functional Connectivity
Gengyan Zhao1, Gyujoon Hwang1, Cole Cook1, Fang Liu1, Mary Meyerand1, and Rasmus Birn1

1University of Wisconsin - Madison, Madison, WI, United States

Synopsis

Deep neural networks (DNN) have been successfully applied to various prediction tasks in rs-fMRI, but the feature selection mechanism of it often appear to be a black box. We developed understanding of DNN’s prediction mechanism and proposed a feature selection method based on each feature’s contribution to the prediction. Experiments were done on the functional connectivity (FC) gender prediction to extract gender related brain FC patterns with 1003 subjects’ rs-fMRI data. The proposed method was validated by the cross-entropy loss of each feature’s prediction, and results showed the selected features are robust and consistent with the findings in previous studies.

Introduction:

Deep learning based neural networks have been successfully applied to various prediction tasks in resting-state fMRI (rs-fMRI) brain functional connectivity (FC) and outperform conventional machine learning models1. However, in many circumstances knowing which features matter most in a prediction is as important as reaching a high accuracy, but the feature selection in deep neural network (DNN) often remains a black box. In this study, we developed understanding of the DNN prediction mechanism and proposed a feature selection and ranking method based on each feature’s contribution to the prediction. Experiments were done on the rs-fMRI FC gender prediction to extract gender related brain FC patterns2, which can be used as a testbed to study the method’s performance and characteristics for it being applied to other feature selection applications.

Methods:

In Fig. 1, the weight vectors between the input layer and the 1st hidden layer of DNN can be considered as convolutional kernels. These kernels are trained to extract the FC patterns which are most helpful on classifying different classes from the input connectivity matrices. Between the 1st and 2nd hidden layers the weights are used to look for a certain combination of the patterns for the later activations. The absolute value of the weight determines the importance of the corresponding pattern, and the sign of the weight means whether the pattern is preferred to appear in the combination or not. When it comes to the higher hidden layers, the higher level of combinations of the patterns are evaluated for the classification task. The key part of the DNN structure for extracting and ranking the features for each class in the proposed method is between the last hidden layer and the softmax layer. In the application of gender classification, there are two classes, male and female. Each activation, in the last hidden layer will be multiplied by the weights wM and wF respectively towards the male and female neurons. The two scores compete with each other and the input will be classified into the class with the larger score. The sign of the difference wM - wF determines the class the high-level FC pattern belongs to, while the absolute value of the difference determines how important it is for the DNN classification. Therefore, the absolute value of the difference can be sorted to rank the contribution of the high-level features for both classes. Since a high-level feature can be treated as the combination of lower-level features, they can be mapped back to the FC patterns at the original input level1. Experiments were done on the rs-fMRI data of 1003 healthy adults in the Human Connectome Project S1200 release3 in the framework of extracting features from predictive models with high accuracies. To test the robustness of the models, all the experiments were done with 50 randomly permuted 2-fold cross validations.

Results:

Fig. 2 shows that DNN reaches 83.0%, 87.6%, 92.0%, 93.5% and 94.1% accuracies respectively with the FC input derived from 25, 50, 100, 200, 300 independent component analysis (ICA) components. DNN outperforms the linear support vector machine (SVM) (Bonferroni corrected p-values<0.001) when smaller ICA component numbers are used. The proposed feature ranking method was validated with the cross-entropy loss on the training dataset. Fig. 3 shows that the more highly ranked feature pair has a lower cross-entropy loss, which means it is better at making the predicted label distribution similar to the ground truth distribution. The robustness and repeatability of the features extracted was also studied. Fig. 4 shows as the neural network becomes deeper, the repeatability of the most important high-level feature becomes higher. The most important male and female features extracted by DNN are shown in Fig. 5. For both 25 and 50 ICA components, females have stronger DM-DM, CC-VIS connections, relatively weaker CC-DM connections, and both stronger and weaker connections in CC-CC (as indicated by the black boxes in Fig. 5). These findings are consistent with previous studies2,4.

Conclusion:

In this study, we proposed a feature selection method for DNN according to each feature’s contribution to the final prediction, which provides a new understanding of the DNN model and makes DNN no longer a black box. Experiments on FC gender feature extraction serve as the basic framework to verify and study the performance and characteristics of DNN feature selection for its further applications in other fields of MRI/fMRI based predictions.

Acknowledgements

We gratefully acknowledge the Human Connectome Project for the data collection, data processing and data sharing. We also acknowledge the support of NVIDIA Corporation with the donation of the Titan Xp GPU used for this research. This work was supported by grants from the National Institutes of Health: U01-NS093650. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

References

1. Kim, J., Calhoun, V. D., Shim, E. & Lee, J.-H. Deep neural network with weight sparsity control and pre-training extracts hierarchical features and enhances classification performance: Evidence from whole-brain resting-state functional connectivity patterns of schizophrenia. NeuroImage 124, Part A, 127–146 (2016).

2. Zhang Chao, Dougherty Chase C., Baum Stefi A., White Tonya & Michael Andrew M. Functional connectivity predicts gender: Evidence for gender differences in resting brain connectivity. Hum. Brain Mapp. 39, 1765–1776 (2018).

3. Smith, S. et al. Resting-state fMRI in the Human Connectome Project. NeuroImage 80, 144–168 (2013).

4. Smith, S. M. et al. Functional connectomics from resting-state fMRI. Trends Cogn. Sci. 17, 666–682 (2013).

Figures

Fig. 1. Illustration of the structure of a typical DNN with 3 hidden layers for classifing 2 classes.

Fig. 2. The mean and standard deviation of the prediction accuracies across the 50 cross validation permutations for each kind of predictive model and each kind of input. L – number of hidden layers in the DNN model; N – number of neurons in the first hidden layer of the DNN, the number of neurons in each of the rest hidden layers is half of this number.

Fig. 3. The cross entropy loss achieved by a single high-level male and female feature pair of different importance on the training dataset . ‘1’, ‘2’, ‘3’, ‘4’, ‘5’ are the importance levels of the feature pairs, which mean the most, 2nd, 3rd, 4th and 5th highly ranked feature pair in the last hidden layer. In this figure and all the figures having boxplots, the points with red ‘+’ symbols are drawn as outliers, if they are greater than q3+1.5(q3-q1) or less than q1-1.5(q3-q1), where q1 and q3 are the first and third quartiles respectively.

Fig. 4. Correlations of the most important high-level features across all the 50 randomly permuted cross validations for each neural network structure, number of ICA component and gender. For each number of neuron, hidden layer and ICA component, male and female features in the highest level hidden layer were extracted and ranked with the proposed method. The correlations of the highliest ranked features were calculated across the 50 randomly permuted 2-fold cross valadtions (each boxplot shows the correlations between 100 extracted features).

Fig. 5. The most important high-level brain FC feature pairs extracted by DNN. (A) DNN: 200-neuron 2-hidden-layer network; input: 25-component ICA connectivity matrices. (B) DNN: 200-neuron 3-hidden-layer network; input: 50-component ICA connectivity matrices. Each group mean of the input is also shown for comparison. SC, subcortical; CC, cognitive control; DM, default-mode; CB, cerebellar; VIS, visual; AUD, auditory; SM, somatomotor.

Proc. Intl. Soc. Mag. Reson. Med. 27 (2019)
2822