3867

A BRUTE FORCE APPROACH TO IMPROVE THE CLASSIFICATION ACCURACY IN RESTING STATE FMRI DATA
Debbrata Kumar Saha1, Eswar Damaraju2, Barnaly Rashid3, Anees Abrol2, Sergey Plis2, and Vince Calhoun2

1Computer Science, University of New Mexico, Albuquerque, NM, United States, 2The Mind Research Network, Albuquerque, NM, United States, 3Harvard Medical School, Boston, MA, United States

Synopsis

Currently, extensive research is ongoing to perform classification between healthy controls (HC) and patients by extracting features from resting state fMRI based dynamic connectivity states where these states are typically identified by applying different clustering algorithm. However, for classification purposes, the information captured by all dynamic states may not be significant. In this work, we propose a brute force (BF) approach where we consider a subset of these states to perform classification. Our results indicate that in most of the cases, there exists a subset of states which provides better accuracy instead of utilizing information from all of the states.

Background

There is currently a lot of research focusing on the connectivity dynamics within resting state fMRI. Typically the k-means clustering algorithm is used to find out the optimal number of connectivity states, and the information extracted from these discrete states can be utilized to perform automatic classification between HC and patients with mental disorders. One of the classification frameworks based on the dynamic connectivity features is proposed in [2], where the information from all states (where the states are identified using k-means algorithm) are used as features to perform classification. However, this approach assumes all of these dynamic states contain significant information for classification purposes. However, some states may contain more valuable features for classification. In this work, we propose a brute force (BF) approach for classification where we take all subsets of states and compute cross validated accuracy. Our results show that in most of the cases, there exists a subset of states that provides better classification accuracy instead of incorporating features from all available states.

Methods

For our experiments, we use resting state fMRI data (163 HC and 151 SZ patients) previously used in the study [1]. For each subject, we took 1081 static functional network connectivity (sFNC) measures and for dynamic FNC (dFNC), we took 136 x 1081 dFNC windows per subject. We evaluated the classification performance of sFNC and dFNC, as shown in our recent work [2]. To perform classification, we applied linear support vector machine (SVM) on 5-fold cross validations of data with 10 repetitions in all our experimental settings.

For sFNC, we trained a linear SVM classifier using the training dataset of any specific fold and then computed the accuracy using the left-out testing dataset of that fold. In the same manner the remaining 4 folds were computed. This whole procedure was repeated 10 times to compute the standard error of mean accuracy. For dFNC of any specific fold (the same fold that we used in sFNC), we ran k-means separately on different groups (HC and SZ) on training dataset for model order 2-5 in k-means. Then, we combined the cluster centroids that were extracted from different groups. When combining the centroids, we apply the BF approach to select the subset of centroids which provide the best classification accuracy. For any subset of centroids from HC and SZ, each observation (i.e., dynamic windows) was regressed onto these combined subset centroids to identify its contribution weight (beta coefficient). The detailed procedure to compute the beta coefficient is described in [2]. After completion of regression, one mean beta (computed as the mean overall observations of any individual subject) per centroid state was obtained as feature for SVM. Next, we trained a linear SVM classifier using these beta coefficients from this training fold. Beta coefficients for testing dataset were computed using the combined centroids (i.e., the centroids that we used on training dataset) and then the classification accuracy was computed for this testing dataset using the SVM classifier. Finally, the best subsets of centroids were chosen which provided the maximum classification accuracy.

Results

We demonstrate our results in a pirate plot shown in Figure 1. Each pirate plot contains 10 points where each point is the mean accuracy over five folds for any certain repetition. We ran our experiments for three model orders (3, 4 and 5) which are represented as dm3, dm4 and dm5 in Figure 1. The flat approach represents the results followed by the method mentioned in [2] and BF approach is our new proposed brute force technique. From our results, we observe that BF approach always outperforms sFNC and regular regression technique for dFNC, supporting our hypothesis that certain states are more informative than others in the prediction.

Conclusions

In this work, we have examined whether sub states information (picked by BF manner) can provide higher classification accuracy instead of taking all states into account. Our proposed approach has explored this experimental setup for model orders 3 to 5. However, further research is required to justify the validity and consistency of this new approach. We will focus on examining the consistency of classification accuracy during increasing model orders (model order 6 to 10 or more) in order to identify the most repetitive connectivity states which contribute the most in classification.

Acknowledgements

No acknowledgement found.

References

[1] Damaraju, E., Allen, E. A., Belger, A., Ford, J. M., McEwen, S., Mathalon, D. H., ... & Calhoun, V. D. (2014). Dynamic functional connectivity analysis reveals transient states of dysconnectivity in schizophrenia. NeuroImage: Clinical, 5, 298-308.

[2] Rashid, B., Arbabshirani, M.R., Damaraju, E., Cetin, M.S., Miller, R., Pearlson, G.D., Calhoun, V.D., (2016). Classification of schizophrenia and bipolar patients using static and dynamic resting-state fmri brain connectivity. NeuroImage 134, 645–657.

Figures

Figure 1: Classification accuracy of static and dynamic FNC (flat and Brute force approach); In X axis, labels dm3-dm5 indicates the model order for dFNC and Y axis indicates mean accuracy.

Proc. Intl. Soc. Mag. Reson. Med. 27 (2019)
3867