We introduce a deep learning approach to derive functional fingerprint of the brain that can identify individuals. By investigating the features extracted by our model, we noticed that they mostly resemble the existing resting state networks, and three networks (default mode, attention, and frontopariental control networks) contribute the most to individual discriminability.
FMRI data of 100 subjects from human connectome project (HCP) was used in this work (age: 22-36+, gender: 46 male/ 54 female, TR=0.72 second). Each subject has 4 different scans with a total duration of 57.6 minutes (14.4 minutes x 4). We used the data that was denoised with ICA-FIX and registered to group level cortical surface template with MSM-All from HCP preprocessing pipeline3. To reduce computational complexity, signals from 236 regions of interest (ROIs)4 were extracted by averaging time courses of voxels within a 5 mm radius. Data from 3 scans (REST1_LR, REST1_RL and REST2_RL) was adopted as training dataset, and data from REST2_LR was used as testing dataset. The entire scan was cropped into 100-time-frame periods with a step size of 8. Lastly, these short periods were demeaned and normalized within themselves.
A recurrent neural network was applied to capture the sequential information in fMRI data. Our deep learning model architecture has three layers as shown in Figure 1. The first layer is an average pooling layer with a length of 4 on temporal axis to shorten the number of time steps for the following recurrent layer and thus to reduce computation complexity. A 25% dropout was applied on the input before the pooling layer to alleviate overfitting. The second layer is a recurrent layer with 512 gated recurrent units (GRUs)5 as represented by the following equations:
$$z_t=\sigma(W_z\cdot[h_{t-1},x_t])$$
$$r_t=\sigma(W_r\cdot[h_{t-1},x_t])$$
$$\tilde{h}_t=tanh(W_h\cdot[r_th_{t-1},x_t])$$
$$h_t=(1-z_t)h_{t-1}+z_t\tilde{h}_t)$$
where at time t, xt is the input of GRU; ht is the output of GRU; zt is the update gate and rt is the reset gate. Note that 25% dropout was also applied to both input connections (from xt) and recurrent connections (from ht-1) to reduce overfitting. The top layer is a softmax layer with 100 outputs to predict the individual identity.
Plenty of efforts have been devoted to reducing individual differences, such as developing brain template and registration methods8, so that group level analysis can be conducted. However, we have shown that individual variability still exists in how the brain functions over time. Given only 72 seconds of fMRI data, we were able to predict individual identity with 97%+ accuracy, which was much higher than the approach that used only FC. This informs us that at a smaller temporal scale, our model can capture more individual variability than traditional FC. In order to test whether this additional discriminability is from neuronal sources or residual physiological signals after the preprocessing steps, we further investigated the output of GRUs and found that most of these GRUs were sensitive to RSNs defined in literature. We also noticed that the same RSN appeared multiple times with slight differences as shown in Figure 4. Some of these duplications are due to the dropout we applied in the model. With dropout layers, the model would keep a certain level of redundancy in order to achieve robust prediction. Within all these similar activation patterns, 3 RSNs (default mode, attention, and frontopariental control networks) have the most varieties, which indicates these 3 RSNs contribute the most to individual subject discriminability.
In the future, we believe individual identification can be used as a pre-training step of deep learning approaches, and the resultant model can be further fine-tuned to accomplish classification of brain diseases.
1. Finn, E. S. et al. Functional connectome fingerprinting: identifying individuals using patterns of brain connectivity. Nature neuroscience (2015).
2. Chen, B. et al. Individual Variability and Test-Retest Reliability Revealed by Ten Repeated Resting-State Brain Scans over One Month. PloS one 10, e0144963 (2015).
3. Glasser, M. F. et al. The minimal preprocessing pipelines for the Human Connectome Project. Neuroimage 80, 105-124 (2013).
4. Power, J. D. et al. Functional network organization of the human brain. Neuron 72, 665-678 (2011).
5. Cho, K. et al. Learning phrase representations using RNN encoder-decoder for statistical machine translation. arXiv preprint arXiv:1406.1078 (2014).
6. Yeo, B. T. et al. The organization of the human cerebral cortex estimated by intrinsic functional connectivity. Journal of neurophysiology 106, 1125-1165 (2011).
7. Lee, M. H., Smyser, C. D. & Shimony, J. S. Resting-state fMRI: a review of methods and clinical applications. American Journal of Neuroradiology 34, 1866-1872 (2013).
8. Robinson, E. C. et al. MSM: A new flexible framework for Multimodal Surface Matching. Neuroimage 100, 414-426 (2014).