We propose Transformer-based Alzheimer’s disease (AD) analyzer for 3D MRI. The proposed network can analyze 3D MRI images efficiently combining 3D CNN, and Transformer. It is possible to efficiently extract locality information for AD-related abnormalities in local brain based on CNN networks with inductive bias. Also, the transformer network is also used to obtain attention relationship among 3D representation features after CNN. Our proposed method was compared to various networks including 3D CNN and transformer with an area under curve and accuracy for AD classification in multi-institutional datasets. Also, the transformer interpretability technique-based activation map can visualize AD-related abnormality region.
[1] Kenton, J. D. M. W. C. et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT, 4171-4186 (2019)
[2] Dosovitskiy, A. et al. An image is worth 16x16 words: Transformers for image recognition at scale. In International Conference on Learning Representations, (2020)
[3] Liu, Ze, et al. "Swin transformer: Hierarchical vision transformer using shifted windows." In Proceedings of ICCV (2021). [4] Chen, J. at el. ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration. In MIDL (2021)
[5] Matsoukas, Christos, et al. "Is it Time to Replace CNNs with Transformers for Medical Images?." In Proceedings of ICCV, (2021).
[6] Tondelli, Manuela, et al. "Structural MRI changes detectable up to ten years before clinical Alzheimer's disease." Neurobiology of aging 33.4 (2012): 825-e25.
[7] Vemuri, P., & Jack, C. R. (2010). Role of structural MRI in Alzheimer's disease. Alzheimer's research & therapy, 2(4), 1-10.
[8] E Petersen, R. C., Aisen, P. S., Beckett, L. A., Donohue, M. C., Gamst, A. C., Harvey, D. J., ... & Weiner, M. W. (2010). Alzheimer's disease neuroimaging initiative (ADNI): clinical characterization. Neurology, 74(3), 201-209.
[9] Ellis, K. A., Rowe, C. C., Villemagne, V. L., Martins, R. N., Masters, C. L., Salvado, O., ... & AIBL Research Group. (2010). Addressing population aging and Alzheimer's disease through the Australian Imaging Biomarkers and Lifestyle study: Collaboration with the Alzheimer's Disease Neuroimaging Initiative. Alzheimer's & dementia, 6(3), 291-296.
[10] LaMontagne, P. J., Benzinger, T. L., Morris, J. C., Keefe, S., Hornbeck, R., Xiong, C., ... & Marcus, D. (2019). OASIS-3: longitudinal neuroimaging, clinical, and cognitive dataset for normal aging and Alzheimer disease. MedRxiv.
[11] G. Huang, Z. Liu, L. Van Der Maaten and K. Q. Weinberger, "Densely Connected Convolutional Networks," 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017, pp. 2261-2269, doi: 10.1109/CVPR.2017.243.
[12] Chen, Ting, et al. "A simple framework for contrastive learning of visual representations." International conference on machine learning. PMLR, 2020.
[13] He, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 770-778).
[14] Fulton, L. V., Dolezel, D., Harrop, J., Yan, Y., & Fulton, C. P. (2019). Classification of Alzheimer’s disease with and without imagery using gradient boosted machines and ResNet-50. Brain sciences, 9(9), 212.
[15] Ebrahimi, A., Luo, S., & Chiong, R. (2020, November). Introducing Transfer Leaming to 3D ResNet-18 for Alzheimer’s Disease Detection on MRI Images. In 2020 35th International Conference on Image and Vision Computing New Zealand (IVCNZ) (pp. 1-6). IEEE.
[16] Carreira, J., & Zisserman, A. (2017). Quo vadis, action recognition? a new model and the kinetics dataset. In proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6299-6308).
[17] Bien, N., Rajpurkar, P., Ball, R. L., Irvin, J., Park, A., Jones, E., ... & Lungren, M. P. (2018). Deep-learning-assisted diagnosis for knee magnetic resonance imaging: development and retrospective validation of MRNet. PLoS medicine, 15(11), e1002699.
[18] Chen, S., Ma, K., & Zheng, Y. (2019). Med3d: Transfer learning for 3d medical image analysis. arXiv preprint arXiv:1904.00625.
[19] Chefer, H., Gur, S., & Wolf, L. (2021). Transformer interpretability beyond attention visualization. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 782-791).