5000

ASN: Adaptive Segmentation Network for Visual Pathway Identification in Multi-parametric MR Images

Alou Diakite^1,2, Cheng Li¹, Lei Xie³, Yuanjing Feng³, Hua Han¹, Hairong Zheng¹, and Shanshan Wang^1,4,5
¹Paul C Lauterbur Research Center for Biomedical Imaging, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China, ²University of Chinese Academy of Science, Beijing, China, ³Zhejiang University of Technology, Hangzhou, China, ⁴Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application, Guangzhou, China, ⁵Peng Cheng Laboratory, Shenzhen, China

Synopsis

Keywords: Machine Learning/Artificial Intelligence, Visualization, Adaptive convolution, visual pathway segmentation, deep learning

Motivation: Accurate visual pathway (VP) segmentation is critical for clinical diagnosis and surgical planning. Current deep learning-based methods struggle to capture significant context information, impacting the segmentation precision.

Goal(s): Improve multi-parametric MRI-based VP segmentation by designing an Adaptive Segmentation Network (ASN).

Approach: ASN uses adaptive convolution (AC) to dynamically adjust the kernel based on complementary context, facilitating the integration of contextual information. A spatial attention block selectively extracts relevant regions‘ features in each MRI sequence and fuses them.

Results: ASN's effectiveness is validated by segmenting the VP in MR images from two MRI sequences. It surpasses state-of-the-art techniques in VP segmentation.

Impact: The introduction of ASN, a new multi-parametric MR images segmentation approach, demonstrates superior performance in visual pathway (VP) segmentation in MR images, surpassing existing state-of-the-art techniques. This novel method effectively incorporates context information, leading to improved segmentation performance.

Introduction

Multi-parametric magnetic resonance imaging (MRI) plays a crucial role in the diagnosis and treatment of various diseases affecting the visual system, such as optic neuritis and optic nerve glioma ^{1, 2}. By combining different MRI sequences, such as T1-weighted and fractional anisotropy (FA), a more comprehensive understanding of the visual pathway can be obtained compared to isolated modalities. For instance, T1-weighted imaging is useful in detecting the anatomy situation, while FA can provide information about the integrity of white matter tracts. Accurate segmentation of visual pathway (VP) in multi-parametric MRI is essential for diagnosis and surgical planning, but manual segmentation is time-consuming and prone to errors. Therefore, the development of automatic and reliable algorithms for multi-parametric MRI-based VP segmentation is highly valuable for clinical practice.

During the past decade, many multi-parametric MRI segmentation methods have been developed ^1-4. These methods can be mainly divided into three categories: early fusion, late fusion, and intermediate layer fusion.

Although these methods have achieved good segmentation results, there still remain some challenges: (i) Current DL-based multi-parametric MRI segmentation methods rely on convolution operations from source images, but they struggle to model long-range context dependencies. However, neuroscience research suggests that the global context capability of neurons is crucial for effectively processing complex perceptual issues ^5,
6. (ii) Current DL-based multi-parametric MRI segmentation methods still have limitations in modeling the complex inter-sequence relationships.

To overcome the above challenges, we propose an Adaptive Segmentation Network (ASN), which uses adaptive convolution (AC) to adjust the convolutional kernel based on complementary context. This allows for effective incorporation of global context information and improves the segmentation performance. Additionally, we introduce a spatial attention block to selectively extract features from relevant regions and fuse them. Experimental results on the human connectome project (HCP) dataset validate the superior performance of ASN compared to state-of-the-art techniques.

Methodology

As per neuroscience studies on the importance of global context information in perceptual tasks, we propose a supervised multi-parametric MRI segmentation approach called Adaptive Segmentation Network (ASN) for segmenting multi-parametric MRI images (Figure 1). ASN consists of two key components: AC and spatial attention.

Inspired by ^7,
8, we use a U-like encoder-decoder structure with standard convolutions being replaced by adaptive convolutions. Specifically, AC modifies the weights of convolutional layers in CNNs based on global context information. By adaptively modulating the convolution kernels, AC enables the extraction of representative local patterns and the composition of discriminative features guided by the global context, leading to improved segmentation performance.

The spatial attention block enables the model to selectively attend to relevant regions or features in each MRI sequence and fuse them. The spatial attention mechanism is implemented by using the adaptive convolution layers in place of standard convolution layers to learn a set of spatial attention maps, which are then multiplied with the feature maps of each imaging sequence to emphasize the most informative regions.

Results and Analysis

In our study, we evaluated the performance of our proposed Adaptive Segmentation Network (ASN) on the HCP dataset (training set: 82, testing set: 10) and compared it with existing state-of-the-art techniques.

As shown in Table 1, ASN achieved a Dice similarity coefficient (DSC) of 87.7%, a Hausdorff distance (HD) of 2.13 mm, and an average symmetric surface distance (ASD) of 0.13 mm, surpassing state-of-the-art techniques like FuseNet ² and TPSN ¹. This demonstrates the accurate VP region segmentation capability of ASN. The higher DSC score indicates effective capture of true positives and reduction of false positives. The lower HD and ASD value suggests precise delineation of VP boundaries. Besides, the qualitative results illustrated in Figure 2 confirmed the quantitative results.

The superior performance of ASN can be attributed to its utilization of AC and spatial attention mechanisms. These results underscore the potential of ASN in improving clinical diagnosis and surgical planning. Accurate VP segmentation aids in the identification of diseases affecting the VP, enabling targeted treatment and intervention.

Conclusion

In summary, the proposed Adaptive Segmentation Network (ASN) demonstrated superior performance in segmenting VP in multi-parametric MRI images compared to state-of-the-art techniques. The incorporation of adaptive convolution and spatial attention mechanisms allowed to effectively capture global context information and complex inter-sequence relationships. This improved the segmentation accuracy, as evidenced by high DSC, low HD, and low ASD.

ASN has potential applications in clinical diagnosis and surgical planning by aiding in the identification of abnormalities or diseases affecting the visual pathway. Further research can explore the use of ASN in other medical imaging tasks and its generalizability to different datasets.

Acknowledgements

This research was partly supported by the National Natural Science Foundation of China (62222118, U22A2040), Guangdong Provincial Key Laboratory of Artificial Intelligence in Medical Image Analysis and Application (2022B1212010011), Shenzhen Science and Technology Program (RCYX20210706092104034, JCYJ20220531100213029), and Key Laboratory for Magnetic Resonance and Multimodality Imaging of Guangdong Province (2023B1212060052).

References

1. S. Li, Z. Chen, W. Guo, Q. Zeng, and Y. Feng, “Two parallel stages deep learning network for anterior visual pathway segmentation,” in Computational Diffusion MRI: International MICCAI Workshop, Lima, Peru, October 2020. Springer, 2021, pp. 279–290.

2. L. Xie, L. Yang, Q. Zeng, J. He, J. Huang, Y. Feng, E. Amelina, and M. Amelin, “Deep Multimodal Fusion Network for the Retinogeniculate Visual Pathway Segmentation,” in The 42nd Chinese Control Conference (CCC 2023), 2023.

3. Cheng Li, Hui Sun, Zaiyi Liu, Meiyun Wang, Hairong Zheng, and Shanshan Wang, “Learning cross-modal deep representations for multi-modal MR image segmentation,” in Medical Image Computing and Computer Assisted Intervention–MICCAI 2019: 22nd International Conference, Shenzhen, China, October 13–17, 2019, Proceedings, Part II 22. 2019, pp. 57–65, Springer.

4. Yang, Q., Guo, X., Chen, Z., Woo, P.Y. and Yuan, Y., 2022. D 2-Net: Dual disentanglement network for brain tumor segmentation with missing modalities. IEEE Transactions on Medical Imaging, 41(10), pp.2953-2964.

5. Li, W., Piëch, V. and Gilbert, C.D., 2004. Perceptual learning and top-down influences in primary visual cortex. Nature neuroscience, 7(6), pp.651-657.

6. Gilbert, C.D. and Li, W., 2013. Top-down influences on visual processing. Nature Reviews Neuroscience, 14(5), pp.350-363.

7. Lin, X., Ma, L., Liu, W. and Chang, S.F., 2020. Context-gated convolution. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XVIII 16 (pp. 701-718). Springer International Publishing.

8. Tang, W., He, F., Liu, Y. and Duan, Y., 2022. MATR: Multimodal medical image fusion via multiscale adaptive transformer. IEEE Transactions on Image Processing, 31, pp.5134-5149.

Figures

The overall pipeline of ASN with each vanilla convolution layer replacing the adaptive convolution layer. a The detailed structure of ASN’s encoder, b detailed architecture of different blocks in ASN’s encoder and c the structure of the adaptive convolution.

Qualitative comparison of VP segmentation results produced by different methods on the HCP dataset.

Quantitative results of different methods on the HCP dataset.

Proc. Intl. Soc. Mag. Reson. Med. 32 (2024)

5000

DOI: https://doi.org/10.58530/2024/5000