1022

Segmentation of Brain Lesions Using Posterior Distributions Learned by Subspace-assisted Generative Model
Huixiang Zhuang1, Yue Guan1, Yi Ding1, Chang Xu1, Yuhao Ma1, Ziyu Meng1, Ruihao Liu1,2, Zhi-Pei Liang2,3, and Yao Li1
1School of Biomedical Engineering, Shanghai Jiao Tong University, Shanghai, China, 2Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, United States, 3Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, United States

Synopsis

Keywords: Segmentation, Segmentation, Lesion segmentation; Generative model

Motivation: Deep learning shows great potential for brain lesion segmentation but poor generalization (due to limited training data) could lead to false positives.

Goal(s): Our goal was to improve the segmentation accuracy by learning target-specific posterior distributions.

Approach: We proposed a new Bayesian brain lesion segmentation method, leveraging posterior distributions learning, including both posterior normal and lesion distributions, through a subspace-assisted deep generative model.

Results: The proposed method achieved significantly improved segmentation performance across multiple public datasets with stroke, tumor, and multiple sclerosis lesions, in comparison with the state-of-the-art methods.

Impact: The proposed method significantly improved accuracy and robustness of lesions segmentation in brain MR images, which may provide a useful tool for brain lesion delineation in image processing and clinical applications.

Introduction

Accurate segmentation of brain lesions are essential for treatment planning of brain disorders1. Unsupervised learning methods, focusing on learning normative distributions from images of healthy subjects, are less dependent on lesion-labeled data, thus exhibiting better generalization capabilities2,3. A fundamental challenge in learning normative distributions of images lies in the high dimensionality if each pixel is treated as a random variable. Recent deep-learning based methods, such as Variational Auto-Encoders (VAEs) and Generative Adversarial Nets (GANs), have generated very encouraging results2,4. In this study, we further extended those methods using a subspace-assisted deep generative model to capture both posterior normal and lesion distributions. The proposed method achieved significantly improved segmentation performance across multiple public datasets with stroke, tumor, and multiple sclerosis lesions, in comparison with the state-of-the-art methods.

Methods

The overall framework of the proposed method is shown in Fig. 1, which contains three integral components: 1) posterior normative distribution learned by an image subspace-assisted deep generative model; 2) posterior lesion distribution learned by a tissue subspace-assisted Bayesian classifier; 3) a fusion network to generate the final segmentation.

Problem formulation $$$\\$$$

Given an image with lesion $$$I_t(x)$$$, we represent it as $$$I_t(x)=I_N(x)+\Delta(x),$$$ where $$$I_N(x)$$$ represents an “hypothetical” lesion-free image and $$$\Delta(x)$$$ contains the lesion features. The following posterior distributions are estimated in our proposed method to capture the statistics of $$$I_N(x)$$$ and $$$\Delta(x)$$$ for improved segmentation: a) the position-dependent posterior normal intensity distribution, $$$p(I_N(x_i)|I_t(x))$$$ , at each pixel $$$i$$$; and b) the posterior lesion intensity distribution, $$$p(\Delta(x)|I_t(x))$$$.

Learning $$$\,p(I_N(x_i)|I_t(x))\\$$$
We proposed an image subspace-assisted conditional generative model to estimate $$$p(I_N(x_i)|I_t(x))$$$. After training, $$$p(I_N(x_i)|I_t(x))$$$ can be estimated based on the conditional samples from the learned generative model.
$$$\quad$$$In particular, we first obtained one posterior normal image $$$\hat{I}_N(x)$$$ as the condition of the generative model, leveraging a probabilistic image subspace model constrained by the pre-estimated neighboring normal pixels in $$$I_t(x)$$$ (denoted as $$$W(x)$$$). Specifically, $$$\hat{I}_N(x)$$$ was obtained by: $$\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\hat{I}_N(x)=(1-W(x))\odot\sum_{r=1}^Ra_r\phi_r(x)+W(x)\odot I_t(x), \qquad\qquad\qquad\qquad\qquad\qquad(1)$$where $$$W(x)$$$ is estimated leveraging prior intensity distributions of the normal training images $$$\{\rho_m(x)\}$$$ in a multiscale sense5, $$$\{\phi_r(x)\}$$$ is the pre-learned basis functions obtained by applying principal component analysis to $$$\{\rho_m(x)\}$$$, $$$a_r\sim p(\{a_r\})$$$ is the corresponding coefficients obtained by maximum-a-posteriori estimation: $$\qquad\qquad\qquad\qquad\qquad\qquad{\hat{a}_r}=\arg\min\limits_{\{a_r\}}\left\|W(x)\odot\left[I_t(x)-\sum_{r=1}^R a_r\phi_r(x)\right]\right\|_2-\lambda_{\text{coeff}} \log p(\{a_r\}). \qquad\qquad\qquad\qquad\qquad(2)$$$$$\quad$$$After $$$\hat{I}_N(x)$$$ was determined, we learned a conditional GAN to capture the nonlinear mapping from noise samples $$$z$$$ to a real image $$$I_N(x)$$$, conditioned on $$$\hat{I}_N(x)$$$. We adapted the Wasserstein GAN6 with improved loss function: $$L_{\text{cWGAN}}(G,D)=\mathbb{E}_{\hat{I}_N,z,I_N}\left[D(I_N)-D(G(z,\hat{I}_N))\right]+\lambda_{\text{GP}}L_{\text{GP}}(D)\\ \qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\quad+\lambda_1L_{\text{L1}}(G,\hat{I}_N)+\lambda_2L_{\text{CE}}(G,\hat{I}_N)+\lambda_3L_{\text{VGG}}(G,\hat{I}_N), \qquad\qquad\qquad\qquad\;(3)$$where $$$L_{\text{GP}}$$$, $$$L_{\text{L1}}$$$, $$$L_{\text{CE}}$$$, $$$L_{\text{VGG}}$$$ is the gradient penalty, L1-loss, pixel-wise cross entropy loss and VGG loss, respectively. The patient-specific posterior normal images with high-fidelity details could be obtained by Monte Carlo sampling conditioned on $$$\hat{I}_N(x)$$$. $$$p(I_N(x_i)|I_t(x))$$$ could then be estimated from the sampled posterior images using a Mixture of Gaussian (MOG) model.

Estimation of $$$\,p(\Delta(x)|I_t(x))\\$$$
To estimate $$$p(\Delta(x)|I_t(x))$$$, we first obtained a global normative spatial-intensity probability distribution: $$\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad p(\hat{I}_N(x_i),c(x_i);\pmb{\theta})=\sum_{k=1}^{3}p(\hat{I}_N(x_i)|c(x_i);\pmb{\theta}_k)\cdot p(c(x_i)), \qquad\qquad\qquad\qquad\qquad\quad\,(4)$$where $$$c$$$ represents the tissue class in $$$\hat{I}_N(x_i)$$$ and $$$p(\hat{I}_N(x_i)|c(x_i);\pmb{\theta}_k)$$$ is the likelihood of the observed intensity $$$p(\hat{I}_N(x_i))$$$ to be tissue class $$$k$$$. To obtain the posterior $$$p(c(x_i))$$$ of $$$p(\hat{I}_N(x_i))$$$, we incorporated a tissue subspace model into the Bayesian classifier for robust segmentation: $$\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\pi_k(c(x_i))=\sum_{r=1}^{R'}\beta_{r,k}\psi_{r,k}(c(x_i)), \qquad\qquad\qquad\qquad\qquad\qquad\qquad\qquad\quad\;(5)$$where $$$\{\psi_{r,k}(c)\}$$$ are the basis functions of tissue $$$k$$$ obtained on the tissue segmentation labels $$$\{c_1,c_2,\ldots,c_M\}$$$ of the training images. After $$$p(c(x_i))$$$ was obtained, $$$p(\hat{I}_N(x_i),c(x_i);\pmb{\theta})$$$ was fitted using the collection of pixels based on the predicted $$$c(x_i)$$$. With $$$p(\hat{I}_N(x_i),c(x_i);\pmb{\theta})$$$ estimated, the lesion class in $$$I_t(x)$$$ was obtained leveraging voxel-wise Bayesian hypothesis testing, from which $$$p(\Delta(x)|I_t(x))$$$ was estimated using an MOG model.

Fusion network$$$\\$$$

The final classification was determined by fusing the features derived from the learned posterior distributions using a patch-based convolutional neural network, including: 1) position-dependent posterior distributions; 2) multiscale region-level normal/lesion likelihood ratio; 3) multiscale posterior lesion distribution.

Results

To obtain the normal image prior, we used 20000 T1w and 20000 FLAIR brain images from Biobank dataset7. The performance was evaluated on simulation data, BraTS2019 tumor (N=288)8, MSSEG2016 MS (N=53)9 and ATLAS stroke (N=236)10 datasets, respectively. Fig. 2 demonstrates our proposed method consistently and accurately segmented lesions across various sizes, locations and intensities. As shown in Figs 3-5, our proposed method outperformed the state-of-the-art deep learning-based abnormality detection methods2,11–15 for challenging lesion/normal interface (tumor), small and isolated lesions (MS) and atrophy brains (stroke). Quantitative analysis also indicated that the proposed method had improved performance compared with the state-of-the-art methods.

Conclusions

This paper presents a new method for lesion segmentation leveraging posterior distributions learned using a subspace-assisted generative model. The proposed method showed significantly improved accuracy and robustness across multiple public brain MR image datasets with various lesion types.

Acknowledgements

This work was supported by Shanghai Pilot Program for Basic Research—Shanghai Jiao Tong University (21TQ1400203), the Program for Professor of Special Appointment (Eastern Scholar) at Shanghai Institutions of Higher Learning, Key Program of Multidisciplinary Cross Research Foundation of Shanghai Jiao Tong University (YG2021ZD28, YG2023ZD22), National Natural Science Foundation of China (62001293).

References

1. Prastawa M, Bullitt E, Ho S, Gerig G. A brain tumor segmentation framework based on outlier detection. Med Image Anal. 2004;8(3):275-283.

2. Chen X, You S, Tezcan KC, Konukoglu E. Unsupervised lesion detection via image restoration with a normative prior. Med Image Anal. 2020;64.

3. Van Leemput K, Maes F, Vandermeulen D, Colchester A, Suetens P. Automated segmentation of multiple sclerosis lesions by model outlier detection. IEEE Trans Med Imaging. 2001;20(8):677-688.

4. Baur C, Denner S, Wiestler B, Navab N, Albarqouni S. Autoencoders for unsupervised anomaly segmentation in brain MR images: a comparative study. Med Image Anal. 2021;69:101952.

5. Arbeláez P, Maire M, Fowlkes C, Malik J. Contour detection and hierarchical image segmentation. IEEE Trans Pattern Anal Mach Intell. 2011;33(5).

6. Isola P, Zhu JY, Zhou T, Efros AA. Image-to-image translation with conditional adversarial networks. In: Proceedings - 30th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017. Vol 2017-January; 2017.

7. Alfaro-Almagro F, Jenkinson M, Bangerter NK, et al. Image processing and Quality Control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage. 2018;166:400-424.

8. Menze BH, Jakab A, Bauer S, et al. The multimodal brain tumor image segmentation benchmark (BRATS). IEEE Trans Med Imaging. 2014;34(10):1993-2024.

9. Liew SL, Anglin JM, Banks NW, et al. A large, open source dataset of stroke anatomical brain images and manual lesion segmentations. Sci Data. 2018;5(1):1-11.

10. Commowick O, Kain M, Casey R, et al. Multiple sclerosis lesions segmentation from multiple experts: The MICCAI 2016 challenge dataset. Neuroimage. 2021;244.

11. You S, Tezcan KC, Chen X, Konukoglu E. Unsupervised lesion detection via image restoration with a normative prior. In: International Conference on Medical Imaging with Deep Learning. PMLR; 2019:540-556.

12. Silva-Rodríguez J, Naranjo V, Dolz J. Constrained unsupervised anomaly segmentation. Med Image Anal. 2022;80.

13. Schlegl T, Seeböck P, Waldstein SM, Langs G, Schmidt-Erfurth U. f-AnoGAN: Fast unsupervised anomaly detection with generative adversarial networks. Med Image Anal. 2019;54:30-44.

14. Wyatt J, Leach A, Schmon SM, Willcocks CG. AnoDDPM: Anomaly Detection with Denoising Diffusion Probabilistic Models using Simplex Noise. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. Vol 2022-June. IEEE Computer Society; 2022:649-655.

15. Zimmerer D, Kohl SAA, Petersen J, Isensee F, Maier-Hein KH. Context-encoding variational autoencoder for unsupervised anomaly detection. arXiv preprint arXiv:181205941. 2018.

Figures

Figure 1. Illustration of the overall pipeline of the proposed method. (a) Position-dependent posterior normative intensity distribution learning; (b) Posterior lesion intensity distribution learning; (c) Fusion network.

Figure 2. Lesion segmentation performance with different locations, sizes, intensities of simulated lesions on Biobank FLAIR dataset. The left columns are the example images of simulated lesions with intensity offset +200 or -200. The right columns are the average DICE scores for the segmentation of simulated lesions for each intensity offset value. (a) Hyperintense lesion in white matter; (b) Hyperintense lesion near periventricular zone; (c) Hypointense lesion in cortical region. Our proposed method shows consistently improved accuracy in all cases.

Figure 3. Lesion segmentation results on BraTS2019 dataset (N=288) for brain tumor. (a) Detection error maps and binarized predicted lesion masks obtained by our proposed method and the state-of-the-art methods; (b) Quantitative comparison of segmentation performance. The proposed method achieved the best overall segmentation accuracy compared to the state-of-the-art methods.

Figure 4. Lesion segmentation results on MSSEG2016 dataset (N=53) for multiple sclerosis (MS). (a) Detection error maps and binarized predicted lesion masks obtained by our proposed method and the state-of-the-art methods; (b) Quantitative comparison of segmentation performance. The proposed method achieved the best overall segmentation accuracy compared to the state-of-the-art methods.

Figure 5. Lesion segmentation results on ATLAS dataset (N=236) for stroke. (a) Detection error maps and binarized predicted lesion masks obtained by our proposed method and the state-of-the-art methods; (b) Quantitative comparison of segmentation performance. The proposed method achieved the best overall segmentation accuracy compared to the state-of-the-art methods.

Proc. Intl. Soc. Mag. Reson. Med. 32 (2024)
1022
DOI: https://doi.org/10.58530/2024/1022