Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
Diffusion semantic segmentation model: A generative model for medical image segmentation based on joint distribution
4
Zitationen
7
Autoren
2025
Jahr
Abstract
Abstract Background The mainstream semantic segmentation schemes in medical image segmentation are essentially discriminative paradigms based on conditional distributions . Although efficient and straightforward, this prevalent paradigm focuses solely on extracting image features while ignoring the underlying data distribution . Therefore, the learned feature space exhibits inherent instability, which directly affects the precision of the model in delineating anatomical boundaries. Purpose This paper reformulates the semantic segmentation task as a distribution alignment problem for medical image segmentation, aiming to minimize the gap between model predictions and ground truth labels by modeling the joint distribution of the data. Methods We propose a novel segmentation architecture based on joint distribution, called Denoising Semantic Segmentation Model (DSSM). We propose learning classification decision boundaries in pixel feature space and modeling joint distributions in latent feature space. Specifically, DSSM optimizes probability maps based on pixel feature classification through Bayesian posterior probabilities. To this end, we design a Feature Fusion Module (FFM) to guide the generative module in inference and provide label features for the semantic module. Furthermore, we introduce a stable Markov inference process to reduce inference offset. Finally, the joint distribution‐based model is end‐to‐end trained in a discriminative manner, that is, maximizing , which endows DSSM with the strengths of both generative and discriminative models. Results The image datasets utilized in this study are from different modalities, including MRI scans, x‐ray images, and skin lesion photographic images, demonstrating superior performance compared to state‐of‐the‐art (SOTA) discriminative models. Specifically, DSSM achieved a Dice coefficient of 0.8871 in MSD cardiac MRI segmentation, 0.9451 in ACDC left ventricular MRI segmentation, and 0.9647 in x‐ray image segmentation. DSSM also reached 0.8731 Dice in prostate MRI segmentation. Furthermore, in the field of skin lesion segmentation, DSSM achieved a Dice score of 0.8869 on the ISIC 2018 dataset and delivered exceptional performance with 0.9421 on the PH2 dataset. Besides the Dice score, HD95, mIoU, Precision, and Recall are evaluated across the above datasets, which further demonstrate the superior performance of DSSM. Conclusions Our methodology enables the stabilization of the learned feature space by effectively capturing the latent feature distribution information. Experimental results demonstrate that our model considerably outperforms traditional discriminative segmentation methods across a variety of datasets from multiple modalities.
Ähnliche Arbeiten
Deep Residual Learning for Image Recognition
2016 · 220.256 Zit.
U-Net: Convolutional Networks for Biomedical Image Segmentation
2015 · 88.003 Zit.
ImageNet classification with deep convolutional neural networks
2017 · 75.677 Zit.
Very Deep Convolutional Networks for Large-Scale Image Recognition
2014 · 75.505 Zit.
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
2016 · 53.720 Zit.