Meng Han, Xiangde Luo, Xiangjiang Xie, Wenjun Liao, Shichuan Zhang, Tao Song, Guotai Wang, Shaoting Zhang
DOI: 10.1016/j.media.2024.103274
Journal: Medical Image Analysis (IF 10.7, Q1, Computer Science, Artificial Intelligence)
Publication date: 2024-07-15 (Journal Article)
Full text: https://www.sciencedirect.com/science/article/pii/S1361841524001993
DMSPS: Dynamically mixed soft pseudo-label supervision for scribble-supervised medical image segmentation
High performance of deep learning on medical image segmentation relies on large-scale pixel-level dense annotations, which pose a substantial burden on medical experts because the annotation process is laborious and time-consuming, particularly for 3D images. To reduce the labeling cost while maintaining relatively satisfactory segmentation performance, weakly-supervised learning with sparse labels has attracted increasing attention. In this work, we present a scribble-based framework for medical image segmentation, called Dynamically Mixed Soft Pseudo-label Supervision (DMSPS). Concretely, we extend a backbone with an auxiliary decoder to form a dual-branch network, enhancing the feature capture capability of the shared encoder. Considering that most pixels lack labels and that hard pseudo-labels tend to be over-confident, resulting in poor segmentation, we propose using soft pseudo-labels, generated by dynamically mixing the decoders' predictions, as auxiliary supervision. To further enhance the model's performance, we adopt a two-stage approach in which the sparse scribbles are expanded based on low-uncertainty predictions from the first-stage model, providing more annotated pixels to train the second-stage model. Experiments on the ACDC dataset for cardiac structure segmentation, the WORD dataset for 3D abdominal organ segmentation, and the BraTS2020 dataset for 3D brain tumor segmentation showed that: (1) compared with the baseline, our method improved the average DSC from 50.46% to 89.51%, from 75.46% to 87.56%, and from 52.61% to 76.53% on the three datasets, respectively; (2) DMSPS achieved better performance than five state-of-the-art scribble-supervised segmentation methods and generalizes to different segmentation backbones. The code is available online at: https://github.com/HiLab-git/DMSPS.
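The two core ideas in the abstract — mixing the two decoders' predictions with a dynamically sampled weight to form a soft pseudo-label, and expanding sparse scribbles using low-uncertainty first-stage predictions — can be illustrated with a minimal sketch. This is reconstructed from the abstract alone: the function names, the uniform mixing weight, and the entropy threshold `tau` are illustrative assumptions, not the authors' exact formulation (see the linked repository for the real implementation).

```python
import numpy as np

def mixed_soft_pseudo_label(p_main, p_aux, rng):
    """Dynamically mix two decoders' softmax outputs (shape: N x C).

    The mixing weight alpha is resampled on every call (i.e. every
    training step), so the soft pseudo-label changes dynamically
    rather than committing to either branch.
    """
    alpha = rng.uniform(0.0, 1.0)
    return alpha * p_main + (1.0 - alpha) * p_aux

def expand_scribbles(soft_label, scribble, tau=0.6, unlabeled=-1):
    """Stage-2 scribble expansion (assumed entropy criterion).

    Pixels that are unlabeled in the scribble annotation but whose
    first-stage soft prediction has low entropy (high confidence)
    are promoted to pseudo-annotated pixels for second-stage training.
    """
    entropy = -np.sum(soft_label * np.log(soft_label + 1e-8), axis=1)
    confident = (entropy < tau) & (scribble == unlabeled)
    expanded = scribble.copy()
    expanded[confident] = soft_label.argmax(axis=1)[confident]
    return expanded
```

A usage note: because the mixed output is a convex combination of two probability maps, it remains a valid per-pixel distribution, so it can supervise the unlabeled pixels with a soft cross-entropy term while the scribbled pixels keep their ordinary supervised loss.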
About the journal:
Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.