Semi-Supervised Segmentation of Abdominal Organs and Liver Tumor: Uncertainty Rectified Curriculum Labeling Meets X-Fuse

Pengju Lyu, Wenjian Liu, Tingyi Lin, Jie Zhang, Yao Liu, Cheng Wang, Jianjun Zhu
{"title":"Semi-Supervised Segmentation of Abdominal Organs and Liver Tumor: Uncertainty Rectified Curriculum Labeling Meets X-Fuse","authors":"Pengju Lyu, Wenjian Liu, Tingyi Lin, Jie Zhang, Yao Liu, Cheng Wang, Jianjun Zhu","doi":"10.1088/2632-2153/ad4c38","DOIUrl":null,"url":null,"abstract":"\n Precise liver tumors and associated organ segmentation hold immense value for surgical and radiological intervention, enabling anatomical localization for pre-operative planning and intra-operative guidance. Modern deep learning models for medical image segmentation have evolved from convolution neural networks to transformer architectures, significantly boosting global context understanding. However, accurate delineation especially of hepatic lesions remains an enduring challenge due to models’ predominant focus solely on spatial feature extraction failing to adequately characterize complex medical anatomies. Moreover, the relative paucity of expertly annotated medical imaging data restricts model exposure to diverse pathological presentations. In this paper, we present a three-phrased cascaded segmentation framework featuring an X-Fuse model that synergistically integrates spatial and frequency domain’s complementary information in dual encoders to enrich latent feature representation. To enhance model generalizability, building upon X Fuse topology and taking advantage of additional unlabeled pathological data, our proposed integration of curriculum pseudo-labeling with Jensen-Shannon variance-based uncertainty rectification promotes optimized pseudo-supervision in the context of semi-supervised learning. We further introduce a tumor-focus augmentation technique including training-free copy-paste and knowledge-based synthesis that show efficacy in simplicity, contributing to the substantial elevation of model adaptability on diverse lesional morphologies. Extensive experiments and modular evaluations on a holdout test set demonstrate that our methods significantly outperform existing state-of-the-art segmentation models in both supervised and semi-supervised settings, as measured by the Dice similarity coefficient, achieving superior delineation of bones (95.42%), liver (96.26%), and liver tumors (89.53%) with 16.41% increase comparing to V-Net on supervised-only and augmented-absent scenario. Our method marks a significant step toward the realization of more reliable and robust AI-assisted diagnostic tools for liver tumor intervention. We have made the codes publicly available.","PeriodicalId":503691,"journal":{"name":"Machine Learning: Science and Technology","volume":"57 7","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine Learning: Science and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1088/2632-2153/ad4c38","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Precise liver tumors and associated organ segmentation hold immense value for surgical and radiological intervention, enabling anatomical localization for pre-operative planning and intra-operative guidance. Modern deep learning models for medical image segmentation have evolved from convolution neural networks to transformer architectures, significantly boosting global context understanding. However, accurate delineation especially of hepatic lesions remains an enduring challenge due to models’ predominant focus solely on spatial feature extraction failing to adequately characterize complex medical anatomies. Moreover, the relative paucity of expertly annotated medical imaging data restricts model exposure to diverse pathological presentations. In this paper, we present a three-phrased cascaded segmentation framework featuring an X-Fuse model that synergistically integrates spatial and frequency domain’s complementary information in dual encoders to enrich latent feature representation. To enhance model generalizability, building upon X Fuse topology and taking advantage of additional unlabeled pathological data, our proposed integration of curriculum pseudo-labeling with Jensen-Shannon variance-based uncertainty rectification promotes optimized pseudo-supervision in the context of semi-supervised learning. We further introduce a tumor-focus augmentation technique including training-free copy-paste and knowledge-based synthesis that show efficacy in simplicity, contributing to the substantial elevation of model adaptability on diverse lesional morphologies. Extensive experiments and modular evaluations on a holdout test set demonstrate that our methods significantly outperform existing state-of-the-art segmentation models in both supervised and semi-supervised settings, as measured by the Dice similarity coefficient, achieving superior delineation of bones (95.42%), liver (96.26%), and liver tumors (89.53%) with 16.41% increase comparing to V-Net on supervised-only and augmented-absent scenario. Our method marks a significant step toward the realization of more reliable and robust AI-assisted diagnostic tools for liver tumor intervention. We have made the codes publicly available.
腹部器官和肝脏肿瘤的半监督分割:不确定性矫正课程标签与 X-Fuse 的结合
精确的肝脏肿瘤和相关器官分割对外科手术和放射学干预具有巨大价值,可为术前规划和术中指导提供解剖定位。用于医学影像分割的现代深度学习模型已从卷积神经网络发展到变换器架构,极大地增强了对全局上下文的理解。然而,由于模型主要侧重于空间特征提取,无法充分表征复杂的医学解剖结构,因此准确划分病变(尤其是肝脏病变)仍是一项持久的挑战。此外,专家注释的医学影像数据相对较少,限制了模型对各种病理表现的接触。在本文中,我们提出了一个以 X-Fuse 模型为特色的三词组级联分割框架,该模型在双编码器中协同整合了空间域和频率域的互补信息,从而丰富了潜在特征表征。为了增强模型的通用性,我们在 X Fuse 拓扑的基础上,利用额外的未标记病理数据,提出了课程伪标记与基于 Jensen-Shannon 方差的不确定性矫正的整合方案,在半监督学习的背景下促进了伪监督的优化。我们进一步介绍了一种肿瘤焦点增强技术,包括免训练复制粘贴和基于知识的合成,这些技术显示了简便性的功效,有助于大幅提高模型对不同病变形态的适应性。在保留测试集上进行的广泛实验和模块化评估表明,我们的方法在监督和半监督设置中都明显优于现有的一流分割模型,以 Dice 相似性系数来衡量,我们的方法在骨骼(95.42%)、肝脏(96.26%)和肝脏肿瘤(89.53%)的划分上都更胜一筹,在仅监督和无增强的情况下,与 V-Net 相比提高了 16.41%。我们的方法标志着向实现更可靠、更稳健的肝脏肿瘤人工智能辅助诊断工具迈出了重要一步。我们已公开了相关代码。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信