Position-aware representation learning with anatomical priors for enhanced pancreas tumor segmentation

IF 5.5 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Neurocomputing Pub Date : 2024-11-16 DOI:10.1016/j.neucom.2024.128881

Kaiqi Dong , Peijun Hu , Yu Tian , Yan Zhu , Xiang Li , Tianshu Zhou , Xueli Bai , Tingbo Liang , Jingsong Li

{"title":"Position-aware representation learning with anatomical priors for enhanced pancreas tumor segmentation","authors":"Kaiqi Dong , Peijun Hu , Yu Tian , Yan Zhu , Xiang Li , Tianshu Zhou , Xueli Bai , Tingbo Liang , Jingsong Li","doi":"10.1016/j.neucom.2024.128881","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate pancreatic tumor segmentation in CT images is crucial but challenging due to the complex anatomy and varied tumor appearance. Previous methods predominantly adopt two-stage segmentation approaches to identify and localize tumors and rely heavily on CNN-extracted texture features. In this study, we propose a tumor position-aware branch to learn pancreatic anatomical priors and integrate them into a standard 3D U-Net segmentation network. The tumor position-aware branch consists of three innovative components. Firstly, the proposed method utilizes discrete information bottleneck theory to extract compact and informative segmentation features with pancreatic anatomical priors. Secondly, we propose a coordinate position encoding transformer that encodes the spatial coordinates of each patch within the CT volume. This encoding provides the model with a global positional context, allowing it to effectively model the spatial relationships between anatomical structures. Thirdly, a probability margin regularization loss is proposed to further eliminate the interference of background patches on the learning of pancreatic anatomical positions. Our model is trained and validated our model on the public Medical Segmentation Decathlon (MSD) dataset and a private clinical dataset. Experimental results demonstrate that our approach achieves competitive performance compared to state-of-the-art (SOTA) methods in both pancreas and tumor segmentation, with Dice scores of 82.11% for the pancreas and 55.56% for the tumor on the MSD dataset. The proposed framework offers an effective solution to leverage anatomical priors and enhance representation learning for improved pancreatic tumor segmentation.</div></div>","PeriodicalId":19268,"journal":{"name":"Neurocomputing","volume":"616 ","pages":"Article 128881"},"PeriodicalIF":5.5000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurocomputing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0925231224016527","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate pancreatic tumor segmentation in CT images is crucial but challenging due to the complex anatomy and varied tumor appearance. Previous methods predominantly adopt two-stage segmentation approaches to identify and localize tumors and rely heavily on CNN-extracted texture features. In this study, we propose a tumor position-aware branch to learn pancreatic anatomical priors and integrate them into a standard 3D U-Net segmentation network. The tumor position-aware branch consists of three innovative components. Firstly, the proposed method utilizes discrete information bottleneck theory to extract compact and informative segmentation features with pancreatic anatomical priors. Secondly, we propose a coordinate position encoding transformer that encodes the spatial coordinates of each patch within the CT volume. This encoding provides the model with a global positional context, allowing it to effectively model the spatial relationships between anatomical structures. Thirdly, a probability margin regularization loss is proposed to further eliminate the interference of background patches on the learning of pancreatic anatomical positions. Our model is trained and validated our model on the public Medical Segmentation Decathlon (MSD) dataset and a private clinical dataset. Experimental results demonstrate that our approach achieves competitive performance compared to state-of-the-art (SOTA) methods in both pancreas and tumor segmentation, with Dice scores of 82.11% for the pancreas and 55.56% for the tumor on the MSD dataset. The proposed framework offers an effective solution to leverage anatomical priors and enhance representation learning for improved pancreatic tumor segmentation.

查看原文本刊更多论文

基于解剖先验的位置感知表征学习增强胰腺肿瘤分割

由于复杂的解剖结构和不同的肿瘤外观，在CT图像中准确的胰腺肿瘤分割是至关重要的，但具有挑战性。以前的方法主要采用两阶段分割方法来识别和定位肿瘤，并且严重依赖于cnn提取的纹理特征。在这项研究中，我们提出了一个肿瘤位置感知分支来学习胰腺解剖先验，并将它们整合到标准的3D U-Net分割网络中。肿瘤位置感知分支由三个创新组件组成。该方法首先利用离散信息瓶颈理论提取具有胰腺解剖先验的紧凑、信息丰富的分割特征；其次，我们提出了一种坐标位置编码转换器，对CT体内每个patch的空间坐标进行编码。这种编码为模型提供了一个全局位置上下文，使其能够有效地模拟解剖结构之间的空间关系。再次，提出一种概率边缘正则化损失，进一步消除背景斑块对胰腺解剖位置学习的干扰。我们的模型在公共医疗分割十项全能（MSD）数据集和私人临床数据集上进行了训练和验证。实验结果表明，与最先进的（SOTA）方法相比，我们的方法在胰腺和肿瘤分割方面都取得了具有竞争力的性能，在MSD数据集上，胰腺的Dice得分为82.11%，肿瘤的Dice得分为55.56%。所提出的框架提供了有效的解决方案，利用解剖先验和增强表征学习来改进胰腺肿瘤分割。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Neurocomputing 工程技术-计算机：人工智能

CiteScore

13.10

自引率

10.00%

发文量

1382

审稿时长

70 days

期刊介绍： Neurocomputing publishes articles describing recent fundamental contributions in the field of neurocomputing. Neurocomputing theory, practice and applications are the essential topics being covered.