Step edge and edge loss enhanced Transformer for medical image segmentation

IF 1.7 4区 综合性期刊 Q2 MULTIDISCIPLINARY SCIENCES
Juan Zheng , Mei Tian , Meng Zhou , Jing Cai , Chanzi Liu , Tao Lin , Haibo Si
{"title":"Step edge and edge loss enhanced Transformer for medical image segmentation","authors":"Juan Zheng ,&nbsp;Mei Tian ,&nbsp;Meng Zhou ,&nbsp;Jing Cai ,&nbsp;Chanzi Liu ,&nbsp;Tao Lin ,&nbsp;Haibo Si","doi":"10.1016/j.jrras.2025.101667","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Knee osteoarthritis (KOA) is the most common degenerative joint disease, significantly impacting patients' quality of life. Currently, KOA assessments using ultrasound images rely on complex, time-consuming manual measurements. Developing automatic, accurate medical image segmentation methods for clinical diagnosis is crucial. However, due to challenges like blurred boundaries and irregular shapes in medical images, existing methods show poor edge accuracy and morphological differences.</div></div><div><h3>Methods</h3><div>To address these issues, we propose IELSEMT model. The model proposes a step edge module, incorporating it into both global and local branches during training. Additionally, after integrating the step edge module, the network's edge feature extraction capability is enhanced. During the network's backpropagation phase, we propose a weighted loss function that incorporates edge loss terms, specifically using the weighted sum of cross-entropy and mean absolute error as the network's loss function, reducing edge region errors in network segmentation results.</div></div><div><h3>Results</h3><div>Compared to the convolutional baseline, the IELSEMT model improves F1_Score by approximately 5 % and IoU by 15 %. Compared to the Med baseline model, IoU and F1 Score are about 4 % higher. Additionally, by incorporating edge loss terms during the backpropagation phase, the network training speed is enhanced. The IELSEMT model can quickly locate segmentation object edges and trains faster, achieving a convergence speed twice that of the MedT model.</div></div><div><h3>Conclusion</h3><div>The IELSEMT model proposed in this paper incorporates full-scale and edge detection concepts, introducing edge information into the decoding phase to achieve more accurate edge information acquisition, improving segmentation accuracy with clearer and more precise edges. Unlike other Transformer-based models, our proposed method does not require pre-training on large-scale datasets. Finally, we conducted experiments on knee joint ultrasound datasets, where IELSEMT demonstrated superior performance over both CNN and other related Transformer-based architectures.</div></div>","PeriodicalId":16920,"journal":{"name":"Journal of Radiation Research and Applied Sciences","volume":"18 3","pages":"Article 101667"},"PeriodicalIF":1.7000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Radiation Research and Applied Sciences","FirstCategoryId":"103","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1687850725003796","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Objective

Knee osteoarthritis (KOA) is the most common degenerative joint disease, significantly impacting patients' quality of life. Currently, KOA assessments using ultrasound images rely on complex, time-consuming manual measurements. Developing automatic, accurate medical image segmentation methods for clinical diagnosis is crucial. However, due to challenges like blurred boundaries and irregular shapes in medical images, existing methods show poor edge accuracy and morphological differences.

Methods

To address these issues, we propose IELSEMT model. The model proposes a step edge module, incorporating it into both global and local branches during training. Additionally, after integrating the step edge module, the network's edge feature extraction capability is enhanced. During the network's backpropagation phase, we propose a weighted loss function that incorporates edge loss terms, specifically using the weighted sum of cross-entropy and mean absolute error as the network's loss function, reducing edge region errors in network segmentation results.

Results

Compared to the convolutional baseline, the IELSEMT model improves F1_Score by approximately 5 % and IoU by 15 %. Compared to the Med baseline model, IoU and F1 Score are about 4 % higher. Additionally, by incorporating edge loss terms during the backpropagation phase, the network training speed is enhanced. The IELSEMT model can quickly locate segmentation object edges and trains faster, achieving a convergence speed twice that of the MedT model.

Conclusion

The IELSEMT model proposed in this paper incorporates full-scale and edge detection concepts, introducing edge information into the decoding phase to achieve more accurate edge information acquisition, improving segmentation accuracy with clearer and more precise edges. Unlike other Transformer-based models, our proposed method does not require pre-training on large-scale datasets. Finally, we conducted experiments on knee joint ultrasound datasets, where IELSEMT demonstrated superior performance over both CNN and other related Transformer-based architectures.
用于医学图像分割的阶梯边缘和边缘损失增强变压器
目的膝关节骨关节炎(KOA)是最常见的退行性关节疾病,严重影响患者的生活质量。目前,使用超声图像进行KOA评估依赖于复杂、耗时的人工测量。开发自动、准确的医学图像分割方法对临床诊断至关重要。然而,由于医学图像存在边界模糊、形状不规则等问题,现有方法的边缘精度较差,且存在形态学差异。为了解决这些问题,我们提出了IELSEMT模型。该模型提出了一个阶跃边缘模块,在训练时将其纳入全局分支和局部分支。此外,集成步进边缘模块后,增强了网络的边缘特征提取能力。在网络的反向传播阶段,我们提出了一种包含边缘损失项的加权损失函数,特别是使用交叉熵和平均绝对误差的加权和作为网络的损失函数,减少了网络分割结果中的边缘区域误差。结果与卷积基线相比,IELSEMT模型将F1_Score提高了约5%,IoU提高了15%。与Med基线模型相比,IoU和F1评分高出约4%。此外,通过在反向传播阶段引入边损失项,提高了网络的训练速度。IELSEMT模型可以快速定位分割对象的边缘,训练速度更快,收敛速度是MedT模型的两倍。本文提出的IELSEMT模型融合了全尺寸和边缘检测的概念,在解码阶段引入边缘信息,实现更准确的边缘信息获取,以更清晰、更精确的边缘提高分割精度。与其他基于transformer的模型不同,我们提出的方法不需要在大规模数据集上进行预训练。最后,我们在膝关节超声数据集上进行了实验,其中IELSEMT表现出优于CNN和其他相关的基于transformer的架构的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
5.90%
发文量
130
审稿时长
16 weeks
期刊介绍: Journal of Radiation Research and Applied Sciences provides a high quality medium for the publication of substantial, original and scientific and technological papers on the development and applications of nuclear, radiation and isotopes in biology, medicine, drugs, biochemistry, microbiology, agriculture, entomology, food technology, chemistry, physics, solid states, engineering, environmental and applied sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信