Juan Zheng , Mei Tian , Meng Zhou , Jing Cai , Chanzi Liu , Tao Lin , Haibo Si
{"title":"用于医学图像分割的阶梯边缘和边缘损失增强变压器","authors":"Juan Zheng , Mei Tian , Meng Zhou , Jing Cai , Chanzi Liu , Tao Lin , Haibo Si","doi":"10.1016/j.jrras.2025.101667","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Knee osteoarthritis (KOA) is the most common degenerative joint disease, significantly impacting patients' quality of life. Currently, KOA assessments using ultrasound images rely on complex, time-consuming manual measurements. Developing automatic, accurate medical image segmentation methods for clinical diagnosis is crucial. However, due to challenges like blurred boundaries and irregular shapes in medical images, existing methods show poor edge accuracy and morphological differences.</div></div><div><h3>Methods</h3><div>To address these issues, we propose IELSEMT model. The model proposes a step edge module, incorporating it into both global and local branches during training. Additionally, after integrating the step edge module, the network's edge feature extraction capability is enhanced. During the network's backpropagation phase, we propose a weighted loss function that incorporates edge loss terms, specifically using the weighted sum of cross-entropy and mean absolute error as the network's loss function, reducing edge region errors in network segmentation results.</div></div><div><h3>Results</h3><div>Compared to the convolutional baseline, the IELSEMT model improves F1_Score by approximately 5 % and IoU by 15 %. Compared to the Med baseline model, IoU and F1 Score are about 4 % higher. Additionally, by incorporating edge loss terms during the backpropagation phase, the network training speed is enhanced. The IELSEMT model can quickly locate segmentation object edges and trains faster, achieving a convergence speed twice that of the MedT model.</div></div><div><h3>Conclusion</h3><div>The IELSEMT model proposed in this paper incorporates full-scale and edge detection concepts, introducing edge information into the decoding phase to achieve more accurate edge information acquisition, improving segmentation accuracy with clearer and more precise edges. Unlike other Transformer-based models, our proposed method does not require pre-training on large-scale datasets. Finally, we conducted experiments on knee joint ultrasound datasets, where IELSEMT demonstrated superior performance over both CNN and other related Transformer-based architectures.</div></div>","PeriodicalId":16920,"journal":{"name":"Journal of Radiation Research and Applied Sciences","volume":"18 3","pages":"Article 101667"},"PeriodicalIF":1.7000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Step edge and edge loss enhanced Transformer for medical image segmentation\",\"authors\":\"Juan Zheng , Mei Tian , Meng Zhou , Jing Cai , Chanzi Liu , Tao Lin , Haibo Si\",\"doi\":\"10.1016/j.jrras.2025.101667\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>Knee osteoarthritis (KOA) is the most common degenerative joint disease, significantly impacting patients' quality of life. Currently, KOA assessments using ultrasound images rely on complex, time-consuming manual measurements. Developing automatic, accurate medical image segmentation methods for clinical diagnosis is crucial. However, due to challenges like blurred boundaries and irregular shapes in medical images, existing methods show poor edge accuracy and morphological differences.</div></div><div><h3>Methods</h3><div>To address these issues, we propose IELSEMT model. The model proposes a step edge module, incorporating it into both global and local branches during training. Additionally, after integrating the step edge module, the network's edge feature extraction capability is enhanced. During the network's backpropagation phase, we propose a weighted loss function that incorporates edge loss terms, specifically using the weighted sum of cross-entropy and mean absolute error as the network's loss function, reducing edge region errors in network segmentation results.</div></div><div><h3>Results</h3><div>Compared to the convolutional baseline, the IELSEMT model improves F1_Score by approximately 5 % and IoU by 15 %. Compared to the Med baseline model, IoU and F1 Score are about 4 % higher. Additionally, by incorporating edge loss terms during the backpropagation phase, the network training speed is enhanced. The IELSEMT model can quickly locate segmentation object edges and trains faster, achieving a convergence speed twice that of the MedT model.</div></div><div><h3>Conclusion</h3><div>The IELSEMT model proposed in this paper incorporates full-scale and edge detection concepts, introducing edge information into the decoding phase to achieve more accurate edge information acquisition, improving segmentation accuracy with clearer and more precise edges. Unlike other Transformer-based models, our proposed method does not require pre-training on large-scale datasets. Finally, we conducted experiments on knee joint ultrasound datasets, where IELSEMT demonstrated superior performance over both CNN and other related Transformer-based architectures.</div></div>\",\"PeriodicalId\":16920,\"journal\":{\"name\":\"Journal of Radiation Research and Applied Sciences\",\"volume\":\"18 3\",\"pages\":\"Article 101667\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-06-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Radiation Research and Applied Sciences\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1687850725003796\",\"RegionNum\":4,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Radiation Research and Applied Sciences","FirstCategoryId":"103","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1687850725003796","RegionNum":4,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
Step edge and edge loss enhanced Transformer for medical image segmentation
Objective
Knee osteoarthritis (KOA) is the most common degenerative joint disease, significantly impacting patients' quality of life. Currently, KOA assessments using ultrasound images rely on complex, time-consuming manual measurements. Developing automatic, accurate medical image segmentation methods for clinical diagnosis is crucial. However, due to challenges like blurred boundaries and irregular shapes in medical images, existing methods show poor edge accuracy and morphological differences.
Methods
To address these issues, we propose IELSEMT model. The model proposes a step edge module, incorporating it into both global and local branches during training. Additionally, after integrating the step edge module, the network's edge feature extraction capability is enhanced. During the network's backpropagation phase, we propose a weighted loss function that incorporates edge loss terms, specifically using the weighted sum of cross-entropy and mean absolute error as the network's loss function, reducing edge region errors in network segmentation results.
Results
Compared to the convolutional baseline, the IELSEMT model improves F1_Score by approximately 5 % and IoU by 15 %. Compared to the Med baseline model, IoU and F1 Score are about 4 % higher. Additionally, by incorporating edge loss terms during the backpropagation phase, the network training speed is enhanced. The IELSEMT model can quickly locate segmentation object edges and trains faster, achieving a convergence speed twice that of the MedT model.
Conclusion
The IELSEMT model proposed in this paper incorporates full-scale and edge detection concepts, introducing edge information into the decoding phase to achieve more accurate edge information acquisition, improving segmentation accuracy with clearer and more precise edges. Unlike other Transformer-based models, our proposed method does not require pre-training on large-scale datasets. Finally, we conducted experiments on knee joint ultrasound datasets, where IELSEMT demonstrated superior performance over both CNN and other related Transformer-based architectures.
期刊介绍:
Journal of Radiation Research and Applied Sciences provides a high quality medium for the publication of substantial, original and scientific and technological papers on the development and applications of nuclear, radiation and isotopes in biology, medicine, drugs, biochemistry, microbiology, agriculture, entomology, food technology, chemistry, physics, solid states, engineering, environmental and applied sciences.