Yufei Gao , Shichao Zhang , Lei Shi , Guohua Zhao , Yucheng Shi
{"title":"Collaborative transformer U-shaped network for medical image segmentation","authors":"Yufei Gao , Shichao Zhang , Lei Shi , Guohua Zhao , Yucheng Shi","doi":"10.1016/j.asoc.2025.112841","DOIUrl":null,"url":null,"abstract":"<div><div>Recent advances in the Transformer have shown significant ability to understand the relationship between the lesion area and surrounding tissue, especially for medical image analysis. Existing medical image segmentation algorithms based on transformers often suffer from limited feature extraction granularity and overlook the semantic relationships between multi-scale features. To solve the above limitations, we propose CoTransUNet: a collaborative transformer U-shaped network, that effectively captures fine-grained features and long-range dependencies by performing context extraction between multiple scales. The designed Correlation Extraction (CE) module bridges the encoder and decoder to achieve effective interaction and information transfer. Specifically, a collaborative mechanism in the encoder is proposed that can efficiently exploit inductive bias to extract local fine-grained features of the image while having the ability to capture long-distance feature dependencies. Besides, the CE module focuses on deeply integrating contextual information of multi-scale features, which enriches feature representation by exploiting the intrinsic correlation between features at different scales. It can extract not only local and global features but also capture semantic information related to different multi-scale features simultaneously. Compared to TransUNet, CoTransUNet achieves a 4.91% improvement in DSC on the Synapse multi-organ segmentation dataset while using only a quarter of the parameters. The extensive experiments on three datasets, including skin lesion segmentation (ISIC2016, ISIC2017, ISIC2018) demonstrates that CoTransUNet achieves DSC scores of 92.18%, 85.59%, and 88.75%, respectively, and on Synapse multi organ segmentation achieves DSC score of 82.39% , which outperforms the baseline and other promising methods.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"173 ","pages":"Article 112841"},"PeriodicalIF":7.2000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625001528","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Recent advances in the Transformer have shown significant ability to understand the relationship between the lesion area and surrounding tissue, especially for medical image analysis. Existing medical image segmentation algorithms based on transformers often suffer from limited feature extraction granularity and overlook the semantic relationships between multi-scale features. To solve the above limitations, we propose CoTransUNet: a collaborative transformer U-shaped network, that effectively captures fine-grained features and long-range dependencies by performing context extraction between multiple scales. The designed Correlation Extraction (CE) module bridges the encoder and decoder to achieve effective interaction and information transfer. Specifically, a collaborative mechanism in the encoder is proposed that can efficiently exploit inductive bias to extract local fine-grained features of the image while having the ability to capture long-distance feature dependencies. Besides, the CE module focuses on deeply integrating contextual information of multi-scale features, which enriches feature representation by exploiting the intrinsic correlation between features at different scales. It can extract not only local and global features but also capture semantic information related to different multi-scale features simultaneously. Compared to TransUNet, CoTransUNet achieves a 4.91% improvement in DSC on the Synapse multi-organ segmentation dataset while using only a quarter of the parameters. The extensive experiments on three datasets, including skin lesion segmentation (ISIC2016, ISIC2017, ISIC2018) demonstrates that CoTransUNet achieves DSC scores of 92.18%, 85.59%, and 88.75%, respectively, and on Synapse multi organ segmentation achieves DSC score of 82.39% , which outperforms the baseline and other promising methods.
期刊介绍:
Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities.
Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.