基于DenseNet通道空间和语义引导注意力的生物医学图像分割

IF 7.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Knowledge-Based Systems Pub Date : 2025-03-01 DOI:10.1016/j.knosys.2025.113233

Tahir Hussain , Hayaru Shouno , Mazin Abed Mohammad , Haydar Abdulameer Marhoon , Taukir Alam

{"title":"基于DenseNet通道空间和语义引导注意力的生物医学图像分割","authors":"Tahir Hussain , Hayaru Shouno , Mazin Abed Mohammad , Haydar Abdulameer Marhoon , Taukir Alam","doi":"10.1016/j.knosys.2025.113233","DOIUrl":null,"url":null,"abstract":"<div><div>Convolutional neural networks have progressed significantly in the field of biomedical image segmentation, although precision remains a challenge. The inconsistent sizes and shapes of the lesion regions make it difficult for the existing deep learning methods to extract their discriminatory features. Additionally, spatial and semantic information is not effectively merged during decoding, resulting in redundant information and semantic gaps. To address these challenges, we propose the Dense Channel Spatial Semantic Guidance Attention UNet (DCSSGA-UNet) architecture, which integrates DenseNet201 as the base encoder and attention mechanisms to enhance segmentation performance. The decoder follows the standard U-Net pipeline, with the encoder capturing global multi-scale features through dense convolutional and transition blocks, which enhance the model’s ability to distinguish between intricate details. The introduction of the channel spatial attention (CSA) and semantic guidance attention (SGA) modules selectively focuses on important features and reduces redundancy, effectively bridging semantic gaps. Tests conducted on three medical image datasets (CVC-ClinicDB, CVC-ColonDB, and Kvasir-SEG) showed that our proposed DCSSGA-UNet model could detect object variabilities with improved results and outperformed other comparable methods. It achieved the mean intersection-over-union (mIoU) scores of 95.67%, 92.39%, and 93.97%, as well as mean dice coefficient (mDice) of 98.85%, 95.71%, and 96.10%, respectively. These results highlight the model’s superior precision and exceptional versatility, making it a valuable tool for clinical applications, particularly for accurate lesion segmentation and assisting in the diagnosis and treatment of diseases like colorectal cancer.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"314 ","pages":"Article 113233"},"PeriodicalIF":7.6000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DCSSGA-UNet: Biomedical image segmentation with DenseNet channel spatial and Semantic Guidance Attention\",\"authors\":\"Tahir Hussain , Hayaru Shouno , Mazin Abed Mohammad , Haydar Abdulameer Marhoon , Taukir Alam\",\"doi\":\"10.1016/j.knosys.2025.113233\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Convolutional neural networks have progressed significantly in the field of biomedical image segmentation, although precision remains a challenge. The inconsistent sizes and shapes of the lesion regions make it difficult for the existing deep learning methods to extract their discriminatory features. Additionally, spatial and semantic information is not effectively merged during decoding, resulting in redundant information and semantic gaps. To address these challenges, we propose the Dense Channel Spatial Semantic Guidance Attention UNet (DCSSGA-UNet) architecture, which integrates DenseNet201 as the base encoder and attention mechanisms to enhance segmentation performance. The decoder follows the standard U-Net pipeline, with the encoder capturing global multi-scale features through dense convolutional and transition blocks, which enhance the model’s ability to distinguish between intricate details. The introduction of the channel spatial attention (CSA) and semantic guidance attention (SGA) modules selectively focuses on important features and reduces redundancy, effectively bridging semantic gaps. Tests conducted on three medical image datasets (CVC-ClinicDB, CVC-ColonDB, and Kvasir-SEG) showed that our proposed DCSSGA-UNet model could detect object variabilities with improved results and outperformed other comparable methods. It achieved the mean intersection-over-union (mIoU) scores of 95.67%, 92.39%, and 93.97%, as well as mean dice coefficient (mDice) of 98.85%, 95.71%, and 96.10%, respectively. These results highlight the model’s superior precision and exceptional versatility, making it a valuable tool for clinical applications, particularly for accurate lesion segmentation and assisting in the diagnosis and treatment of diseases like colorectal cancer.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"314 \",\"pages\":\"Article 113233\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125002801\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125002801","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

卷积神经网络在生物医学图像分割领域取得了重大进展，但精度仍然是一个挑战。损伤区域的大小和形状不一致，使得现有的深度学习方法难以提取其区别特征。此外，在解码过程中，空间信息和语义信息没有有效融合，导致信息冗余和语义空白。为了解决这些问题，我们提出了密集信道空间语义引导注意UNet （DCSSGA-UNet）架构，该架构集成了DenseNet201作为基本编码器和注意机制，以提高分割性能。解码器遵循标准的U-Net管道，编码器通过密集的卷积和过渡块捕获全局多尺度特征，这增强了模型区分复杂细节的能力。信道空间注意（CSA）和语义引导注意（SGA）模块的引入选择性地关注重要特征，减少冗余，有效地弥合语义差距。在三个医学图像数据集（CVC-ClinicDB、CVC-ColonDB和Kvasir-SEG）上进行的测试表明，我们提出的DCSSGA-UNet模型可以更好地检测目标变量，并且优于其他可比方法。平均交叉超并度（mIoU）得分分别为95.67%、92.39%和93.97%，平均骰子系数（mdevice）分别为98.85%、95.71%和96.10%。这些结果突出了该模型的卓越精度和卓越的多功能性，使其成为临床应用的宝贵工具，特别是用于准确的病变分割和协助诊断和治疗结直肠癌等疾病。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

DCSSGA-UNet: Biomedical image segmentation with DenseNet channel spatial and Semantic Guidance Attention

Convolutional neural networks have progressed significantly in the field of biomedical image segmentation, although precision remains a challenge. The inconsistent sizes and shapes of the lesion regions make it difficult for the existing deep learning methods to extract their discriminatory features. Additionally, spatial and semantic information is not effectively merged during decoding, resulting in redundant information and semantic gaps. To address these challenges, we propose the Dense Channel Spatial Semantic Guidance Attention UNet (DCSSGA-UNet) architecture, which integrates DenseNet201 as the base encoder and attention mechanisms to enhance segmentation performance. The decoder follows the standard U-Net pipeline, with the encoder capturing global multi-scale features through dense convolutional and transition blocks, which enhance the model’s ability to distinguish between intricate details. The introduction of the channel spatial attention (CSA) and semantic guidance attention (SGA) modules selectively focuses on important features and reduces redundancy, effectively bridging semantic gaps. Tests conducted on three medical image datasets (CVC-ClinicDB, CVC-ColonDB, and Kvasir-SEG) showed that our proposed DCSSGA-UNet model could detect object variabilities with improved results and outperformed other comparable methods. It achieved the mean intersection-over-union (mIoU) scores of 95.67%, 92.39%, and 93.97%, as well as mean dice coefficient (mDice) of 98.85%, 95.71%, and 96.10%, respectively. These results highlight the model’s superior precision and exceptional versatility, making it a valuable tool for clinical applications, particularly for accurate lesion segmentation and assisting in the diagnosis and treatment of diseases like colorectal cancer.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Knowledge-Based Systems 工程技术-计算机：人工智能

CiteScore

14.80

自引率

12.50%

发文量

1245

审稿时长

7.8 months

期刊介绍： Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.