ADPNet：用于结肠镜检查中自动息肉分割的注意驱动双路径网络

IF 4.2 3区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Image and Vision Computing Pub Date : 2025-07-14 DOI:10.1016/j.imavis.2025.105648

Mukhtiar Khan , Inam Ullah , Nadeem Khan , Sumaira Hussain , Muhammad ILyas Khattak

{"title":"ADPNet：用于结肠镜检查中自动息肉分割的注意驱动双路径网络","authors":"Mukhtiar Khan , Inam Ullah , Nadeem Khan , Sumaira Hussain , Muhammad ILyas Khattak","doi":"10.1016/j.imavis.2025.105648","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate automated polyp segmentation in colonoscopy images is crucial for early colorectal cancer detection and treatment, a major global health concern. Effective segmentation aids clinical decision-making and surgical planning. Leveraging advancements in deep learning, we propose an Attention-Driven Dual-Path Network (ADPNet) for precise polyp segmentation. ADPNet features a novel architecture with a specialized bridge integrating the Atrous Self-Attention Pyramid Module (ASAPM) and Dilated Convolution-Transformer Module (DCTM) between the encoder and decoder, enabling efficient feature extraction, long-range dependency capture, and enriched semantic representation. The decoder employs pixel shuffle, gated attention mechanisms, and residual blocks to enhance contextual and spatial feature refinement, ensuring precise boundary delineation and noise suppression. Comprehensive evaluations on public polyp datasets show ADPNet outperforms state-of-the-art models, demonstrating superior accuracy and robustness, particularly in challenging scenarios such as small or concealed polyps. ADPNet offers a robust solution for automated polyp segmentation, with potential to revolutionize early colorectal cancer detection and improve clinical outcomes. The code and results of this article are publicly available at https://github.com/Mkhan143/ADPNet.</div></div>","PeriodicalId":50374,"journal":{"name":"Image and Vision Computing","volume":"162 ","pages":"Article 105648"},"PeriodicalIF":4.2000,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ADPNet: Attention-Driven Dual-Path Network for automated polyp segmentation in colonoscopy\",\"authors\":\"Mukhtiar Khan , Inam Ullah , Nadeem Khan , Sumaira Hussain , Muhammad ILyas Khattak\",\"doi\":\"10.1016/j.imavis.2025.105648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate automated polyp segmentation in colonoscopy images is crucial for early colorectal cancer detection and treatment, a major global health concern. Effective segmentation aids clinical decision-making and surgical planning. Leveraging advancements in deep learning, we propose an Attention-Driven Dual-Path Network (ADPNet) for precise polyp segmentation. ADPNet features a novel architecture with a specialized bridge integrating the Atrous Self-Attention Pyramid Module (ASAPM) and Dilated Convolution-Transformer Module (DCTM) between the encoder and decoder, enabling efficient feature extraction, long-range dependency capture, and enriched semantic representation. The decoder employs pixel shuffle, gated attention mechanisms, and residual blocks to enhance contextual and spatial feature refinement, ensuring precise boundary delineation and noise suppression. Comprehensive evaluations on public polyp datasets show ADPNet outperforms state-of-the-art models, demonstrating superior accuracy and robustness, particularly in challenging scenarios such as small or concealed polyps. ADPNet offers a robust solution for automated polyp segmentation, with potential to revolutionize early colorectal cancer detection and improve clinical outcomes. The code and results of this article are publicly available at https://github.com/Mkhan143/ADPNet.</div></div>\",\"PeriodicalId\":50374,\"journal\":{\"name\":\"Image and Vision Computing\",\"volume\":\"162 \",\"pages\":\"Article 105648\"},\"PeriodicalIF\":4.2000,\"publicationDate\":\"2025-07-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Image and Vision Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0262885625002367\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Image and Vision Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0262885625002367","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

结肠镜图像中准确的自动息肉分割对于早期结直肠癌的检测和治疗至关重要，这是一个主要的全球健康问题。有效的分割有助于临床决策和手术计划。利用深度学习的进步，我们提出了一种注意力驱动的双路径网络（ADPNet），用于精确的息肉分割。ADPNet采用了一种新颖的架构，在编码器和解码器之间集成了Atrous自关注金字塔模块（ASAPM）和扩展卷积-变压器模块（DCTM），实现了高效的特征提取、远程依赖捕获和丰富的语义表示。解码器采用像素洗刷、门控注意机制和残差块来增强上下文和空间特征的细化，确保精确的边界描绘和噪声抑制。对公共息肉数据集的综合评估表明，ADPNet优于最先进的模型，显示出卓越的准确性和鲁棒性，特别是在具有挑战性的情况下，如小的或隐藏的息肉。ADPNet为自动息肉分割提供了强大的解决方案，有可能彻底改变早期结直肠癌的检测并改善临床结果。本文的代码和结果可在https://github.com/Mkhan143/ADPNet上公开获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

ADPNet: Attention-Driven Dual-Path Network for automated polyp segmentation in colonoscopy

Accurate automated polyp segmentation in colonoscopy images is crucial for early colorectal cancer detection and treatment, a major global health concern. Effective segmentation aids clinical decision-making and surgical planning. Leveraging advancements in deep learning, we propose an Attention-Driven Dual-Path Network (ADPNet) for precise polyp segmentation. ADPNet features a novel architecture with a specialized bridge integrating the Atrous Self-Attention Pyramid Module (ASAPM) and Dilated Convolution-Transformer Module (DCTM) between the encoder and decoder, enabling efficient feature extraction, long-range dependency capture, and enriched semantic representation. The decoder employs pixel shuffle, gated attention mechanisms, and residual blocks to enhance contextual and spatial feature refinement, ensuring precise boundary delineation and noise suppression. Comprehensive evaluations on public polyp datasets show ADPNet outperforms state-of-the-art models, demonstrating superior accuracy and robustness, particularly in challenging scenarios such as small or concealed polyps. ADPNet offers a robust solution for automated polyp segmentation, with potential to revolutionize early colorectal cancer detection and improve clinical outcomes. The code and results of this article are publicly available at https://github.com/Mkhan143/ADPNet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Image and Vision Computing 工程技术-工程：电子与电气

CiteScore

8.50

自引率

8.50%

发文量

143

审稿时长

7.8 months

期刊介绍： Image and Vision Computing has as a primary aim the provision of an effective medium of interchange for the results of high quality theoretical and applied research fundamental to all aspects of image interpretation and computer vision. The journal publishes work that proposes new image interpretation and computer vision methodology or addresses the application of such methods to real world scenes. It seeks to strengthen a deeper understanding in the discipline by encouraging the quantitative comparison and performance evaluation of the proposed methodology. The coverage includes: image interpretation, scene modelling, object recognition and tracking, shape analysis, monitoring and surveillance, active vision and robotic systems, SLAM, biologically-inspired computer vision, motion analysis, stereo vision, document image understanding, character and handwritten text recognition, face and gesture recognition, biometrics, vision-based human-computer interaction, human activity and behavior understanding, data fusion from multiple sensor inputs, image databases.