XPolypNet: A U-Net-Based Model for Semantic Segmentation of Gastrointestinal Polyps With Explainable AI

Arjun Kumar Bose Arnob;Muhammad Mostafa Monowar;Md. Abdul Hamid;M. F. Mridha
{"title":"XPolypNet: A U-Net-Based Model for Semantic Segmentation of Gastrointestinal Polyps With Explainable AI","authors":"Arjun Kumar Bose Arnob;Muhammad Mostafa Monowar;Md. Abdul Hamid;M. F. Mridha","doi":"10.1109/OJCS.2025.3592204","DOIUrl":null,"url":null,"abstract":"Automated segmentation of gastrointestinal polyps is a critical step in the early detection and prevention of colorectal cancer (CRC), which is one of the most common causes of cancer-related deaths worldwide. This article presents a U-Net-based model enhanced with Attention Mechanisms and Atrous Spatial Pyramid Pooling (ASPP) for accurate polyp segmentation. To address the challenges of varying polyp sizes, indistinct boundaries, and complex textures, the model used a combined loss function (Binary Cross-Entropy and Dice Loss). Additionally, Gradient-Weighted Class Activation Mapping (Grad-CAM) was integrated to provide visual explanations of the model’s decisions to increase trust and interpretability by clinical practitioners. The presented model was evaluated on five benchmark datasets, achieving a Dice Coefficient of 0.8378 and a Mean Intersection over Union (mIoU) of 0.8427. The comparative analysis highlighted its superiority when compared to state-of-the-art contemporary approaches, with a precision and accuracy of 97%. Qualitative analyses also underline the ability to accurately delineate polyps, even in difficult situations. Although the model exhibited satisfactory performance, it still faced challenges regarding boundary misclassification and reduced efficacy in datasets with high variability. The next steps of this research will focus on domain adaptation and integration of additional modalities to enhance generalizability. This study provides a step toward automated polyp detection and demonstrates the potential of explainable artificial intelligence (XAI) to change the accuracy of diagnosis and healthcare for patients.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"6 ","pages":"1283-1293"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11095343","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Computer Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11095343/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Automated segmentation of gastrointestinal polyps is a critical step in the early detection and prevention of colorectal cancer (CRC), which is one of the most common causes of cancer-related deaths worldwide. This article presents a U-Net-based model enhanced with Attention Mechanisms and Atrous Spatial Pyramid Pooling (ASPP) for accurate polyp segmentation. To address the challenges of varying polyp sizes, indistinct boundaries, and complex textures, the model used a combined loss function (Binary Cross-Entropy and Dice Loss). Additionally, Gradient-Weighted Class Activation Mapping (Grad-CAM) was integrated to provide visual explanations of the model’s decisions to increase trust and interpretability by clinical practitioners. The presented model was evaluated on five benchmark datasets, achieving a Dice Coefficient of 0.8378 and a Mean Intersection over Union (mIoU) of 0.8427. The comparative analysis highlighted its superiority when compared to state-of-the-art contemporary approaches, with a precision and accuracy of 97%. Qualitative analyses also underline the ability to accurately delineate polyps, even in difficult situations. Although the model exhibited satisfactory performance, it still faced challenges regarding boundary misclassification and reduced efficacy in datasets with high variability. The next steps of this research will focus on domain adaptation and integration of additional modalities to enhance generalizability. This study provides a step toward automated polyp detection and demonstrates the potential of explainable artificial intelligence (XAI) to change the accuracy of diagnosis and healthcare for patients.
XPolypNet:一种基于u - net的胃肠息肉语义分割模型
胃肠道息肉的自动分割是早期发现和预防结直肠癌(CRC)的关键步骤,结直肠癌是全球癌症相关死亡的最常见原因之一。本文提出了一种基于u - net的模型,增强了注意机制和空间金字塔池(ASPP),用于精确的息肉分割。为了解决不同息肉大小、模糊边界和复杂纹理的挑战,该模型使用了组合损失函数(二元交叉熵和骰子损失)。此外,集成了梯度加权类激活映射(Grad-CAM),以提供模型决策的可视化解释,以增加临床从业者的信任和可解释性。在5个基准数据集上对该模型进行了评估,得到Dice系数为0.8378,mIoU均值为0.8427。对比分析突出了其与当代最先进的方法相比的优势,精确度和准确度达到97%。定性分析也强调了准确描绘息肉的能力,即使在困难的情况下也是如此。尽管该模型表现出了令人满意的性能,但在高变异性数据集中仍然面临边界分类错误和有效性降低的挑战。本研究的下一步将集中在领域适应和其他模式的整合,以提高普遍性。这项研究向自动化息肉检测迈出了一步,并展示了可解释人工智能(XAI)在改变患者诊断和医疗保健准确性方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
12.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信