XPolypNet: A U-Net-Based Model for Semantic Segmentation of Gastrointestinal Polyps With Explainable AI

IEEE Open Journal of the Computer Society Pub Date: 2025-07-23 DOI: 10.1109/OJCS.2025.3592204

Arjun Kumar Bose Arnob;Muhammad Mostafa Monowar;Md. Abdul Hamid;M. F. Mridha

Volume 6, pages 1283-1293. Article page: https://ieeexplore.ieee.org/document/11095343/

Citations: 0

Abstract


XPolypNet: A U-Net-Based Model for Semantic Segmentation of Gastrointestinal Polyps With Explainable AI

Automated segmentation of gastrointestinal polyps is a critical step in the early detection and prevention of colorectal cancer (CRC), one of the most common causes of cancer-related death worldwide. This article presents a U-Net-based model enhanced with attention mechanisms and Atrous Spatial Pyramid Pooling (ASPP) for accurate polyp segmentation. To address varying polyp sizes, indistinct boundaries, and complex textures, the model uses a combined loss function (Binary Cross-Entropy and Dice loss). Gradient-Weighted Class Activation Mapping (Grad-CAM) is integrated to provide visual explanations of the model's decisions, improving interpretability and the trust of clinical practitioners. The model was evaluated on five benchmark datasets, achieving a Dice coefficient of 0.8378 and a mean Intersection over Union (mIoU) of 0.8427. In the comparative analysis it outperformed contemporary state-of-the-art approaches, with precision and accuracy of 97%. Qualitative analyses also show that it delineates polyps accurately, even in difficult cases. Despite this performance, the model still misclassifies some boundary regions and is less effective on datasets with high variability. Future work will focus on domain adaptation and the integration of additional modalities to improve generalizability. The study is a step toward automated polyp detection and demonstrates the potential of explainable artificial intelligence (XAI) to improve diagnostic accuracy and patient care.
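
The abstract names a combined Binary Cross-Entropy and Dice loss and reports Dice and IoU scores, but gives no formulas. The following is a minimal NumPy sketch of how such quantities are commonly computed for binary masks, not the authors' implementation. The function names, the equal weighting of the two loss terms, and the smoothing constants are all assumptions made for illustration.

```python
import numpy as np


def dice_coefficient(pred_mask, true_mask, eps=1e-7):
    """Dice = 2|A ∩ B| / (|A| + |B|) for two binary masks."""
    pred = np.asarray(pred_mask, dtype=bool)
    true = np.asarray(true_mask, dtype=bool)
    intersection = np.logical_and(pred, true).sum()
    # eps (assumed value) avoids 0/0 when both masks are empty.
    return (2.0 * intersection + eps) / (pred.sum() + true.sum() + eps)


def iou(pred_mask, true_mask, eps=1e-7):
    """IoU = |A ∩ B| / |A ∪ B| for two binary masks."""
    pred = np.asarray(pred_mask, dtype=bool)
    true = np.asarray(true_mask, dtype=bool)
    intersection = np.logical_and(pred, true).sum()
    union = np.logical_or(pred, true).sum()
    return (intersection + eps) / (union + eps)


def bce_dice_loss(probs, target, bce_weight=1.0, dice_weight=1.0, smooth=1.0):
    """Weighted sum of binary cross-entropy and soft Dice loss.

    probs:  predicted foreground probabilities in [0, 1]
    target: ground-truth mask with values 0 or 1
    The weights and the smoothing term are assumed; the abstract
    does not state them.
    """
    probs = np.clip(np.asarray(probs, dtype=np.float64), 1e-7, 1.0 - 1e-7)
    target = np.asarray(target, dtype=np.float64)

    # Pixel-wise binary cross-entropy, averaged over the mask.
    bce = -np.mean(target * np.log(probs) + (1.0 - target) * np.log(1.0 - probs))

    # Soft Dice uses probabilities directly so it stays differentiable.
    intersection = np.sum(probs * target)
    soft_dice = (2.0 * intersection + smooth) / (probs.sum() + target.sum() + smooth)

    return bce_weight * bce + dice_weight * (1.0 - soft_dice)
```

A mean IoU such as the one reported would then be an average of `iou` over images or classes; which averaging scheme the paper uses is not stated in the abstract.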


Source journal

IEEE Open Journal of the Computer Society

CiteScore

12.60

Self-citation rate

0.00%

Articles published