Arjun Kumar Bose Arnob;Muhammad Mostafa Monowar;Md. Abdul Hamid;M. F. Mridha
{"title":"XPolypNet:一种基于u - net的胃肠息肉语义分割模型","authors":"Arjun Kumar Bose Arnob;Muhammad Mostafa Monowar;Md. Abdul Hamid;M. F. Mridha","doi":"10.1109/OJCS.2025.3592204","DOIUrl":null,"url":null,"abstract":"Automated segmentation of gastrointestinal polyps is a critical step in the early detection and prevention of colorectal cancer (CRC), which is one of the most common causes of cancer-related deaths worldwide. This article presents a U-Net-based model enhanced with Attention Mechanisms and Atrous Spatial Pyramid Pooling (ASPP) for accurate polyp segmentation. To address the challenges of varying polyp sizes, indistinct boundaries, and complex textures, the model used a combined loss function (Binary Cross-Entropy and Dice Loss). Additionally, Gradient-Weighted Class Activation Mapping (Grad-CAM) was integrated to provide visual explanations of the model’s decisions to increase trust and interpretability by clinical practitioners. The presented model was evaluated on five benchmark datasets, achieving a Dice Coefficient of 0.8378 and a Mean Intersection over Union (mIoU) of 0.8427. The comparative analysis highlighted its superiority when compared to state-of-the-art contemporary approaches, with a precision and accuracy of 97%. Qualitative analyses also underline the ability to accurately delineate polyps, even in difficult situations. Although the model exhibited satisfactory performance, it still faced challenges regarding boundary misclassification and reduced efficacy in datasets with high variability. The next steps of this research will focus on domain adaptation and integration of additional modalities to enhance generalizability. This study provides a step toward automated polyp detection and demonstrates the potential of explainable artificial intelligence (XAI) to change the accuracy of diagnosis and healthcare for patients.","PeriodicalId":13205,"journal":{"name":"IEEE Open Journal of the Computer Society","volume":"6 ","pages":"1283-1293"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11095343","citationCount":"0","resultStr":"{\"title\":\"XPolypNet: A U-Net-Based Model for Semantic Segmentation of Gastrointestinal Polyps With Explainable AI\",\"authors\":\"Arjun Kumar Bose Arnob;Muhammad Mostafa Monowar;Md. Abdul Hamid;M. F. Mridha\",\"doi\":\"10.1109/OJCS.2025.3592204\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automated segmentation of gastrointestinal polyps is a critical step in the early detection and prevention of colorectal cancer (CRC), which is one of the most common causes of cancer-related deaths worldwide. This article presents a U-Net-based model enhanced with Attention Mechanisms and Atrous Spatial Pyramid Pooling (ASPP) for accurate polyp segmentation. To address the challenges of varying polyp sizes, indistinct boundaries, and complex textures, the model used a combined loss function (Binary Cross-Entropy and Dice Loss). Additionally, Gradient-Weighted Class Activation Mapping (Grad-CAM) was integrated to provide visual explanations of the model’s decisions to increase trust and interpretability by clinical practitioners. The presented model was evaluated on five benchmark datasets, achieving a Dice Coefficient of 0.8378 and a Mean Intersection over Union (mIoU) of 0.8427. The comparative analysis highlighted its superiority when compared to state-of-the-art contemporary approaches, with a precision and accuracy of 97%. Qualitative analyses also underline the ability to accurately delineate polyps, even in difficult situations. Although the model exhibited satisfactory performance, it still faced challenges regarding boundary misclassification and reduced efficacy in datasets with high variability. The next steps of this research will focus on domain adaptation and integration of additional modalities to enhance generalizability. This study provides a step toward automated polyp detection and demonstrates the potential of explainable artificial intelligence (XAI) to change the accuracy of diagnosis and healthcare for patients.\",\"PeriodicalId\":13205,\"journal\":{\"name\":\"IEEE Open Journal of the Computer Society\",\"volume\":\"6 \",\"pages\":\"1283-1293\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-07-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11095343\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Open Journal of the Computer Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11095343/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of the Computer Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11095343/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
XPolypNet: A U-Net-Based Model for Semantic Segmentation of Gastrointestinal Polyps With Explainable AI
Automated segmentation of gastrointestinal polyps is a critical step in the early detection and prevention of colorectal cancer (CRC), which is one of the most common causes of cancer-related deaths worldwide. This article presents a U-Net-based model enhanced with Attention Mechanisms and Atrous Spatial Pyramid Pooling (ASPP) for accurate polyp segmentation. To address the challenges of varying polyp sizes, indistinct boundaries, and complex textures, the model used a combined loss function (Binary Cross-Entropy and Dice Loss). Additionally, Gradient-Weighted Class Activation Mapping (Grad-CAM) was integrated to provide visual explanations of the model’s decisions to increase trust and interpretability by clinical practitioners. The presented model was evaluated on five benchmark datasets, achieving a Dice Coefficient of 0.8378 and a Mean Intersection over Union (mIoU) of 0.8427. The comparative analysis highlighted its superiority when compared to state-of-the-art contemporary approaches, with a precision and accuracy of 97%. Qualitative analyses also underline the ability to accurately delineate polyps, even in difficult situations. Although the model exhibited satisfactory performance, it still faced challenges regarding boundary misclassification and reduced efficacy in datasets with high variability. The next steps of this research will focus on domain adaptation and integration of additional modalities to enhance generalizability. This study provides a step toward automated polyp detection and demonstrates the potential of explainable artificial intelligence (XAI) to change the accuracy of diagnosis and healthcare for patients.