Effectiveness of encoder-decoder deep learning approach for colorectal polyp segmentation in colonoscopy images

IF 3.4 2区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Applied Intelligence Pub Date : 2025-01-10 DOI:10.1007/s10489-024-06167-6

Ameer Hamza, Muhammad Bilal, Muhammad Ramzan, Nadia Malik

{"title":"Effectiveness of encoder-decoder deep learning approach for colorectal polyp segmentation in colonoscopy images","authors":"Ameer Hamza, Muhammad Bilal, Muhammad Ramzan, Nadia Malik","doi":"10.1007/s10489-024-06167-6","DOIUrl":null,"url":null,"abstract":"<div><p>Colorectal cancer is considered one of the deadliest diseases, contributing to an alarming increase in annual deaths worldwide, with colorectal polyps recognized as precursors to this malignancy. Early and accurate detection of these polyps is crucial for reducing the mortality rate of colorectal cancer. However, the manual detection of polyps is a time-consuming process and requires the expertise of trained medical professionals. Moreover, it often misses polyps due to their varied size, color, and texture. Computer-aided diagnosis systems offer potential improvements, but they often struggle with precision in complex visual environments. This study presents an enhanced deep learning approach using encoder-decoder architecture for colorectal polyp segmentation to capture and utilize complex feature representations. Our approach introduces an enhanced dual attention mechanism, combining spatial and channel-wise attention to focus precisely on critical features. Channel-wise attention, implemented via an optimized Squeeze-and-Excitation (S&E) block, allows the network to capture comprehensive contextual information and interrelationships among different channels, ensuring a more refined feature selection process. The experimental results showed that the proposed model achieved a mean Intersection over Union (IoU) of 0.9054 and 0.9277, a dice coefficient of 0.9006 and 0.9128, a precision of 0.8985 and 0.9517, a recall of 0.9190 and 0.9094, and an accuracy of 0.9806 and 0.9907 on the Kvasir-SEG and CVC-ClinicDB datasets, respectively. Moreover, the proposed model outperforms the existing state-of-the-art resulting in improved patient outcomes with the potential to enhance the early detection of colorectal polyps.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 4","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-024-06167-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Colorectal cancer is considered one of the deadliest diseases, contributing to an alarming increase in annual deaths worldwide, with colorectal polyps recognized as precursors to this malignancy. Early and accurate detection of these polyps is crucial for reducing the mortality rate of colorectal cancer. However, the manual detection of polyps is a time-consuming process and requires the expertise of trained medical professionals. Moreover, it often misses polyps due to their varied size, color, and texture. Computer-aided diagnosis systems offer potential improvements, but they often struggle with precision in complex visual environments. This study presents an enhanced deep learning approach using encoder-decoder architecture for colorectal polyp segmentation to capture and utilize complex feature representations. Our approach introduces an enhanced dual attention mechanism, combining spatial and channel-wise attention to focus precisely on critical features. Channel-wise attention, implemented via an optimized Squeeze-and-Excitation (S&E) block, allows the network to capture comprehensive contextual information and interrelationships among different channels, ensuring a more refined feature selection process. The experimental results showed that the proposed model achieved a mean Intersection over Union (IoU) of 0.9054 and 0.9277, a dice coefficient of 0.9006 and 0.9128, a precision of 0.8985 and 0.9517, a recall of 0.9190 and 0.9094, and an accuracy of 0.9806 and 0.9907 on the Kvasir-SEG and CVC-ClinicDB datasets, respectively. Moreover, the proposed model outperforms the existing state-of-the-art resulting in improved patient outcomes with the potential to enhance the early detection of colorectal polyps.

查看原文本刊更多论文

编码器-解码器深度学习方法在结肠镜图像中结肠息肉分割中的有效性

结直肠癌被认为是最致命的疾病之一，导致全球每年死亡人数的惊人增长，结直肠癌息肉被认为是这种恶性肿瘤的前兆。早期准确发现这些息肉对于降低结直肠癌的死亡率至关重要。然而，人工检测息肉是一个耗时的过程，需要训练有素的医疗专业人员的专业知识。此外，由于息肉的大小、颜色和质地各异，它经常会遗漏息肉。计算机辅助诊断系统提供了潜在的改进，但它们在复杂的视觉环境中往往难以达到精度。本研究提出了一种增强的深度学习方法，使用编码器-解码器架构用于结肠直肠息肉分割，以捕获和利用复杂的特征表示。我们的方法引入了一种增强的双重注意机制，将空间和渠道的注意结合起来，精确地关注关键特征。通过优化的挤压和激励（S&；E）块实现的通道智能关注，允许网络捕获不同通道之间的综合上下文信息和相互关系，确保更精细的特征选择过程。实验结果表明，该模型在Kvasir-SEG和CVC-ClinicDB数据集上的平均IoU分别为0.9054和0.9277，骰子系数分别为0.9006和0.9128，精度分别为0.8985和0.9517，召回率分别为0.9190和0.9094，准确率分别为0.9806和0.9907。此外，所提出的模型优于现有的最先进的技术，从而改善了患者的预后，有可能提高结肠直肠息肉的早期发现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Applied Intelligence 工程技术-计算机：人工智能

CiteScore

6.60

自引率

20.80%

发文量

1361

审稿时长

5.9 months

期刊介绍： With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.