Malik Abdul Manan , Jinchao Feng , Shahzad Ahmed , Abdul Raheem
{"title":"Enhancing colorectal polyp segmentation with TCFMA-Net: A transformer-based cross feature and multi-attention network","authors":"Malik Abdul Manan , Jinchao Feng , Shahzad Ahmed , Abdul Raheem","doi":"10.1016/j.artmed.2025.103167","DOIUrl":null,"url":null,"abstract":"<div><div>To enhance polyp segmentation in colonoscopy images for early detection and diagnosis of colorectal cancer. The study proposed the Transformer-based cross feature multi-attention network (TCFMA-Net) for polyp segmentation by addressing challenges such as varying polyp sizes and the problem of accurate boundaries. TCFMA-Net utilizes swin transformer-based encoders, a cross-feature enhancer network with multiple cross-feature enhancer blocks, and multi-attention modules integrated within and outside the decoder blocks. This enables comprehensive cross-feature fusion, preserving image clarity and facilitating the flow of information, allowing efficient processing of both low-level and high-level features. TCFMA-Net effectively captures the complexities of polyp size variations and boundaries issues and consistently outperforms existing methods on six benchmark datasets with confidence interval (CI), achieving a Dice score of 92.74 ± 0.10, (CI: 91.92, 94.04), 91.46 ± 0.14 (CI: 91.12, 92.72), and 87.34 ± 0.13, (CI: 86.19, 88.10) on the CVC-ClinicDB, Kvasir-SEG and BKAI-IGH datasets respectively, demonstrating its robustness in diverse polyp segmentation tasks. Generalizability tests also yielded Dice scores of 89.51 ± 0.10, (CI: 88.67, 89.71), 72.91 ± 0.09, (CI: 71.39, 74.14), and 65.83 ± 0.22, (CI: 65.47, 66.52) on the CVC-300, CVC-ColonDB, and Polypgen databases respectively. TCFMA-Net demonstrates superior performance in segmenting polyps across datasets, effectively handling variations in polyp characteristics and demonstrating robust generalization capabilities. This study presents a significant advancement in polyp segmentation methods, offering an accurate and reliable tool for colorectal cancer diagnosis.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"167 ","pages":"Article 103167"},"PeriodicalIF":6.2000,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0933365725001022","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
To enhance polyp segmentation in colonoscopy images for early detection and diagnosis of colorectal cancer. The study proposed the Transformer-based cross feature multi-attention network (TCFMA-Net) for polyp segmentation by addressing challenges such as varying polyp sizes and the problem of accurate boundaries. TCFMA-Net utilizes swin transformer-based encoders, a cross-feature enhancer network with multiple cross-feature enhancer blocks, and multi-attention modules integrated within and outside the decoder blocks. This enables comprehensive cross-feature fusion, preserving image clarity and facilitating the flow of information, allowing efficient processing of both low-level and high-level features. TCFMA-Net effectively captures the complexities of polyp size variations and boundaries issues and consistently outperforms existing methods on six benchmark datasets with confidence interval (CI), achieving a Dice score of 92.74 ± 0.10, (CI: 91.92, 94.04), 91.46 ± 0.14 (CI: 91.12, 92.72), and 87.34 ± 0.13, (CI: 86.19, 88.10) on the CVC-ClinicDB, Kvasir-SEG and BKAI-IGH datasets respectively, demonstrating its robustness in diverse polyp segmentation tasks. Generalizability tests also yielded Dice scores of 89.51 ± 0.10, (CI: 88.67, 89.71), 72.91 ± 0.09, (CI: 71.39, 74.14), and 65.83 ± 0.22, (CI: 65.47, 66.52) on the CVC-300, CVC-ColonDB, and Polypgen databases respectively. TCFMA-Net demonstrates superior performance in segmenting polyps across datasets, effectively handling variations in polyp characteristics and demonstrating robust generalization capabilities. This study presents a significant advancement in polyp segmentation methods, offering an accurate and reliable tool for colorectal cancer diagnosis.
期刊介绍:
Artificial Intelligence in Medicine publishes original articles from a wide variety of interdisciplinary perspectives concerning the theory and practice of artificial intelligence (AI) in medicine, medically-oriented human biology, and health care.
Artificial intelligence in medicine may be characterized as the scientific discipline pertaining to research studies, projects, and applications that aim at supporting decision-based medical tasks through knowledge- and/or data-intensive computer-based solutions that ultimately support and improve the performance of a human care provider.