{"title":"BRMSA-Net: Disclosing concealed colorectal polyps in colonoscopy images via Boundary Recalibration and Multi-Scale Aggregation Network","authors":"Meriem Sebai , Evgin Goceri","doi":"10.1016/j.bspc.2025.108083","DOIUrl":null,"url":null,"abstract":"<div><div>For the early detection of colorectal cancer, automated polyp segmentation from colonoscopy images is very crucial. Polyps, often amorphous, exhibit a wide range of sizes and shapes. Furthermore, the boundaries separating polyps from the surrounding mucosal tissues are frequently not distinct. Consequently, segmenting polyps in practical settings is a particularly challenging task. In this study, we propose a Boundary Recalibration and Multi-Scale Aggregation Network (BRMSA-Net) for precise and consistent colon polyp segmentation. In particular, we first introduce the Multi-level Feature Aggregation (MFA) decoder to generate a global segmentation map by fusing high-level features for polyp localization. The MFA decoder incorporates the Criss-cross Feature Fusion (CFF) module, which effectively combines the cross-level features and addresses the issue of semantic gap across different levels. The segmentation map predicted by the MFA decoder is improved by the novel Incremental Boundary Refinement (IBR) module. The IBR module includes the Incremental Scale Fusion (ISF) module, which uses dilated convolutions to extract features with different receptive fields and strategically fuses them to reduce feature disparity. These features are then utilized by the Boundary Recalibration Cross-attention (BRC) module, which gradually refines the polyp boundaries by combining spatial and channel cross-attention. Extensive experiments on public polyp datasets show that BRMSA-Net outperforms several competing approaches in learning and generalization. It achieves 94.02% DSC and 89.46% IoU on CVC-ClinicDB, 92.26% DSC and 87.28% IoU on KvasirSEG, 82.45% DSC and 74.52% IoU on CVC-ColonDB, and 82.28% DSC and 73.81% IoU on ETIS-Larib. The code is available at: <span><span>https://github.com/Meriem-Sebai/BRMSA-Net</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"110 ","pages":"Article 108083"},"PeriodicalIF":4.9000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425005944","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
For the early detection of colorectal cancer, automated polyp segmentation from colonoscopy images is very crucial. Polyps, often amorphous, exhibit a wide range of sizes and shapes. Furthermore, the boundaries separating polyps from the surrounding mucosal tissues are frequently not distinct. Consequently, segmenting polyps in practical settings is a particularly challenging task. In this study, we propose a Boundary Recalibration and Multi-Scale Aggregation Network (BRMSA-Net) for precise and consistent colon polyp segmentation. In particular, we first introduce the Multi-level Feature Aggregation (MFA) decoder to generate a global segmentation map by fusing high-level features for polyp localization. The MFA decoder incorporates the Criss-cross Feature Fusion (CFF) module, which effectively combines the cross-level features and addresses the issue of semantic gap across different levels. The segmentation map predicted by the MFA decoder is improved by the novel Incremental Boundary Refinement (IBR) module. The IBR module includes the Incremental Scale Fusion (ISF) module, which uses dilated convolutions to extract features with different receptive fields and strategically fuses them to reduce feature disparity. These features are then utilized by the Boundary Recalibration Cross-attention (BRC) module, which gradually refines the polyp boundaries by combining spatial and channel cross-attention. Extensive experiments on public polyp datasets show that BRMSA-Net outperforms several competing approaches in learning and generalization. It achieves 94.02% DSC and 89.46% IoU on CVC-ClinicDB, 92.26% DSC and 87.28% IoU on KvasirSEG, 82.45% DSC and 74.52% IoU on CVC-ColonDB, and 82.28% DSC and 73.81% IoU on ETIS-Larib. The code is available at: https://github.com/Meriem-Sebai/BRMSA-Net.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.