PSFS-Net: Dynamic frequency-spatial synergistic perception network for polyp segmentation via hierarchical context refinement and frequency-domain decoupling

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL
Chenxing Xia , Hailong Chen , Bin Ge , Xiaolong Peng , Chaofan Liu , Zihan Jia , Shishui Bao
{"title":"PSFS-Net: Dynamic frequency-spatial synergistic perception network for polyp segmentation via hierarchical context refinement and frequency-domain decoupling","authors":"Chenxing Xia ,&nbsp;Hailong Chen ,&nbsp;Bin Ge ,&nbsp;Xiaolong Peng ,&nbsp;Chaofan Liu ,&nbsp;Zihan Jia ,&nbsp;Shishui Bao","doi":"10.1016/j.bspc.2025.108920","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate polyp segmentation from colonoscopy images is pivotal for the early detection of colorectal cancer (CRC), significantly enhancing diagnostic efficiency and reliability in clinical practice. While recent methods have achieved notable progress, they often suffer from two critical limitations: (1) inadequate frequency and spatial feature representation, as most approaches remain biased toward spatial-domain learning and, even when incorporating frequency information, tend to overlook the hierarchical variability of frequency distributions across feature levels, resulting in suboptimal utilization of frequency cues; and (2) insufficient cross-level feature integration, limiting the ability to effectively capture both global semantics and fine-grained boundary details. To address these issues, we propose PSFS-Net, a novel dynamic frequency-spatial synergistic polyp segmentation framework that jointly leverages spatial and frequency domain information for hierarchical context refinement and cross-level fusion, which mainly includes Frequency-aware Cross-scale Fusion Module (FACFM), Dual-stream Global–Local Interaction Module (DGIM), and Dual Attention Cross-modulation Module (DCM). Specifically, FACFM is designed to extract frequency domain cues and adaptively decoupling high/low-frequency components from full-spectrum information by employs Discrete Fourier Transform and an adaptive Dynamic Gaussian Filters. DGIM is introduced to enable mutual refinement between high-level semantic representations and low-level spatial details through dedicated global and local processing branches. DCM is presented to further aggregate global contexts with local details via dual-attention mechanisms, alleviating semantic gaps. Extensive evaluations on five public polyp segmentation datasets demonstrate that PSFS-Net delivers competitive and excellent performances. Our code is available at <span><span>https://github.com/chljzh25/PSFS-Net</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108920"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425014314","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate polyp segmentation from colonoscopy images is pivotal for the early detection of colorectal cancer (CRC), significantly enhancing diagnostic efficiency and reliability in clinical practice. While recent methods have achieved notable progress, they often suffer from two critical limitations: (1) inadequate frequency and spatial feature representation, as most approaches remain biased toward spatial-domain learning and, even when incorporating frequency information, tend to overlook the hierarchical variability of frequency distributions across feature levels, resulting in suboptimal utilization of frequency cues; and (2) insufficient cross-level feature integration, limiting the ability to effectively capture both global semantics and fine-grained boundary details. To address these issues, we propose PSFS-Net, a novel dynamic frequency-spatial synergistic polyp segmentation framework that jointly leverages spatial and frequency domain information for hierarchical context refinement and cross-level fusion, which mainly includes Frequency-aware Cross-scale Fusion Module (FACFM), Dual-stream Global–Local Interaction Module (DGIM), and Dual Attention Cross-modulation Module (DCM). Specifically, FACFM is designed to extract frequency domain cues and adaptively decoupling high/low-frequency components from full-spectrum information by employs Discrete Fourier Transform and an adaptive Dynamic Gaussian Filters. DGIM is introduced to enable mutual refinement between high-level semantic representations and low-level spatial details through dedicated global and local processing branches. DCM is presented to further aggregate global contexts with local details via dual-attention mechanisms, alleviating semantic gaps. Extensive evaluations on five public polyp segmentation datasets demonstrate that PSFS-Net delivers competitive and excellent performances. Our code is available at https://github.com/chljzh25/PSFS-Net.
PSFS-Net:基于层次上下文细化和频域解耦的动态频率-空间协同感知网络
从结肠镜图像中准确分割息肉对于早期发现结直肠癌(CRC)至关重要,可以显著提高临床诊断的效率和可靠性。虽然最近的方法取得了显著的进展,但它们经常受到两个关键限制的影响:(1)频率和空间特征表示不足,因为大多数方法仍然偏向于空域学习,即使在纳入频率信息时,也往往忽略了频率分布在特征水平上的分层可变性,导致频率线索的利用不理想;(2)跨层特征集成不足,限制了有效捕获全局语义和细粒度边界细节的能力。为了解决这些问题,我们提出了PSFS-Net,这是一个新的动态频率-空间协同的polyp分割框架,它共同利用空间和频域信息进行分层上下文细化和跨级别融合,主要包括频率感知跨尺度融合模块(FACFM)、双流全局-局部交互模块(DGIM)和双注意交叉调制模块(DCM)。具体来说,FACFM通过采用离散傅立叶变换和自适应动态高斯滤波器提取频域信号,并从全谱信息中自适应解耦高/低频分量。引入DGIM是为了通过专用的全局和本地处理分支实现高级语义表示和低级空间细节之间的相互细化。DCM通过双注意机制将全局上下文与局部细节进一步聚合,从而减小语义差距。对五个公共息肉分割数据集的广泛评估表明PSFS-Net具有竞争力和卓越的性能。我们的代码可在https://github.com/chljzh25/PSFS-Net上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信