Chenxing Xia , Hailong Chen , Bin Ge , Xiaolong Peng , Chaofan Liu , Zihan Jia , Shishui Bao
{"title":"PSFS-Net:基于层次上下文细化和频域解耦的动态频率-空间协同感知网络","authors":"Chenxing Xia , Hailong Chen , Bin Ge , Xiaolong Peng , Chaofan Liu , Zihan Jia , Shishui Bao","doi":"10.1016/j.bspc.2025.108920","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate polyp segmentation from colonoscopy images is pivotal for the early detection of colorectal cancer (CRC), significantly enhancing diagnostic efficiency and reliability in clinical practice. While recent methods have achieved notable progress, they often suffer from two critical limitations: (1) inadequate frequency and spatial feature representation, as most approaches remain biased toward spatial-domain learning and, even when incorporating frequency information, tend to overlook the hierarchical variability of frequency distributions across feature levels, resulting in suboptimal utilization of frequency cues; and (2) insufficient cross-level feature integration, limiting the ability to effectively capture both global semantics and fine-grained boundary details. To address these issues, we propose PSFS-Net, a novel dynamic frequency-spatial synergistic polyp segmentation framework that jointly leverages spatial and frequency domain information for hierarchical context refinement and cross-level fusion, which mainly includes Frequency-aware Cross-scale Fusion Module (FACFM), Dual-stream Global–Local Interaction Module (DGIM), and Dual Attention Cross-modulation Module (DCM). Specifically, FACFM is designed to extract frequency domain cues and adaptively decoupling high/low-frequency components from full-spectrum information by employs Discrete Fourier Transform and an adaptive Dynamic Gaussian Filters. DGIM is introduced to enable mutual refinement between high-level semantic representations and low-level spatial details through dedicated global and local processing branches. DCM is presented to further aggregate global contexts with local details via dual-attention mechanisms, alleviating semantic gaps. Extensive evaluations on five public polyp segmentation datasets demonstrate that PSFS-Net delivers competitive and excellent performances. Our code is available at <span><span>https://github.com/chljzh25/PSFS-Net</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"112 ","pages":"Article 108920"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PSFS-Net: Dynamic frequency-spatial synergistic perception network for polyp segmentation via hierarchical context refinement and frequency-domain decoupling\",\"authors\":\"Chenxing Xia , Hailong Chen , Bin Ge , Xiaolong Peng , Chaofan Liu , Zihan Jia , Shishui Bao\",\"doi\":\"10.1016/j.bspc.2025.108920\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate polyp segmentation from colonoscopy images is pivotal for the early detection of colorectal cancer (CRC), significantly enhancing diagnostic efficiency and reliability in clinical practice. While recent methods have achieved notable progress, they often suffer from two critical limitations: (1) inadequate frequency and spatial feature representation, as most approaches remain biased toward spatial-domain learning and, even when incorporating frequency information, tend to overlook the hierarchical variability of frequency distributions across feature levels, resulting in suboptimal utilization of frequency cues; and (2) insufficient cross-level feature integration, limiting the ability to effectively capture both global semantics and fine-grained boundary details. To address these issues, we propose PSFS-Net, a novel dynamic frequency-spatial synergistic polyp segmentation framework that jointly leverages spatial and frequency domain information for hierarchical context refinement and cross-level fusion, which mainly includes Frequency-aware Cross-scale Fusion Module (FACFM), Dual-stream Global–Local Interaction Module (DGIM), and Dual Attention Cross-modulation Module (DCM). Specifically, FACFM is designed to extract frequency domain cues and adaptively decoupling high/low-frequency components from full-spectrum information by employs Discrete Fourier Transform and an adaptive Dynamic Gaussian Filters. DGIM is introduced to enable mutual refinement between high-level semantic representations and low-level spatial details through dedicated global and local processing branches. DCM is presented to further aggregate global contexts with local details via dual-attention mechanisms, alleviating semantic gaps. Extensive evaluations on five public polyp segmentation datasets demonstrate that PSFS-Net delivers competitive and excellent performances. Our code is available at <span><span>https://github.com/chljzh25/PSFS-Net</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"112 \",\"pages\":\"Article 108920\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-10-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425014314\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425014314","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
PSFS-Net: Dynamic frequency-spatial synergistic perception network for polyp segmentation via hierarchical context refinement and frequency-domain decoupling
Accurate polyp segmentation from colonoscopy images is pivotal for the early detection of colorectal cancer (CRC), significantly enhancing diagnostic efficiency and reliability in clinical practice. While recent methods have achieved notable progress, they often suffer from two critical limitations: (1) inadequate frequency and spatial feature representation, as most approaches remain biased toward spatial-domain learning and, even when incorporating frequency information, tend to overlook the hierarchical variability of frequency distributions across feature levels, resulting in suboptimal utilization of frequency cues; and (2) insufficient cross-level feature integration, limiting the ability to effectively capture both global semantics and fine-grained boundary details. To address these issues, we propose PSFS-Net, a novel dynamic frequency-spatial synergistic polyp segmentation framework that jointly leverages spatial and frequency domain information for hierarchical context refinement and cross-level fusion, which mainly includes Frequency-aware Cross-scale Fusion Module (FACFM), Dual-stream Global–Local Interaction Module (DGIM), and Dual Attention Cross-modulation Module (DCM). Specifically, FACFM is designed to extract frequency domain cues and adaptively decoupling high/low-frequency components from full-spectrum information by employs Discrete Fourier Transform and an adaptive Dynamic Gaussian Filters. DGIM is introduced to enable mutual refinement between high-level semantic representations and low-level spatial details through dedicated global and local processing branches. DCM is presented to further aggregate global contexts with local details via dual-attention mechanisms, alleviating semantic gaps. Extensive evaluations on five public polyp segmentation datasets demonstrate that PSFS-Net delivers competitive and excellent performances. Our code is available at https://github.com/chljzh25/PSFS-Net.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.