基于多尺度空间频率特征增强的肝脏肿瘤精确分割

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL
Jinlin Ma , Kai Zhang , Ziping Ma , Ke Lu
{"title":"基于多尺度空间频率特征增强的肝脏肿瘤精确分割","authors":"Jinlin Ma ,&nbsp;Kai Zhang ,&nbsp;Ziping Ma ,&nbsp;Ke Lu","doi":"10.1016/j.bspc.2025.108963","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate liver tumor segmentation is a crucial aspect for early diagnosis and surgery, but existing segmentation methods struggle with tumor heterogeneity, unclear boundaries, and small lesions due to limited multi-scale feature fusion and spatial perception. To alleviate these issues, we propose MSFFE-Net, a novel segmentation network that imposes a Multi-scale Spatial-Frequency Feature Enhancement mechanism, with the objective of unifying spatial and frequency domains to enrich feature representational power. Moreover, a Spatial-Frequency Domain Fusion (SFDF) module is incorporated to unify Fourier features with a dual-branch encoder, where standard convolutions and Residual Dilated Convolutions (RDC) are jointly employed to enable multi-scale feature extraction and to enhance edge perception. In addition, a Multi-scale Semantic Enhancement (MSE) module is introduced at the bottleneck to model global context, and CBAM attention is integrated into the skip connections to further optimize feature aggregation. Extensive experiments on the LiTS_2017 and 3Dircadb datasets further validate the effectiveness of the proposed method, achieving Dice coefficients of 98.12% and 97.24% for liver segmentation, and 89.61% and 92.87% for tumor segmentation, respectively. Compared with mainstream approaches such as nnU-Net and TransUNet, our model delivers Dice gains of 0.07%, 2.57%, and 1.00%, 1.83% on complex tumor datasets. In addition, the architecture maintains a favorable trade-off between accuracy and efficiency, with only 17.46 MB of parameters and an inference speed of 68.74 FPS. Ablation studies validate the model’s effectiveness in complex boundary and small target segmentation, advancing intelligent liver cancer diagnosis with potential for other organs tumor segmentation.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"113 ","pages":"Article 108963"},"PeriodicalIF":4.9000,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MSFFE-Net: Multi-scale Spatial-Frequency Feature Enhancement for accurate liver tumor segmentation\",\"authors\":\"Jinlin Ma ,&nbsp;Kai Zhang ,&nbsp;Ziping Ma ,&nbsp;Ke Lu\",\"doi\":\"10.1016/j.bspc.2025.108963\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate liver tumor segmentation is a crucial aspect for early diagnosis and surgery, but existing segmentation methods struggle with tumor heterogeneity, unclear boundaries, and small lesions due to limited multi-scale feature fusion and spatial perception. To alleviate these issues, we propose MSFFE-Net, a novel segmentation network that imposes a Multi-scale Spatial-Frequency Feature Enhancement mechanism, with the objective of unifying spatial and frequency domains to enrich feature representational power. Moreover, a Spatial-Frequency Domain Fusion (SFDF) module is incorporated to unify Fourier features with a dual-branch encoder, where standard convolutions and Residual Dilated Convolutions (RDC) are jointly employed to enable multi-scale feature extraction and to enhance edge perception. In addition, a Multi-scale Semantic Enhancement (MSE) module is introduced at the bottleneck to model global context, and CBAM attention is integrated into the skip connections to further optimize feature aggregation. Extensive experiments on the LiTS_2017 and 3Dircadb datasets further validate the effectiveness of the proposed method, achieving Dice coefficients of 98.12% and 97.24% for liver segmentation, and 89.61% and 92.87% for tumor segmentation, respectively. Compared with mainstream approaches such as nnU-Net and TransUNet, our model delivers Dice gains of 0.07%, 2.57%, and 1.00%, 1.83% on complex tumor datasets. In addition, the architecture maintains a favorable trade-off between accuracy and efficiency, with only 17.46 MB of parameters and an inference speed of 68.74 FPS. Ablation studies validate the model’s effectiveness in complex boundary and small target segmentation, advancing intelligent liver cancer diagnosis with potential for other organs tumor segmentation.</div></div>\",\"PeriodicalId\":55362,\"journal\":{\"name\":\"Biomedical Signal Processing and Control\",\"volume\":\"113 \",\"pages\":\"Article 108963\"},\"PeriodicalIF\":4.9000,\"publicationDate\":\"2025-10-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biomedical Signal Processing and Control\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1746809425014740\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809425014740","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

摘要

准确的肝脏肿瘤分割是早期诊断和手术的关键,但现有的分割方法存在肿瘤异质性、边界不清、病灶小等问题,多尺度特征融合和空间感知能力有限。为了解决这些问题,我们提出了一种新的分割网络MSFFE-Net,它施加了多尺度空间频率特征增强机制,目的是统一空间和频率域,以丰富特征的表征能力。此外,在双支路编码器中加入了空间-频域融合(SFDF)模块来统一傅里叶特征,其中联合使用标准卷积和残差扩展卷积(RDC)来实现多尺度特征提取并增强边缘感知。此外,在瓶颈处引入多尺度语义增强(Multi-scale Semantic Enhancement, MSE)模块对全局上下文进行建模,并将CBAM关注集成到跳跃连接中,进一步优化特征聚合。在LiTS_2017和3Dircadb数据集上的大量实验进一步验证了该方法的有效性,肝脏分割的Dice系数分别为98.12%和97.24%,肿瘤分割的Dice系数分别为89.61%和92.87%。与nnU-Net和TransUNet等主流方法相比,我们的模型在复杂肿瘤数据集上的Dice增益分别为0.07%、2.57%和1.00%、1.83%。此外,该架构在精度和效率之间保持了良好的平衡,参数仅为17.46 MB,推理速度为68.74 FPS。消融实验验证了该模型在复杂边界和小靶点分割中的有效性,推进肝癌智能诊断,对其他脏器肿瘤分割具有潜在的应用前景。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MSFFE-Net: Multi-scale Spatial-Frequency Feature Enhancement for accurate liver tumor segmentation
Accurate liver tumor segmentation is a crucial aspect for early diagnosis and surgery, but existing segmentation methods struggle with tumor heterogeneity, unclear boundaries, and small lesions due to limited multi-scale feature fusion and spatial perception. To alleviate these issues, we propose MSFFE-Net, a novel segmentation network that imposes a Multi-scale Spatial-Frequency Feature Enhancement mechanism, with the objective of unifying spatial and frequency domains to enrich feature representational power. Moreover, a Spatial-Frequency Domain Fusion (SFDF) module is incorporated to unify Fourier features with a dual-branch encoder, where standard convolutions and Residual Dilated Convolutions (RDC) are jointly employed to enable multi-scale feature extraction and to enhance edge perception. In addition, a Multi-scale Semantic Enhancement (MSE) module is introduced at the bottleneck to model global context, and CBAM attention is integrated into the skip connections to further optimize feature aggregation. Extensive experiments on the LiTS_2017 and 3Dircadb datasets further validate the effectiveness of the proposed method, achieving Dice coefficients of 98.12% and 97.24% for liver segmentation, and 89.61% and 92.87% for tumor segmentation, respectively. Compared with mainstream approaches such as nnU-Net and TransUNet, our model delivers Dice gains of 0.07%, 2.57%, and 1.00%, 1.83% on complex tumor datasets. In addition, the architecture maintains a favorable trade-off between accuracy and efficiency, with only 17.46 MB of parameters and an inference speed of 68.74 FPS. Ablation studies validate the model’s effectiveness in complex boundary and small target segmentation, advancing intelligent liver cancer diagnosis with potential for other organs tumor segmentation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信