An adaptive recalibrative contextual squeeze-and-excitation self-attention V-Net for kidney tumor segmentation in RCC imaging

IF 2.9 3区物理与天体物理 Q2 PHYSICS, MULTIDISCIPLINARY

The European Physical Journal Plus Pub Date : 2025-08-14 DOI:10.1140/epjp/s13360-025-06686-2

C. Pabitha, S. Benila, B. Vanathi

{"title":"An adaptive recalibrative contextual squeeze-and-excitation self-attention V-Net for kidney tumor segmentation in RCC imaging","authors":"C. Pabitha, S. Benila, B. Vanathi","doi":"10.1140/epjp/s13360-025-06686-2","DOIUrl":null,"url":null,"abstract":"<div><p>Accurate and efficient kidney tumor segmentation in renal cell carcinoma (RCC) imaging is essential for early diagnosis and surgical intervention. However, existing models struggle with class imbalance, small tumor detection, boundary irregularities, and imaging variations across CT protocols, limiting their clinical applicability and generalization. To address these challenges, we propose an advanced segmentation framework called as Adaptive Recalibrative Contextual Squeeze-and-Excitation Self-Attention V-Net (ARCSAV-Net). The novel ARCSAV-Net combines various innovations in the traditional V-Net architecture to more effectively segment kidney tumors in RCC images. First, Adaptive Recalibrative Contextual Squeeze-and-Excitation (AR-CSE) Blocks enhance feature prioritization by utilizing radiomic biomarkers such as entropy and vascular features to reduce class imbalance and tumor heterogeneity. Second, the Self-Attention V-Net Mechanism enhances boundary definition by reducing redundant features and enhancing focus on low-contrast and small tumors to enhance segmentation accuracy. Third, Task-Switching Self-Supervision (TSSS) reinforces feature learning through alternating between primary segmentation and secondary tasks such as rotation and intensity prediction to mitigate overfitting and enhance model robustness. Second, Context-Based Confidence Estimation (CBCT) strengthens uncertain predictions to impose consistency on segmentation across varying imaging protocols. Lastly, Bayesian Hyperparameter Optimization (ML-TPE) adjusts model parameters with low computational overhead, reducing computational overhead while ensuring generalization. Experimental results on KiTS19 and KiTS21 datasets demonstrate that AR-CSE-SAV-Net achieves better segmentation performance, with a Dice Similarity Coefficient (DSC) of 0.985, Volumetric Overlap Error (VOE) of 0.16, and Mean Surface Distance (MSD) of 0.6 mm, significantly outperforming existing methods in accuracy and inference speed.</p></div>","PeriodicalId":792,"journal":{"name":"The European Physical Journal Plus","volume":"140 8","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The European Physical Journal Plus","FirstCategoryId":"4","ListUrlMain":"https://link.springer.com/article/10.1140/epjp/s13360-025-06686-2","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSICS, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate and efficient kidney tumor segmentation in renal cell carcinoma (RCC) imaging is essential for early diagnosis and surgical intervention. However, existing models struggle with class imbalance, small tumor detection, boundary irregularities, and imaging variations across CT protocols, limiting their clinical applicability and generalization. To address these challenges, we propose an advanced segmentation framework called as Adaptive Recalibrative Contextual Squeeze-and-Excitation Self-Attention V-Net (ARCSAV-Net). The novel ARCSAV-Net combines various innovations in the traditional V-Net architecture to more effectively segment kidney tumors in RCC images. First, Adaptive Recalibrative Contextual Squeeze-and-Excitation (AR-CSE) Blocks enhance feature prioritization by utilizing radiomic biomarkers such as entropy and vascular features to reduce class imbalance and tumor heterogeneity. Second, the Self-Attention V-Net Mechanism enhances boundary definition by reducing redundant features and enhancing focus on low-contrast and small tumors to enhance segmentation accuracy. Third, Task-Switching Self-Supervision (TSSS) reinforces feature learning through alternating between primary segmentation and secondary tasks such as rotation and intensity prediction to mitigate overfitting and enhance model robustness. Second, Context-Based Confidence Estimation (CBCT) strengthens uncertain predictions to impose consistency on segmentation across varying imaging protocols. Lastly, Bayesian Hyperparameter Optimization (ML-TPE) adjusts model parameters with low computational overhead, reducing computational overhead while ensuring generalization. Experimental results on KiTS19 and KiTS21 datasets demonstrate that AR-CSE-SAV-Net achieves better segmentation performance, with a Dice Similarity Coefficient (DSC) of 0.985, Volumetric Overlap Error (VOE) of 0.16, and Mean Surface Distance (MSD) of 0.6 mm, significantly outperforming existing methods in accuracy and inference speed.

Abstract Image

查看原文本刊更多论文

RCC影像中肾肿瘤分割的自适应再校准上下文挤压-激发自关注V-Net

在肾细胞癌（RCC）影像中准确、高效的肾肿瘤分割对早期诊断和手术治疗至关重要。然而，现有模型存在分类不平衡、小肿瘤检测、边界不规则和不同CT方案的成像差异等问题，限制了其临床适用性和推广。为了解决这些挑战，我们提出了一种先进的分割框架，称为自适应再校准上下文挤压和激励自注意V-Net （ARCSAV-Net）。新颖的ARCSAV-Net结合了传统V-Net架构的各种创新，以更有效地分割肾细胞癌图像中的肾肿瘤。首先，自适应再校准上下文挤压和激励（AR-CSE）块通过利用熵和血管特征等放射生物标志物来减少类别不平衡和肿瘤异质性，从而增强特征优先级。其次，自关注V-Net机制通过减少冗余特征和增强对低对比度和小肿瘤的关注来增强边界定义，从而提高分割精度。第三，任务切换自我监督（TSSS）通过交替进行主要分割和次要任务（如旋转和强度预测）来加强特征学习，以减轻过拟合并增强模型的鲁棒性。其次，基于上下文的置信度估计（CBCT）增强了不确定性预测，从而在不同的成像协议中对分割施加一致性。最后，贝叶斯超参数优化（ML-TPE）以较低的计算量调整模型参数，在保证泛化的同时减少了计算量。在KiTS19和KiTS21数据集上的实验结果表明，AR-CSE-SAV-Net具有更好的分割性能，其Dice Similarity Coefficient （DSC）为0.985,Volumetric Overlap Error （VOE）为0.16,Mean Surface Distance （MSD）为0.6 mm，在准确率和推理速度上明显优于现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The European Physical Journal Plus PHYSICS, MULTIDISCIPLINARY-

CiteScore

5.40

自引率

8.80%

发文量

1150

审稿时长

4-8 weeks

期刊介绍： The aims of this peer-reviewed online journal are to distribute and archive all relevant material required to document, assess, validate and reconstruct in detail the body of knowledge in the physical and related sciences. The scope of EPJ Plus encompasses a broad landscape of fields and disciplines in the physical and related sciences - such as covered by the topical EPJ journals and with the explicit addition of geophysics, astrophysics, general relativity and cosmology, mathematical and quantum physics, classical and fluid mechanics, accelerator and medical physics, as well as physics techniques applied to any other topics, including energy, environment and cultural heritage.