Foreground Background Difference Knowledge-Based Small Sample Target Segmentation for Image-Guided Radiation Therapy

IF 3 4区计算机科学 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

International Journal of Imaging Systems and Technology Pub Date : 2025-03-25 DOI:10.1002/ima.70075

Yuanzhi Cheng, Pengfei Zhang, Chang Liu, Changyong Guo, Shinichi Tamura

{"title":"Foreground Background Difference Knowledge-Based Small Sample Target Segmentation for Image-Guided Radiation Therapy","authors":"Yuanzhi Cheng, Pengfei Zhang, Chang Liu, Changyong Guo, Shinichi Tamura","doi":"10.1002/ima.70075","DOIUrl":null,"url":null,"abstract":"<p>The aim of this paper is to exploit a small sample (data scarcity) target segmentation technique for image-guided radiation therapy. The technique is grounded on a prototype-based approach—widely used small sample segmentation method. In this paper, we propose a foreground–background difference knowledge learning framework to perform the small sample target segmentation task. Its main differences from the traditional prototype-based approaches and novel contributions may be enumerated in two aspects: (1) A subdivision strategy to generate multiple foreground–background prototypes for each class in the support images, and the generated prototype is used to build a collection of query foreground and background prototypes. (2) A cross-prototype attention module to learn the correlation and difference knowledge of inter-class prototypes and transfer the knowledge to the query prototype for iterative updates. The main advantage of our framework is that: (1) the intra-class prototype set can comprehensively reflect the class features, avoiding the high computational complexity caused by dense matching; and (2) knowledge of inter-class differences provides comprehensive foreground–background segmentation information, greatly supporting accurate segmentation of the query set. In the 5-shot SegRap dataset experiment, the proposed model achieved Dice coefficients of 82.23% in the same-domain setting and 81.01% in the cross-domain setting. Similarly, in the 5-shot HECKTOR2022 dataset experiment, it achieved 83.59% in the same-domain setting and 81.48% in the cross-domain setting. For the 5-shot BTCV and CHAOS datasets, the model attained Dice coefficients of 79.00% and 79.70%, respectively. These results demonstrate the model's accuracy, efficiency, and generalization. This study presents a significant advancement in medical image segmentation by introducing a prototype-based model that effectively addresses data scarcity. By leveraging intra- and inter-class attention mechanisms, the model ensures robust generalization and reliable performance across datasets, paving the way for efficient and precise clinical applications with minimal reliance on large annotated datasets.</p>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 2","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70075","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

The aim of this paper is to exploit a small sample (data scarcity) target segmentation technique for image-guided radiation therapy. The technique is grounded on a prototype-based approach—widely used small sample segmentation method. In this paper, we propose a foreground–background difference knowledge learning framework to perform the small sample target segmentation task. Its main differences from the traditional prototype-based approaches and novel contributions may be enumerated in two aspects: (1) A subdivision strategy to generate multiple foreground–background prototypes for each class in the support images, and the generated prototype is used to build a collection of query foreground and background prototypes. (2) A cross-prototype attention module to learn the correlation and difference knowledge of inter-class prototypes and transfer the knowledge to the query prototype for iterative updates. The main advantage of our framework is that: (1) the intra-class prototype set can comprehensively reflect the class features, avoiding the high computational complexity caused by dense matching; and (2) knowledge of inter-class differences provides comprehensive foreground–background segmentation information, greatly supporting accurate segmentation of the query set. In the 5-shot SegRap dataset experiment, the proposed model achieved Dice coefficients of 82.23% in the same-domain setting and 81.01% in the cross-domain setting. Similarly, in the 5-shot HECKTOR2022 dataset experiment, it achieved 83.59% in the same-domain setting and 81.48% in the cross-domain setting. For the 5-shot BTCV and CHAOS datasets, the model attained Dice coefficients of 79.00% and 79.70%, respectively. These results demonstrate the model's accuracy, efficiency, and generalization. This study presents a significant advancement in medical image segmentation by introducing a prototype-based model that effectively addresses data scarcity. By leveraging intra- and inter-class attention mechanisms, the model ensures robust generalization and reliable performance across datasets, paving the way for efficient and precise clinical applications with minimal reliance on large annotated datasets.

Abstract Image

查看原文本刊更多论文

基于前景背景差异知识的图像引导放射治疗小样本目标分割

本文的目的是开发一种用于图像引导放射治疗的小样本（数据稀缺性）目标分割技术。该技术是基于一种基于原型的方法-广泛使用的小样本分割方法。在本文中，我们提出了一个前景背景差异知识学习框架来执行小样本目标分割任务。它与传统的基于原型的方法的主要区别和新贡献体现在两个方面：(1)采用细分策略，为支持图像中的每个类生成多个前景-背景原型，并使用生成的原型构建查询前景和背景原型集合。(2)跨原型关注模块，学习类间原型的相关性和差异性知识，并将这些知识传递给查询原型进行迭代更新。该框架的主要优点是：(1)类内原型集能够全面反映类的特征，避免了密集匹配带来的高计算复杂度；(2)类间差异的知识提供了全面的前背景分割信息，极大地支持了查询集的准确分割。在5次SegRap数据集实验中，该模型在同域设置下的Dice系数为82.23%，在跨域设置下的Dice系数为81.01%。同样，在5次的HECKTOR2022数据集实验中，它在同域设置下达到了83.59%，在跨域设置下达到了81.48%。对于5弹BTCV和CHAOS数据集，模型的Dice系数分别达到79.00%和79.70%。这些结果证明了该模型的准确性、效率和通用性。本研究通过引入一种有效解决数据稀缺性的基于原型的模型，在医学图像分割方面取得了重大进展。通过利用类内和类间的注意力机制，该模型确保了跨数据集的鲁棒泛化和可靠性能，为有效和精确的临床应用铺平了道路，同时减少了对大型注释数据集的依赖。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

International Journal of Imaging Systems and Technology 工程技术-成像科学与照相技术

CiteScore

6.90

自引率

6.10%

发文量

138

审稿时长

3 months

期刊介绍： The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals. IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging. The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered. The scope of the journal includes, but is not limited to, the following in the context of biomedical research: Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.; Neuromodulation and brain stimulation techniques such as TMS and tDCS; Software and hardware for imaging, especially related to human and animal health; Image segmentation in normal and clinical populations; Pattern analysis and classification using machine learning techniques; Computational modeling and analysis; Brain connectivity and connectomics; Systems-level characterization of brain function; Neural networks and neurorobotics; Computer vision, based on human/animal physiology; Brain-computer interface (BCI) technology; Big data, databasing and data mining.