{"title":"Foreground Background Difference Knowledge-Based Small Sample Target Segmentation for Image-Guided Radiation Therapy","authors":"Yuanzhi Cheng, Pengfei Zhang, Chang Liu, Changyong Guo, Shinichi Tamura","doi":"10.1002/ima.70075","DOIUrl":null,"url":null,"abstract":"<p>The aim of this paper is to exploit a small sample (data scarcity) target segmentation technique for image-guided radiation therapy. The technique is grounded on a prototype-based approach—widely used small sample segmentation method. In this paper, we propose a foreground–background difference knowledge learning framework to perform the small sample target segmentation task. Its main differences from the traditional prototype-based approaches and novel contributions may be enumerated in two aspects: (1) A subdivision strategy to generate multiple foreground–background prototypes for each class in the support images, and the generated prototype is used to build a collection of query foreground and background prototypes. (2) A cross-prototype attention module to learn the correlation and difference knowledge of inter-class prototypes and transfer the knowledge to the query prototype for iterative updates. The main advantage of our framework is that: (1) the intra-class prototype set can comprehensively reflect the class features, avoiding the high computational complexity caused by dense matching; and (2) knowledge of inter-class differences provides comprehensive foreground–background segmentation information, greatly supporting accurate segmentation of the query set. In the 5-shot SegRap dataset experiment, the proposed model achieved Dice coefficients of 82.23% in the same-domain setting and 81.01% in the cross-domain setting. Similarly, in the 5-shot HECKTOR2022 dataset experiment, it achieved 83.59% in the same-domain setting and 81.48% in the cross-domain setting. For the 5-shot BTCV and CHAOS datasets, the model attained Dice coefficients of 79.00% and 79.70%, respectively. These results demonstrate the model's accuracy, efficiency, and generalization. This study presents a significant advancement in medical image segmentation by introducing a prototype-based model that effectively addresses data scarcity. By leveraging intra- and inter-class attention mechanisms, the model ensures robust generalization and reliable performance across datasets, paving the way for efficient and precise clinical applications with minimal reliance on large annotated datasets.</p>","PeriodicalId":14027,"journal":{"name":"International Journal of Imaging Systems and Technology","volume":"35 2","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-03-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Imaging Systems and Technology","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ima.70075","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
The aim of this paper is to exploit a small sample (data scarcity) target segmentation technique for image-guided radiation therapy. The technique is grounded on a prototype-based approach—widely used small sample segmentation method. In this paper, we propose a foreground–background difference knowledge learning framework to perform the small sample target segmentation task. Its main differences from the traditional prototype-based approaches and novel contributions may be enumerated in two aspects: (1) A subdivision strategy to generate multiple foreground–background prototypes for each class in the support images, and the generated prototype is used to build a collection of query foreground and background prototypes. (2) A cross-prototype attention module to learn the correlation and difference knowledge of inter-class prototypes and transfer the knowledge to the query prototype for iterative updates. The main advantage of our framework is that: (1) the intra-class prototype set can comprehensively reflect the class features, avoiding the high computational complexity caused by dense matching; and (2) knowledge of inter-class differences provides comprehensive foreground–background segmentation information, greatly supporting accurate segmentation of the query set. In the 5-shot SegRap dataset experiment, the proposed model achieved Dice coefficients of 82.23% in the same-domain setting and 81.01% in the cross-domain setting. Similarly, in the 5-shot HECKTOR2022 dataset experiment, it achieved 83.59% in the same-domain setting and 81.48% in the cross-domain setting. For the 5-shot BTCV and CHAOS datasets, the model attained Dice coefficients of 79.00% and 79.70%, respectively. These results demonstrate the model's accuracy, efficiency, and generalization. This study presents a significant advancement in medical image segmentation by introducing a prototype-based model that effectively addresses data scarcity. By leveraging intra- and inter-class attention mechanisms, the model ensures robust generalization and reliable performance across datasets, paving the way for efficient and precise clinical applications with minimal reliance on large annotated datasets.
期刊介绍:
The International Journal of Imaging Systems and Technology (IMA) is a forum for the exchange of ideas and results relevant to imaging systems, including imaging physics and informatics. The journal covers all imaging modalities in humans and animals.
IMA accepts technically sound and scientifically rigorous research in the interdisciplinary field of imaging, including relevant algorithmic research and hardware and software development, and their applications relevant to medical research. The journal provides a platform to publish original research in structural and functional imaging.
The journal is also open to imaging studies of the human body and on animals that describe novel diagnostic imaging and analyses methods. Technical, theoretical, and clinical research in both normal and clinical populations is encouraged. Submissions describing methods, software, databases, replication studies as well as negative results are also considered.
The scope of the journal includes, but is not limited to, the following in the context of biomedical research:
Imaging and neuro-imaging modalities: structural MRI, functional MRI, PET, SPECT, CT, ultrasound, EEG, MEG, NIRS etc.;
Neuromodulation and brain stimulation techniques such as TMS and tDCS;
Software and hardware for imaging, especially related to human and animal health;
Image segmentation in normal and clinical populations;
Pattern analysis and classification using machine learning techniques;
Computational modeling and analysis;
Brain connectivity and connectomics;
Systems-level characterization of brain function;
Neural networks and neurorobotics;
Computer vision, based on human/animal physiology;
Brain-computer interface (BCI) technology;
Big data, databasing and data mining.