利用扩散模型模拟软组织肉瘤临床靶体积描绘的解读器间变异性。

Medical physics Pub Date : 2025-05-03 DOI:10.1002/mp.17865

Yafei Dong, Thibault Marin, Yue Zhuo, Elie Najem, Arnaud Beddok, Laura Rozenblum, Maryam Moteabbed, Kira Grogg, Fangxu Xing, Jonghye Woo, Yen-Lin E Chen, Ruth Lim, Xiaofeng Liu, Chao Ma, Georges El Fakhri

{"title":"利用扩散模型模拟软组织肉瘤临床靶体积描绘的解读器间变异性。","authors":"Yafei Dong, Thibault Marin, Yue Zhuo, Elie Najem, Arnaud Beddok, Laura Rozenblum, Maryam Moteabbed, Kira Grogg, Fangxu Xing, Jonghye Woo, Yen-Lin E Chen, Ruth Lim, Xiaofeng Liu, Chao Ma, Georges El Fakhri","doi":"10.1002/mp.17865","DOIUrl":null,"url":null,"abstract":"Background: Accurate delineation of the clinical target volume (CTV) is essential in the radiotherapy treatment of soft tissue sarcomas. However, this process is subject to inter-reader variability due to the need for clinical assessment of risk and extent of potential microscopic spread. This can lead to inconsistencies in treatment planning, potentially impacting treatment outcomes. Most existing automatic CTV delineation methods do not account for this variability and can only generate a single CTV for each case.Purpose: This study aims to develop a deep learning-based technique to generate multiple CTV contours for each case, simulating the inter-reader variability in the clinical practice.Methods: We employed a publicly available dataset consisting of fluorodeoxyglucose positron emission tomography (FDG-PET), x-ray computed tomography (CT), and pre-contrast T1-weighted magnetic resonance imaging (MRI) scans from 51 patients with soft tissue sarcoma, along with an independent validation set containing five additional patients. An experienced reader drew a contour of the gross tumor volume (GTV) for each patient based on multi-modality images. Subsequently, two additional readers, together with the first one, were responsible for contouring three CTVs in total based on the GTV. We developed a diffusion model-based deep learning method that is capable of generating arbitrary number of different and plausible CTVs to mimic the inter-reader variability in CTV delineation. The proposed model incorporates a separate encoder to extract features from the GTV masks, leveraging the critical role of GTV information in accurate CTV delineation.Results: The proposed diffusion model demonstrated superior performance with the highest Dice Index (0.902 compared to values below 0.881 for state-of-the-art models) and the best generalized energy distance (GED) (0.209 compared to values exceeding 0.221 for state-of-the-art models). It also achieved the second-highest recall and precision metrics among the compared ambiguous image segmentation models. Results from both datasets exhibited consistent trends, reinforcing the reliability of our findings. Additionally, ablation studies exploring different model structures and input configurations highlighted the significance of incorporating prior GTV information for accurate CTV delineation.Conclusions: The proposed diffusion model successfully generates multiple plausible CTV contours for soft tissue sarcomas, effectively capturing inter-reader variability in CTV delineation.","PeriodicalId":94136,"journal":{"name":"Medical physics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Modeling inter-reader variability in clinical target volume delineation for soft tissue sarcomas using diffusion model.\",\"authors\":\"Yafei Dong, Thibault Marin, Yue Zhuo, Elie Najem, Arnaud Beddok, Laura Rozenblum, Maryam Moteabbed, Kira Grogg, Fangxu Xing, Jonghye Woo, Yen-Lin E Chen, Ruth Lim, Xiaofeng Liu, Chao Ma, Georges El Fakhri\",\"doi\":\"10.1002/mp.17865\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Accurate delineation of the clinical target volume (CTV) is essential in the radiotherapy treatment of soft tissue sarcomas. However, this process is subject to inter-reader variability due to the need for clinical assessment of risk and extent of potential microscopic spread. This can lead to inconsistencies in treatment planning, potentially impacting treatment outcomes. Most existing automatic CTV delineation methods do not account for this variability and can only generate a single CTV for each case.Purpose: This study aims to develop a deep learning-based technique to generate multiple CTV contours for each case, simulating the inter-reader variability in the clinical practice.Methods: We employed a publicly available dataset consisting of fluorodeoxyglucose positron emission tomography (FDG-PET), x-ray computed tomography (CT), and pre-contrast T1-weighted magnetic resonance imaging (MRI) scans from 51 patients with soft tissue sarcoma, along with an independent validation set containing five additional patients. An experienced reader drew a contour of the gross tumor volume (GTV) for each patient based on multi-modality images. Subsequently, two additional readers, together with the first one, were responsible for contouring three CTVs in total based on the GTV. We developed a diffusion model-based deep learning method that is capable of generating arbitrary number of different and plausible CTVs to mimic the inter-reader variability in CTV delineation. The proposed model incorporates a separate encoder to extract features from the GTV masks, leveraging the critical role of GTV information in accurate CTV delineation.Results: The proposed diffusion model demonstrated superior performance with the highest Dice Index (0.902 compared to values below 0.881 for state-of-the-art models) and the best generalized energy distance (GED) (0.209 compared to values exceeding 0.221 for state-of-the-art models). It also achieved the second-highest recall and precision metrics among the compared ambiguous image segmentation models. Results from both datasets exhibited consistent trends, reinforcing the reliability of our findings. Additionally, ablation studies exploring different model structures and input configurations highlighted the significance of incorporating prior GTV information for accurate CTV delineation.Conclusions: The proposed diffusion model successfully generates multiple plausible CTV contours for soft tissue sarcomas, effectively capturing inter-reader variability in CTV delineation.\",\"PeriodicalId\":94136,\"journal\":{\"name\":\"Medical physics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/mp.17865\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/mp.17865","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

背景：准确的临床靶体积（CTV）在软组织肉瘤的放射治疗中至关重要。然而，由于需要对潜在显微传播的风险和程度进行临床评估，这一过程受到读者之间的差异的影响。这可能导致治疗计划的不一致，潜在地影响治疗结果。大多数现有的自动CTV描述方法没有考虑到这种可变性，只能为每种情况生成单个CTV。目的：本研究旨在开发一种基于深度学习的技术，为每个病例生成多个CTV轮廓，模拟临床实践中读者间的变异性。方法：我们使用了一个公开可用的数据集，包括51例软组织肉瘤患者的氟脱氧葡萄糖正电子发射断层扫描（FDG-PET）、x射线计算机断层扫描（CT）和对比前t1加权磁共振成像（MRI）扫描，以及包含另外5例患者的独立验证集。一位经验丰富的读者根据多模态图像为每位患者绘制了总肿瘤体积（GTV）的轮廓。随后，另外两名读者与第一名读者一起，负责在GTV的基础上共绘制三个ctv。我们开发了一种基于扩散模型的深度学习方法，该方法能够生成任意数量的不同且可信的CTV，以模拟CTV描绘中的读者间可变性。该模型采用单独的编码器从GTV掩模中提取特征，充分利用GTV信息在准确描绘CTV中的关键作用。结果：所提出的扩散模型具有最高的Dice指数（0.902，低于现有模型的0.881）和最佳的广义能量距离（GED）（0.209，高于现有模型的0.221）。在比较的模糊图像分割模型中，它也达到了第二高的召回率和精度指标。两个数据集的结果显示出一致的趋势，加强了我们研究结果的可靠性。此外，探索不同模型结构和输入配置的消融研究强调了整合先验GTV信息对于准确描绘CTV的重要性。结论：所提出的扩散模型成功地为软组织肉瘤生成了多个似是而非的CTV轮廓，有效地捕获了CTV描绘中的读取器间变异性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Modeling inter-reader variability in clinical target volume delineation for soft tissue sarcomas using diffusion model.

Background: Accurate delineation of the clinical target volume (CTV) is essential in the radiotherapy treatment of soft tissue sarcomas. However, this process is subject to inter-reader variability due to the need for clinical assessment of risk and extent of potential microscopic spread. This can lead to inconsistencies in treatment planning, potentially impacting treatment outcomes. Most existing automatic CTV delineation methods do not account for this variability and can only generate a single CTV for each case.

Purpose: This study aims to develop a deep learning-based technique to generate multiple CTV contours for each case, simulating the inter-reader variability in the clinical practice.

Methods: We employed a publicly available dataset consisting of fluorodeoxyglucose positron emission tomography (FDG-PET), x-ray computed tomography (CT), and pre-contrast T1-weighted magnetic resonance imaging (MRI) scans from 51 patients with soft tissue sarcoma, along with an independent validation set containing five additional patients. An experienced reader drew a contour of the gross tumor volume (GTV) for each patient based on multi-modality images. Subsequently, two additional readers, together with the first one, were responsible for contouring three CTVs in total based on the GTV. We developed a diffusion model-based deep learning method that is capable of generating arbitrary number of different and plausible CTVs to mimic the inter-reader variability in CTV delineation. The proposed model incorporates a separate encoder to extract features from the GTV masks, leveraging the critical role of GTV information in accurate CTV delineation.

Results: The proposed diffusion model demonstrated superior performance with the highest Dice Index (0.902 compared to values below 0.881 for state-of-the-art models) and the best generalized energy distance (GED) (0.209 compared to values exceeding 0.221 for state-of-the-art models). It also achieved the second-highest recall and precision metrics among the compared ambiguous image segmentation models. Results from both datasets exhibited consistent trends, reinforcing the reliability of our findings. Additionally, ablation studies exploring different model structures and input configurations highlighted the significance of incorporating prior GTV information for accurate CTV delineation.

Conclusions: The proposed diffusion model successfully generates multiple plausible CTV contours for soft tissue sarcomas, effectively capturing inter-reader variability in CTV delineation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Medical physics

自引率

0.00%

发文量