Geometrically focused training and evaluation of organs-at-risk segmentation via deep learning

IF 3.2 2区 医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING
Medical physics Pub Date : 2025-04-25 DOI:10.1002/mp.17840
Ruiyan Ni, Elizabeth Chuk, Kathy Han, Jennifer Croke, Anthony Fyles, Jelena Lukovic, Michael Milosevic, Benjamin Haibe-Kains, Alexandra Rink
{"title":"Geometrically focused training and evaluation of organs-at-risk segmentation via deep learning","authors":"Ruiyan Ni,&nbsp;Elizabeth Chuk,&nbsp;Kathy Han,&nbsp;Jennifer Croke,&nbsp;Anthony Fyles,&nbsp;Jelena Lukovic,&nbsp;Michael Milosevic,&nbsp;Benjamin Haibe-Kains,&nbsp;Alexandra Rink","doi":"10.1002/mp.17840","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Deep learning methods are promising in automating segmentation of organs at risk (OARs) in radiotherapy. However, the lack of a geometric indicator for dosimetry accuracy remains to be a problem. This issue is particularly pronounced in specific radiotherapy treatments where only the proximity of structures to the radiotherapy target affects the dose planning. In cervical cancer high dose-rate (HDR) brachytherapy, treatment planning is motivated by limiting dose to the hottest 2 cubic centimeters (D2cm<sup>3</sup>) of the OARs. Similarly, Ethos online adaptive radiotherapy system prioritizes only the closest target structures for adaptive plan generation.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>We propose a novel geometrically focused deep learning training method and evaluation metric, using cervical brachytherapy as a case study. A distance-penalized (DP) loss function was developed to focus attention on the near-to-target OAR regions. We also introduced and evaluated a novel geometric metric, weighted dice similarity coefficient (wDSC), correlated with OARs D2cm<sup>3</sup>.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>A model was trained using a 3D U-Net architecture and 170 T2-weighted magnetic resonance (MR) images (56 patients) with clinical contours. The dataset was split into subsets at the patient level: 45 patients (150 scans) as the training set for five-fold cross-validation and 11 patients (20 scans) as the testing set. Another dataset from our institution, consisting of 35 MR scans from 22 cervical cancer patients, was used as an independent internal testing set. A distance map, emphasizing errors near high-risk clinical target volume (CTV<sub>HR</sub>), was used to penalize two commonly used loss functions, cross-entropy (CE) loss and DiceCE loss. The wDSC emphasizes the accuracy of OAR regions proximal to CTV<sub>HR</sub> by incorporating a weighted factor in the original vDSC. The Pearson correlation coefficient (<i>r</i>) was used to quantify the strength of the relationship between D2cm<sup>3</sup> accuracy and six evaluation metrics (wDSC and five standard metrics). A physician rated and revised the auto-contours for the clinical acceptability tests.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The wDSC moderately correlated (<i>r</i> = -0.55) with D2cm<sup>3</sup> accuracy, outperforming standard geometric metrics. Models using DP loss functions consistently yielded higher wDSCs compared to their respective non-DP counterparts. DP loss models also improved D2cm<sup>3</sup> accuracy, indicating an enhanced accuracy in dosimetry. The clinical acceptability tests revealed that more than 94% of bladder and rectum contours and approximately half of the sigmoid and small bowel contours were clinically accepted.</p>\n </section>\n \n <section>\n \n <h3> Conclusion</h3>\n \n <p>We developed and evaluated a new geometric metric, wDSC, as a better indicator of D2cm<sup>3</sup> accuracy, which has the potential to become a surrogate for dosimetric accuracy in cervical brachytherapy. The model with DP loss showed non-statistically significant improvements in geometric and dosimetric performance. This work also holds the potential to be used for precise OARs delineation in adaptive radiotherapy.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 7","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/mp.17840","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/mp.17840","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Deep learning methods are promising in automating segmentation of organs at risk (OARs) in radiotherapy. However, the lack of a geometric indicator for dosimetry accuracy remains to be a problem. This issue is particularly pronounced in specific radiotherapy treatments where only the proximity of structures to the radiotherapy target affects the dose planning. In cervical cancer high dose-rate (HDR) brachytherapy, treatment planning is motivated by limiting dose to the hottest 2 cubic centimeters (D2cm3) of the OARs. Similarly, Ethos online adaptive radiotherapy system prioritizes only the closest target structures for adaptive plan generation.

Purpose

We propose a novel geometrically focused deep learning training method and evaluation metric, using cervical brachytherapy as a case study. A distance-penalized (DP) loss function was developed to focus attention on the near-to-target OAR regions. We also introduced and evaluated a novel geometric metric, weighted dice similarity coefficient (wDSC), correlated with OARs D2cm3.

Methods

A model was trained using a 3D U-Net architecture and 170 T2-weighted magnetic resonance (MR) images (56 patients) with clinical contours. The dataset was split into subsets at the patient level: 45 patients (150 scans) as the training set for five-fold cross-validation and 11 patients (20 scans) as the testing set. Another dataset from our institution, consisting of 35 MR scans from 22 cervical cancer patients, was used as an independent internal testing set. A distance map, emphasizing errors near high-risk clinical target volume (CTVHR), was used to penalize two commonly used loss functions, cross-entropy (CE) loss and DiceCE loss. The wDSC emphasizes the accuracy of OAR regions proximal to CTVHR by incorporating a weighted factor in the original vDSC. The Pearson correlation coefficient (r) was used to quantify the strength of the relationship between D2cm3 accuracy and six evaluation metrics (wDSC and five standard metrics). A physician rated and revised the auto-contours for the clinical acceptability tests.

Results

The wDSC moderately correlated (r = -0.55) with D2cm3 accuracy, outperforming standard geometric metrics. Models using DP loss functions consistently yielded higher wDSCs compared to their respective non-DP counterparts. DP loss models also improved D2cm3 accuracy, indicating an enhanced accuracy in dosimetry. The clinical acceptability tests revealed that more than 94% of bladder and rectum contours and approximately half of the sigmoid and small bowel contours were clinically accepted.

Conclusion

We developed and evaluated a new geometric metric, wDSC, as a better indicator of D2cm3 accuracy, which has the potential to become a surrogate for dosimetric accuracy in cervical brachytherapy. The model with DP loss showed non-statistically significant improvements in geometric and dosimetric performance. This work also holds the potential to be used for precise OARs delineation in adaptive radiotherapy.

Abstract Image

基于深度学习的高危器官分割的几何聚焦训练与评估。
背景:深度学习方法在放射治疗中危及器官(OARs)的自动分割中具有广阔的应用前景。然而,缺乏测量剂量准确性的几何指标仍然是一个问题。这个问题在特定的放射治疗中尤其明显,因为只有结构靠近放射治疗目标才会影响剂量计划。在宫颈癌高剂量率(HDR)近距离治疗中,治疗计划的动机是将剂量限制在桨叶最热的2立方厘米(D2cm3)。同样,Ethos在线自适应放射治疗系统只优先考虑最近的目标结构进行自适应计划生成。目的:我们提出了一种新的几何聚焦深度学习训练方法和评估指标,并以颈部近距离治疗为例进行了研究。开发了一个距离惩罚(DP)损失函数,将注意力集中在接近目标的桨叶区域。我们还引入并评估了一种新的几何度量,加权骰子相似系数(wDSC),与桨数D2cm3相关。方法:采用三维U-Net结构和具有临床轮廓的170张t2加权磁共振(MR)图像(56例)对模型进行训练。数据集在患者水平上被分成子集:45名患者(150次扫描)作为五倍交叉验证的训练集,11名患者(20次扫描)作为测试集。我们机构的另一个数据集,包括来自22名宫颈癌患者的35次磁共振扫描,被用作独立的内部测试集。一个距离图,强调高危临床靶体积(CTVHR)附近的误差,用于惩罚两种常用的损失函数,交叉熵(CE)损失和DiceCE损失。wDSC通过在原始vDSC中加入加权因子来强调靠近CTVHR的桨叶区域的准确性。使用Pearson相关系数(r)来量化D2cm3准确度与6个评价指标(wDSC和5个标准指标)之间的关系强度。医生对临床可接受性测试的自动轮廓进行评分和修改。结果:wDSC与D2cm3准确度中度相关(r = -0.55),优于标准几何指标。与各自的非DP对应模型相比,使用DP损失函数的模型始终产生更高的wdsc。DP损失模型也提高了D2cm3的准确性,表明剂量学的准确性得到了提高。临床可接受性测试显示,超过94%的膀胱和直肠轮廓以及大约一半的乙状结肠和小肠轮廓在临床上被接受。结论:我们开发并评估了一种新的几何指标wDSC,作为D2cm3准确性的更好指标,它有可能成为宫颈近距离放疗剂量学准确性的替代指标。具有DP损失的模型在几何和剂量学性能方面显示出非统计学意义上的显著改善。这项工作也有可能用于适应性放疗中精确的OARs描述。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Medical physics
Medical physics 医学-核医学
CiteScore
6.80
自引率
15.80%
发文量
660
审稿时长
1.7 months
期刊介绍: Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信