Comparative analysis of open-source against commercial AI-based segmentation models for online adaptive MR-guided radiotherapy.

Dominik Langner, Marcel Nachbar, Monica Lo Russo, Simon Boeke, Cihan Gani, Maximilian Niyazi, Daniela Thorwarth
{"title":"Comparative analysis of open-source against commercial AI-based segmentation models for online adaptive MR-guided radiotherapy.","authors":"Dominik Langner, Marcel Nachbar, Monica Lo Russo, Simon Boeke, Cihan Gani, Maximilian Niyazi, Daniela Thorwarth","doi":"10.1016/j.zemedi.2025.04.008","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and purpose: </strong>Online adaptive magnetic resonance-guided radiotherapy (MRgRT) has emerged as a state-of-the-art treatment option for multiple tumour entities, accounting for daily anatomical and tumour volume changes, thus allowing sparing of relevant organs at risk (OARs). However, the annotation of treatment-relevant anatomical structures in context of online plan adaptation remains challenging, often relying on commercial segmentation solutions due to limited availability of clinically validated alternatives. The aim of this study was to investigate whether an open-source artificial intelligence (AI) segmentation network can compete with the annotation accuracy of a commercial solution, both trained on the identical dataset, questioning the need for commercial models in clinical practice.</p><p><strong>Materials and methods: </strong>For 47 pelvic patients, T2w MR imaging data acquired on a 1.5 T MR-Linac were manually contoured, identifying prostate, seminal vesicles, rectum, anal canal, bladder, penile bulb, and bony structures. These training data were used for the generation of an in-house AI segmentation model, a nnU-Net with residual encoder architecture featuring a streamlined single image inference pipeline, and re-training of a commercial solution. For quantitative evaluation, 20 MR images were contoured by a radiation oncologist, considered as ground truth contours (GTC) and compared with the in-house/commercial AI-based contours (iAIC/cAIC) using Dice Similarity Coefficient (DSC), 95% Hausdorff distances (HD95), and surface DSC (sDSC). For qualitative evaluation, four radiation oncologists assessed the usability of OAR/target iAIC within an online adaptive workflow using a four-point Likert scale: (1) acceptable without modification, (2) requiring minor adjustments, (3) requiring major adjustments, and (4) not usable.</p><p><strong>Results: </strong>Patient-individual annotations were generated in a median [range] time of 23 [16-34] s for iAIC and 152 [121-198] s for cAIC, respectively. OARs showed a maximum median DSC of 0.97/0.97 (iAIC/cAIC) for bladder and minimum median DSC of 0.78/0.79 (iAIC/cAIC) for anal canal/penile bulb. Maximal respectively minimal median HD95 were detected for rectum with 17.3/20.6 mm (iAIC/cAIC) and for bladder with 5.6/6.0 mm (iAIC/cAIC). Overall, the average median DSC/HD95 values were 0.87/11.8mm (iAIC) and 0.83/10.2mm (cAIC) for OAR/targets and 0.90/11.9mm (iAIC) and 0.91/16.5mm (cAIC) for bony structures. For a tolerance of 3 mm, the highest and lowest sDSC were determined for bladder (iAIC:1.00, cAIC:0.99) and prostate in iAIC (0.89) and anal canal in cAIC (0.80), respectively. Qualitatively, 84.8% of analysed contours were considered as clinically acceptable for iAIC, while 12.9% required minor and 2.3% major adjustments or were classed as unusable. Contour-specific analysis showed that iAIC achieved the highest mean scores with 1.00 for the anal canal and the lowest with 1.61 for the prostate.</p><p><strong>Conclusion: </strong>This study demonstrates that open-source segmentation framework can achieve comparable annotation accuracy to commercial solutions for pelvic anatomy in online adaptive MRgRT. The adapted framework not only maintained high segmentation performance, with 84.8% of contours accepted by physicians or requiring only minor corrections (12.9%) but also enhanced clinical workflow efficiency of online adaptive MRgRT through reduced inference times. These findings establish open-source frameworks as viable alternatives to commercial systems in supervised clinical workflows.</p>","PeriodicalId":101315,"journal":{"name":"Zeitschrift fur medizinische Physik","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Zeitschrift fur medizinische Physik","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.zemedi.2025.04.008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background and purpose: Online adaptive magnetic resonance-guided radiotherapy (MRgRT) has emerged as a state-of-the-art treatment option for multiple tumour entities, accounting for daily anatomical and tumour volume changes, thus allowing sparing of relevant organs at risk (OARs). However, the annotation of treatment-relevant anatomical structures in context of online plan adaptation remains challenging, often relying on commercial segmentation solutions due to limited availability of clinically validated alternatives. The aim of this study was to investigate whether an open-source artificial intelligence (AI) segmentation network can compete with the annotation accuracy of a commercial solution, both trained on the identical dataset, questioning the need for commercial models in clinical practice.

Materials and methods: For 47 pelvic patients, T2w MR imaging data acquired on a 1.5 T MR-Linac were manually contoured, identifying prostate, seminal vesicles, rectum, anal canal, bladder, penile bulb, and bony structures. These training data were used for the generation of an in-house AI segmentation model, a nnU-Net with residual encoder architecture featuring a streamlined single image inference pipeline, and re-training of a commercial solution. For quantitative evaluation, 20 MR images were contoured by a radiation oncologist, considered as ground truth contours (GTC) and compared with the in-house/commercial AI-based contours (iAIC/cAIC) using Dice Similarity Coefficient (DSC), 95% Hausdorff distances (HD95), and surface DSC (sDSC). For qualitative evaluation, four radiation oncologists assessed the usability of OAR/target iAIC within an online adaptive workflow using a four-point Likert scale: (1) acceptable without modification, (2) requiring minor adjustments, (3) requiring major adjustments, and (4) not usable.

Results: Patient-individual annotations were generated in a median [range] time of 23 [16-34] s for iAIC and 152 [121-198] s for cAIC, respectively. OARs showed a maximum median DSC of 0.97/0.97 (iAIC/cAIC) for bladder and minimum median DSC of 0.78/0.79 (iAIC/cAIC) for anal canal/penile bulb. Maximal respectively minimal median HD95 were detected for rectum with 17.3/20.6 mm (iAIC/cAIC) and for bladder with 5.6/6.0 mm (iAIC/cAIC). Overall, the average median DSC/HD95 values were 0.87/11.8mm (iAIC) and 0.83/10.2mm (cAIC) for OAR/targets and 0.90/11.9mm (iAIC) and 0.91/16.5mm (cAIC) for bony structures. For a tolerance of 3 mm, the highest and lowest sDSC were determined for bladder (iAIC:1.00, cAIC:0.99) and prostate in iAIC (0.89) and anal canal in cAIC (0.80), respectively. Qualitatively, 84.8% of analysed contours were considered as clinically acceptable for iAIC, while 12.9% required minor and 2.3% major adjustments or were classed as unusable. Contour-specific analysis showed that iAIC achieved the highest mean scores with 1.00 for the anal canal and the lowest with 1.61 for the prostate.

Conclusion: This study demonstrates that open-source segmentation framework can achieve comparable annotation accuracy to commercial solutions for pelvic anatomy in online adaptive MRgRT. The adapted framework not only maintained high segmentation performance, with 84.8% of contours accepted by physicians or requiring only minor corrections (12.9%) but also enhanced clinical workflow efficiency of online adaptive MRgRT through reduced inference times. These findings establish open-source frameworks as viable alternatives to commercial systems in supervised clinical workflows.

开源与商用人工智能在线自适应磁共振引导放疗分割模型的对比分析。
背景和目的:在线自适应磁共振引导放射治疗(MRgRT)已成为多种肿瘤实体的最先进治疗选择,考虑到日常解剖和肿瘤体积的变化,从而允许保留相关危险器官(OARs)。然而,在在线计划适应的背景下,治疗相关解剖结构的注释仍然具有挑战性,由于临床验证替代方案的可用性有限,通常依赖于商业分割解决方案。本研究的目的是研究开源人工智能(AI)分割网络是否可以与商业解决方案的注释准确性竞争,两者都是在相同的数据集上训练的,质疑临床实践中对商业模型的需求。材料和方法:对47例盆腔患者,在1.5 T MR- linac上获取T2w MR成像数据,手工绘制轮廓,识别前列腺、精囊、直肠、肛管、膀胱、阴茎球和骨结构。这些训练数据用于生成内部AI分割模型,具有流线型单图像推理管道的残余编码器架构的nnU-Net,以及商业解决方案的重新训练。为了进行定量评估,20张MR图像由放射肿瘤学家绘制,被认为是地面真实轮廓(GTC),并使用Dice相似系数(DSC)、95% Hausdorff距离(HD95)和表面DSC (sDSC)与内部/商业基于ai的轮廓(iAIC/cAIC)进行比较。为了进行定性评估,四名放射肿瘤学家使用李克特量表评估了在线自适应工作流程中OAR/target iAIC的可用性:(1)无需修改即可接受,(2)需要小调整,(3)需要大调整,(4)不可用。结果:iAIC和cAIC分别在23 [16-34]s和152 [121-198]s的中位[范围]时间内生成患者个体注释。膀胱的最大中位DSC为0.97/0.97 (iAIC/cAIC),肛管/阴茎球的最小中位DSC为0.78/0.79 (iAIC/cAIC)。最大最小中位HD95分别为直肠17.3/20.6 mm (iAIC/cAIC)和膀胱5.6/6.0 mm (iAIC/cAIC)。总体而言,桨叶/目标的平均DSC/HD95中位数为0.87/11.8mm (iAIC)和0.83/10.2mm (cAIC),骨结构的平均DSC/HD95中位数为0.90/11.9mm (iAIC)和0.91/16.5mm (cAIC)。当耐受量为3 mm时,膀胱(iAIC为1.00,cAIC为0.99)、前列腺(iAIC为0.89)和肛管(cAIC为0.80)的sDSC最高和最低。定性地说,84.8%的分析轮廓线被认为是临床可接受的iAIC,而12.9%需要轻微调整,2.3%需要重大调整或被归类为不可用。等高线特异性分析显示,iAIC的平均评分最高,肛管为1.00,前列腺为1.61,最低。结论:本研究表明,开源分割框架可以在在线自适应MRgRT中实现与商业解决方案相当的骨盆解剖注释精度。调整后的框架不仅保持了较高的分割性能,84.8%的轮廓被医生接受或只需要少量的修正(12.9%),而且通过减少推理时间,提高了在线自适应MRgRT的临床工作效率。这些发现确立了开源框架在监督临床工作流程中作为商业系统的可行替代方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信