基于深度学习的自分割网络用于自适应mri引导肺放疗的患者特异性不确定度校准。

IF 3.3 3区医学 Q2 ENGINEERING, BIOMEDICAL

Physics in medicine and biology Pub Date : 2025-05-16 DOI:10.1088/1361-6560/add640

Moritz Rabe, Ettore F Meliadò, Sebastian N Marschner, Claus Belka, Stefanie Corradini, Cornelis A T van den Berg, Guillaume Landry, Christopher Kurz

{"title":"基于深度学习的自分割网络用于自适应mri引导肺放疗的患者特异性不确定度校准。","authors":"Moritz Rabe, Ettore F Meliadò, Sebastian N Marschner, Claus Belka, Stefanie Corradini, Cornelis A T van den Berg, Guillaume Landry, Christopher Kurz","doi":"10.1088/1361-6560/add640","DOIUrl":null,"url":null,"abstract":"Objective.Uncertainty assessment of deep learning autosegmentation (DLAS) models can support contour corrections in adaptive radiotherapy (ART), e.g. by utilizing Monte Carlo Dropout (MCD) uncertainty maps. However, poorly calibrated uncertainties at the patient level often render these clinically nonviable. We evaluated population-based and patient-specific DLAS accuracy and uncertainty calibration and propose a patient-specific post-training uncertainty calibration method for DLAS in ART.Approach.The study included 122 lung cancer patients treated with a low-field MR-linac (80/19/23 training/validation/test cases). Ten single-label 3D-U-Net population-based baseline models (BM) were trained with dropout using planning MRIs (pMRIs) and contours for nine organs-at-riks (OARs) and gross tumor volumes (GTVs). Patient-specific models (PS) were created by fine-tuning BMs with each test patient's pMRI. Model uncertainty was assessed with MCD, averaged into probability maps. Uncertainty calibration was evaluated with reliability diagrams and expected calibration error (ECE). A proposed post-training calibration method rescaled MCD probabilities for fraction images in BM (calBM) and PS (calPS) after fitting reliability diagrams from pMRIs. All models were evaluated on fraction images using Dice similarity coefficient (DSC), 95th percentile Hausdorff distance (HD95) and ECE. Metrics were compared among models for all OARs combined (n = 163), and the GTV (n = 23), using Friedman and posthoc-Nemenyi tests (α = 0.05).Main results.For the OARs, patient-specific fine-tuning significantly (p < 0.001) increased median DSC from 0.78 (BM) to 0.86 (PS) and reduced HD95 from 14 mm (BM) to 6.0 mm (PS). Uncertainty calibration achieved substantial reductions in ECE, from 0.25 (BM) to 0.091 (calBM) and 0.22 (PS) to 0.11 (calPS) (p < 0.001), without significantly affecting DSC or HD95 (p > 0.05). For the GTV, BM performance was poor (DSC = 0.05) but significantly (p < 0.001) improved with PS training (DSC = 0.75) while uncertainty calibration reduced ECE from 0.22 (PS) to 0.15 (calPS) (p = 0.45).Significance.Post-training uncertainty calibration yields geometrically accurate DLAS models with well-calibrated uncertainty estimates, crucial for ART applications.","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Patient-specific uncertainty calibration of deep learning-based autosegmentation networks for adaptive MRI-guided lung radiotherapy.\",\"authors\":\"Moritz Rabe, Ettore F Meliadò, Sebastian N Marschner, Claus Belka, Stefanie Corradini, Cornelis A T van den Berg, Guillaume Landry, Christopher Kurz\",\"doi\":\"10.1088/1361-6560/add640\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective.Uncertainty assessment of deep learning autosegmentation (DLAS) models can support contour corrections in adaptive radiotherapy (ART), e.g. by utilizing Monte Carlo Dropout (MCD) uncertainty maps. However, poorly calibrated uncertainties at the patient level often render these clinically nonviable. We evaluated population-based and patient-specific DLAS accuracy and uncertainty calibration and propose a patient-specific post-training uncertainty calibration method for DLAS in ART.Approach.The study included 122 lung cancer patients treated with a low-field MR-linac (80/19/23 training/validation/test cases). Ten single-label 3D-U-Net population-based baseline models (BM) were trained with dropout using planning MRIs (pMRIs) and contours for nine organs-at-riks (OARs) and gross tumor volumes (GTVs). Patient-specific models (PS) were created by fine-tuning BMs with each test patient's pMRI. Model uncertainty was assessed with MCD, averaged into probability maps. Uncertainty calibration was evaluated with reliability diagrams and expected calibration error (ECE). A proposed post-training calibration method rescaled MCD probabilities for fraction images in BM (calBM) and PS (calPS) after fitting reliability diagrams from pMRIs. All models were evaluated on fraction images using Dice similarity coefficient (DSC), 95th percentile Hausdorff distance (HD95) and ECE. Metrics were compared among models for all OARs combined (n = 163), and the GTV (n = 23), using Friedman and posthoc-Nemenyi tests (α = 0.05).Main results.For the OARs, patient-specific fine-tuning significantly (p < 0.001) increased median DSC from 0.78 (BM) to 0.86 (PS) and reduced HD95 from 14 mm (BM) to 6.0 mm (PS). Uncertainty calibration achieved substantial reductions in ECE, from 0.25 (BM) to 0.091 (calBM) and 0.22 (PS) to 0.11 (calPS) (p < 0.001), without significantly affecting DSC or HD95 (p > 0.05). For the GTV, BM performance was poor (DSC = 0.05) but significantly (p < 0.001) improved with PS training (DSC = 0.75) while uncertainty calibration reduced ECE from 0.22 (PS) to 0.15 (calPS) (p = 0.45).Significance.Post-training uncertainty calibration yields geometrically accurate DLAS models with well-calibrated uncertainty estimates, crucial for ART applications.\",\"PeriodicalId\":20185,\"journal\":{\"name\":\"Physics in medicine and biology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-05-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physics in medicine and biology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-6560/add640\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/add640","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

目的：深度学习自动分割（DLAS）模型的不确定性评估可以支持自适应放疗（ART）中的轮廓校正，例如利用蒙特卡罗Dropout （MCD）不确定性图。然而，在患者水平上，校准不当的不确定性往往使这些在临床上不可行。我们评估了基于人群和患者特异性的DLAS精度和不确定度校准，并提出了一种针对ART中DLAS的患者特异性训练后不确定度校准方法。该研究包括122例接受低场MR-linac治疗的肺癌患者（80/19/23例训练/验证/测试病例）。使用计划核磁共振（pmri）和9个危险器官（OARs）和总肿瘤体积（gtv）的轮廓，对10个单标签3D-U-Net基于人群的基线模型（BM）进行dropout训练。患者特异性模型（PS）是通过对每个测试患者的pMRI进行微调来创建的。用MCD评估模型的不确定性，将其平均到概率图中。用可靠性图和期望校准误差（ECE）对不确定度校准进行评价。提出了一种训练后校正方法，在拟合pmri的可靠性图后，对BM （calBM）和PS （calPS）中分数图像的MCD概率进行了重新标定。采用Dice相似系数（DSC）、第95百分位Hausdorff距离（HD95）和ECE在分数图像上对所有模型进行评价。采用Friedman和postthoc - nemenyi检验（α=0.05）比较所有桨叶组合模型（n=163）和GTV （n=23）的指标。；对于OARs，患者特异性微调显著（p0.05）。对于GTV， BM性能较差（DSC=0.05），但显著(p

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Patient-specific uncertainty calibration of deep learning-based autosegmentation networks for adaptive MRI-guided lung radiotherapy.

Objective.Uncertainty assessment of deep learning autosegmentation (DLAS) models can support contour corrections in adaptive radiotherapy (ART), e.g. by utilizing Monte Carlo Dropout (MCD) uncertainty maps. However, poorly calibrated uncertainties at the patient level often render these clinically nonviable. We evaluated population-based and patient-specific DLAS accuracy and uncertainty calibration and propose a patient-specific post-training uncertainty calibration method for DLAS in ART.Approach.The study included 122 lung cancer patients treated with a low-field MR-linac (80/19/23 training/validation/test cases). Ten single-label 3D-U-Net population-based baseline models (BM) were trained with dropout using planning MRIs (pMRIs) and contours for nine organs-at-riks (OARs) and gross tumor volumes (GTVs). Patient-specific models (PS) were created by fine-tuning BMs with each test patient's pMRI. Model uncertainty was assessed with MCD, averaged into probability maps. Uncertainty calibration was evaluated with reliability diagrams and expected calibration error (ECE). A proposed post-training calibration method rescaled MCD probabilities for fraction images in BM (calBM) and PS (calPS) after fitting reliability diagrams from pMRIs. All models were evaluated on fraction images using Dice similarity coefficient (DSC), 95th percentile Hausdorff distance (HD95) and ECE. Metrics were compared among models for all OARs combined (n = 163), and the GTV (n = 23), using Friedman and posthoc-Nemenyi tests (α = 0.05).Main results.For the OARs, patient-specific fine-tuning significantly (p < 0.001) increased median DSC from 0.78 (BM) to 0.86 (PS) and reduced HD95 from 14 mm (BM) to 6.0 mm (PS). Uncertainty calibration achieved substantial reductions in ECE, from 0.25 (BM) to 0.091 (calBM) and 0.22 (PS) to 0.11 (calPS) (p < 0.001), without significantly affecting DSC or HD95 (p > 0.05). For the GTV, BM performance was poor (DSC = 0.05) but significantly (p < 0.001) improved with PS training (DSC = 0.75) while uncertainty calibration reduced ECE from 0.22 (PS) to 0.15 (calPS) (p = 0.45).Significance.Post-training uncertainty calibration yields geometrically accurate DLAS models with well-calibrated uncertainty estimates, crucial for ART applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Physics in medicine and biology 医学-工程：生物医学

CiteScore

6.50

自引率

14.30%

发文量

409

审稿时长

2 months

期刊介绍： The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry