使用基于场复杂性特征和影响图的贝叶斯神经网络的不确定性量化指导的患者特异性质量保证。

IF 3.4 3区医学 Q2 ENGINEERING, BIOMEDICAL

Physics in medicine and biology Pub Date : 2025-08-08 DOI:10.1088/1361-6560/adf40b

Xueying Yang, Xiangxiang Cui, Xile Zhang, Haoze Li, Hongqing Zhuang, Ruijie Yang, Jing Sui, Lisheng Geng

{"title":"使用基于场复杂性特征和影响图的贝叶斯神经网络的不确定性量化指导的患者特异性质量保证。","authors":"Xueying Yang, Xiangxiang Cui, Xile Zhang, Haoze Li, Hongqing Zhuang, Ruijie Yang, Jing Sui, Lisheng Geng","doi":"10.1088/1361-6560/adf40b","DOIUrl":null,"url":null,"abstract":"Objective.Recent advancements in artificial intelligence (AI)-driven prediction models for measurement-based patient-specific quality assurance (PSQA) necessitate uncertainty quantification (UQ) to ensure clinical safety.Approach.An uncertainty-guided framework was proposed for PSQA prediction. A classification model utilizing field complexity features and fluence maps was trained to categorize PSQA outcomes. Monte Carlo approximate Bayesian inference-based UQ method was employed to approximate the distribution of classification results. Pre-defined clinical uncertainty thresholds were defined via Correct-Certain (CC) and Incorrect-Uncertain (IU) curves to trigger manual intervention for high-uncertainty cases. A Multilayer Perceptron then predictedγpassing rates (GPR) by integrating classification model embeddings with uncertainty metrics. Prospective test was conducted on clinical test sets to systematically assess clinical reliability of the proposed framework.Main results.The classification model achieved sensitivities of 83.33% at 3%/3 mm, 93.33% at 3%/2 mm, and 94.74% at 2%/2 mm. The pre-defined clinical uncertainty thresholds were determined from CC and IU curves, ranged from 0.057-0.240 at 3%/3 mm, 0.014-0.207 at 3%/2 mm, and 0.080-0.244 at 2%/2 mm. Only 57.27% of manual intervention was required to ensure clinical safety, achieving 100% clinical sensitivity at 3%/3 mm. Compared to conventional measurement-based PSQA, this reduces the workload by 42.73%. The GPR prediction model yielded mean absolute errors of 1.64%, 1.88%, and 2.57% at 3%/3 mm, 3%/2 mm, and 2%/2 mmγcriteria, respectively, with uncertainty integration leading to a relative MAE reduction of 21.03% for the 'failed' cases. For clinical prospective test, the clinical sensitivity, specificity, and accuracy reached 100% under the minimum pre-defined uncertainty threshold at threeγcriteria after manual intervention.Significance.This framework enhances AI-based PSQA reliability by systematically addressing prediction uncertainty, reducing manual workload, and improving GPR accuracy for 'failed' fields, thereby facilitating safe clinical adoption of automated PSQA systems.","PeriodicalId":20185,"journal":{"name":"Physics in medicine and biology","volume":" ","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Uncertainty quantification-guided patient-specific quality assurance using Bayesian neural networks based on field complexity features and fluence maps.\",\"authors\":\"Xueying Yang, Xiangxiang Cui, Xile Zhang, Haoze Li, Hongqing Zhuang, Ruijie Yang, Jing Sui, Lisheng Geng\",\"doi\":\"10.1088/1361-6560/adf40b\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective.Recent advancements in artificial intelligence (AI)-driven prediction models for measurement-based patient-specific quality assurance (PSQA) necessitate uncertainty quantification (UQ) to ensure clinical safety.Approach.An uncertainty-guided framework was proposed for PSQA prediction. A classification model utilizing field complexity features and fluence maps was trained to categorize PSQA outcomes. Monte Carlo approximate Bayesian inference-based UQ method was employed to approximate the distribution of classification results. Pre-defined clinical uncertainty thresholds were defined via Correct-Certain (CC) and Incorrect-Uncertain (IU) curves to trigger manual intervention for high-uncertainty cases. A Multilayer Perceptron then predictedγpassing rates (GPR) by integrating classification model embeddings with uncertainty metrics. Prospective test was conducted on clinical test sets to systematically assess clinical reliability of the proposed framework.Main results.The classification model achieved sensitivities of 83.33% at 3%/3 mm, 93.33% at 3%/2 mm, and 94.74% at 2%/2 mm. The pre-defined clinical uncertainty thresholds were determined from CC and IU curves, ranged from 0.057-0.240 at 3%/3 mm, 0.014-0.207 at 3%/2 mm, and 0.080-0.244 at 2%/2 mm. Only 57.27% of manual intervention was required to ensure clinical safety, achieving 100% clinical sensitivity at 3%/3 mm. Compared to conventional measurement-based PSQA, this reduces the workload by 42.73%. The GPR prediction model yielded mean absolute errors of 1.64%, 1.88%, and 2.57% at 3%/3 mm, 3%/2 mm, and 2%/2 mmγcriteria, respectively, with uncertainty integration leading to a relative MAE reduction of 21.03% for the 'failed' cases. For clinical prospective test, the clinical sensitivity, specificity, and accuracy reached 100% under the minimum pre-defined uncertainty threshold at threeγcriteria after manual intervention.Significance.This framework enhances AI-based PSQA reliability by systematically addressing prediction uncertainty, reducing manual workload, and improving GPR accuracy for 'failed' fields, thereby facilitating safe clinical adoption of automated PSQA systems.\",\"PeriodicalId\":20185,\"journal\":{\"name\":\"Physics in medicine and biology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.4000,\"publicationDate\":\"2025-08-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Physics in medicine and biology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1088/1361-6560/adf40b\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics in medicine and biology","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1088/1361-6560/adf40b","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

摘要

目的：人工智能驱动的基于测量的患者特异性质量保证（PSQA）预测模型的最新进展需要对不确定性进行量化以确保临床安全。方法：提出了一个不确定性指导的PSQA预测框架。利用场复杂性特征和影响图训练分类模型对PSQA结果进行分类。采用蒙特卡罗近似贝叶斯推理的不确定性量化方法对分类结果的分布进行近似。通过正确-确定（CC）和不正确-不确定（IU）曲线定义预定义的临床不确定性阈值，以触发对高不确定性病例的人工干预。然后，多层感知器（MLP）通过集成分类模型嵌入和不确定性度量来预测γ通过率（GPR）。对临床试验集进行前瞻性检验，系统评估所提出框架的临床可靠性。主要结果：分类模型在3%/3mm时的敏感性为83.33%，在3%/2mm时为93.33%，在2%/2mm时为94.74%。预先定义的临床不确定性阈值由CC和IU曲线确定，范围为0.057-0.240,3%/3mm, 0.014-0.207, 2%/2mm, 0.080-0.244。仅需要57.27%的人工干预即可确保临床安全，在3%/3mm时达到100%的临床敏感性。与传统的基于测量的PSQA相比，这减少了42.73%的工作量。GPR预测模型在3%/3mm， 3%/2mm和2%/2mm γ标准下的平均绝对误差（MAEs）分别为1.64%，1.88%和2.57%，不确定性积分将“失败”案例的MAE降低了21.03%。对于临床前瞻性试验，在人工干预后的三个γ标准下，在最小预定义不确定性阈值下，临床敏感性、特异性和准确性达到100%。意义：该框架通过系统地解决预测不确定性，减少人工工作量，提高GPR对“失败”领域的准确性，增强了基于人工智能的PSQA可靠性，从而促进了自动化PSQA系统的安全临床应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Uncertainty quantification-guided patient-specific quality assurance using Bayesian neural networks based on field complexity features and fluence maps.

Objective.Recent advancements in artificial intelligence (AI)-driven prediction models for measurement-based patient-specific quality assurance (PSQA) necessitate uncertainty quantification (UQ) to ensure clinical safety.Approach.An uncertainty-guided framework was proposed for PSQA prediction. A classification model utilizing field complexity features and fluence maps was trained to categorize PSQA outcomes. Monte Carlo approximate Bayesian inference-based UQ method was employed to approximate the distribution of classification results. Pre-defined clinical uncertainty thresholds were defined via Correct-Certain (CC) and Incorrect-Uncertain (IU) curves to trigger manual intervention for high-uncertainty cases. A Multilayer Perceptron then predictedγpassing rates (GPR) by integrating classification model embeddings with uncertainty metrics. Prospective test was conducted on clinical test sets to systematically assess clinical reliability of the proposed framework.Main results.The classification model achieved sensitivities of 83.33% at 3%/3 mm, 93.33% at 3%/2 mm, and 94.74% at 2%/2 mm. The pre-defined clinical uncertainty thresholds were determined from CC and IU curves, ranged from 0.057-0.240 at 3%/3 mm, 0.014-0.207 at 3%/2 mm, and 0.080-0.244 at 2%/2 mm. Only 57.27% of manual intervention was required to ensure clinical safety, achieving 100% clinical sensitivity at 3%/3 mm. Compared to conventional measurement-based PSQA, this reduces the workload by 42.73%. The GPR prediction model yielded mean absolute errors of 1.64%, 1.88%, and 2.57% at 3%/3 mm, 3%/2 mm, and 2%/2 mmγcriteria, respectively, with uncertainty integration leading to a relative MAE reduction of 21.03% for the 'failed' cases. For clinical prospective test, the clinical sensitivity, specificity, and accuracy reached 100% under the minimum pre-defined uncertainty threshold at threeγcriteria after manual intervention.Significance.This framework enhances AI-based PSQA reliability by systematically addressing prediction uncertainty, reducing manual workload, and improving GPR accuracy for 'failed' fields, thereby facilitating safe clinical adoption of automated PSQA systems.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Physics in medicine and biology 医学-工程：生物医学

CiteScore

6.50

自引率

14.30%

发文量

409

审稿时长

2 months

期刊介绍： The development and application of theoretical, computational and experimental physics to medicine, physiology and biology. Topics covered are: therapy physics (including ionizing and non-ionizing radiation); biomedical imaging (e.g. x-ray, magnetic resonance, ultrasound, optical and nuclear imaging); image-guided interventions; image reconstruction and analysis (including kinetic modelling); artificial intelligence in biomedical physics and analysis; nanoparticles in imaging and therapy; radiobiology; radiation protection and patient dose monitoring; radiation dosimetry