{"title":"Evaluation and tuning of a commercial automated planning system for prostate radiotherapy","authors":"Antony Carver, Stuart Green","doi":"10.1016/j.phro.2025.100834","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Purpose:</h3><div>Artificial Intelligence (AI) based automated planning is increasing in popularity. Guidance has been published recommending approaches for the safe implementation and monitoring of these techniques. An evaluation of a commercial AI tool was undertaken and reported along with the specific methods used to evaluate, tune and monitor the plan quality.</div></div><div><h3>Materials and Methods:</h3><div>A total of 335 previously planned prostate patients were used to evaluate and commission a commercial AI based planning solution. The quality of the automatically produced plans was compared to previous practice using models inspired by existing research into knowledge based planning. A quantile regression based technique identified the most optimal historic plans to be used for model tuning. Finally, a control chart based method was validated for post-deployment monitoring of the produced plan quality.</div></div><div><h3>Results:</h3><div>The baseline model provided by the manufacturer was found to provide good plan quality overall with 9 out 15 plan quality metrics found to have lower variance after accounting for anatomy. Rectum sparing was found to be inferior to human generated plans. Two further iterations of the model were produced in collaboration with the manufacturer. Further iterations of the model resulted in comparable rectum sparing, a 0.02 mean difference in rectum V50 Gy, was achieved whilst maintaining much of the improved consistency.</div></div><div><h3>Conclusions:</h3><div>A method to implement the guidance for commissioning of automated and AI based planning tools is presented alongside a method for monitoring the subsequent plan quality. The final plan quality achieved was comparable to or better than the original plans following two revisions.</div></div>","PeriodicalId":36850,"journal":{"name":"Physics and Imaging in Radiation Oncology","volume":"36 ","pages":"Article 100834"},"PeriodicalIF":3.3000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physics and Imaging in Radiation Oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2405631625001393","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and Purpose:
Artificial Intelligence (AI) based automated planning is increasing in popularity. Guidance has been published recommending approaches for the safe implementation and monitoring of these techniques. An evaluation of a commercial AI tool was undertaken and reported along with the specific methods used to evaluate, tune and monitor the plan quality.
Materials and Methods:
A total of 335 previously planned prostate patients were used to evaluate and commission a commercial AI based planning solution. The quality of the automatically produced plans was compared to previous practice using models inspired by existing research into knowledge based planning. A quantile regression based technique identified the most optimal historic plans to be used for model tuning. Finally, a control chart based method was validated for post-deployment monitoring of the produced plan quality.
Results:
The baseline model provided by the manufacturer was found to provide good plan quality overall with 9 out 15 plan quality metrics found to have lower variance after accounting for anatomy. Rectum sparing was found to be inferior to human generated plans. Two further iterations of the model were produced in collaboration with the manufacturer. Further iterations of the model resulted in comparable rectum sparing, a 0.02 mean difference in rectum V50 Gy, was achieved whilst maintaining much of the improved consistency.
Conclusions:
A method to implement the guidance for commissioning of automated and AI based planning tools is presented alongside a method for monitoring the subsequent plan quality. The final plan quality achieved was comparable to or better than the original plans following two revisions.