{"title":"Physics-informed machine learning for predicting MLC and gantry errors in VMAT: a feature engineering approach","authors":"Perumal Murugan, Ravikumar Manickam","doi":"10.1016/j.ejmp.2025.105064","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>This study presents a physics-informed, feature-engineered machine learning (ML) framework to predict multileaf collimator (MLC) and gantry positional errors in volumetric modulated arc therapy (VMAT)</div></div><div><h3>Methods</h3><div>Data from 32 VMAT trajectory logs (TrueBeam linac, HD120 MLC) were synchronized with DICOMRT plans to extract delivery dynamics. Novel physics-based parameters were introduced: a friction factor, an enhanced gravity vector, and MLC speed-normalized features. Three ML models XGBoost, LightGBM, and deep neural networks (DNNs) were optimized using Optuna and trained on trajectory log and DICOM-RT-derived datasets. Feature importance was evaluated via Spearman correlation, mutual information, and SHapley Additive Explanations (SHAP).</div></div><div><h3>Results</h3><div>Systematic discrepancies between DICOM-RT and trajectory log data were identified, with mean absolute deviations of 7.0 % (MLC speed), 8.0 % (gantry speed), and 8.5 % (dose rate). MLC speed emerged as the dominant predictor (Spearman: r<sub>s</sub> = 0.891), while friction and gravity features exhibited significant correlations (r<sub>s</sub> = 0.46 and 0.33, respectively). Mutual information revealed non-monotonic dependencies between gantry error and gantry angle (score: 0.34). LightGBM and XGBoost achieved superior MLC error prediction (MAE: 0.0019 mm, RMSE: 0.0027 mm), capturing > 90 % of observed errors, while DNNs lagged by 30 %. Engineered features reduced residual errors by 30 %. Gantry error predictions showed lower accuracy (MAE: 0.012°–0.015°). SHAP analysis highlighted physics-driven features as top contributors.</div></div><div><h3>Conclusion</h3><div>This work underscores the critical role of domain knowledge in ML for radiotherapy, achieving a 30% error reduction through physics-based feature engineering. The findings advocate for prioritizing feature space exploration alongside model optimization to enhance VMAT quality assurance.</div></div>","PeriodicalId":56092,"journal":{"name":"Physica Medica-European Journal of Medical Physics","volume":"136 ","pages":"Article 105064"},"PeriodicalIF":2.7000,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Physica Medica-European Journal of Medical Physics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1120179725001747","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Background
This study presents a physics-informed, feature-engineered machine learning (ML) framework to predict multileaf collimator (MLC) and gantry positional errors in volumetric modulated arc therapy (VMAT)
Methods
Data from 32 VMAT trajectory logs (TrueBeam linac, HD120 MLC) were synchronized with DICOMRT plans to extract delivery dynamics. Novel physics-based parameters were introduced: a friction factor, an enhanced gravity vector, and MLC speed-normalized features. Three ML models XGBoost, LightGBM, and deep neural networks (DNNs) were optimized using Optuna and trained on trajectory log and DICOM-RT-derived datasets. Feature importance was evaluated via Spearman correlation, mutual information, and SHapley Additive Explanations (SHAP).
Results
Systematic discrepancies between DICOM-RT and trajectory log data were identified, with mean absolute deviations of 7.0 % (MLC speed), 8.0 % (gantry speed), and 8.5 % (dose rate). MLC speed emerged as the dominant predictor (Spearman: rs = 0.891), while friction and gravity features exhibited significant correlations (rs = 0.46 and 0.33, respectively). Mutual information revealed non-monotonic dependencies between gantry error and gantry angle (score: 0.34). LightGBM and XGBoost achieved superior MLC error prediction (MAE: 0.0019 mm, RMSE: 0.0027 mm), capturing > 90 % of observed errors, while DNNs lagged by 30 %. Engineered features reduced residual errors by 30 %. Gantry error predictions showed lower accuracy (MAE: 0.012°–0.015°). SHAP analysis highlighted physics-driven features as top contributors.
Conclusion
This work underscores the critical role of domain knowledge in ML for radiotherapy, achieving a 30% error reduction through physics-based feature engineering. The findings advocate for prioritizing feature space exploration alongside model optimization to enhance VMAT quality assurance.
期刊介绍:
Physica Medica, European Journal of Medical Physics, publishing with Elsevier from 2007, provides an international forum for research and reviews on the following main topics:
Medical Imaging
Radiation Therapy
Radiation Protection
Measuring Systems and Signal Processing
Education and training in Medical Physics
Professional issues in Medical Physics.