{"title":"基于自动机器学习的医用放射性核素生产厚靶产量预测","authors":"YanBang Tang","doi":"10.1016/j.radphyschem.2025.113281","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate knowledge of thick-target yields (TTY) is critical for the efficient and reliable production of medical radionuclides. In this study, we developed and evaluated a suite of machine learning models to perform prediction of the TTY for four medically relevant (p,n) reactions: <sup>167</sup>Er(p,n)<sup>167</sup>Tm, <sup>58</sup>Fe(p,n)<sup>58m</sup>Co, <sup>119</sup>Sn(p,n)<sup>119</sup>Sb, and <sup>186</sup>W(p,n)<sup>186g</sup>Re. A unified dataset was curated from the IAEA's evaluated data library. Physics-informed features, including the proton, neutron, and mass numbers of both the target and product nuclides, were engineered to provide a physical context for the models. The performance of fourteen algorithms, including ensemble methods, kernel-based models, and a Automated Machine Learning (AutoML) framework, Autogluon, was systematically evaluated. The Autogluon model demonstrated strong predictive performance, achieving a high coefficient of determination (R<sup>2</sup> = 0.999995) and a low root mean squared error (RMSE = 0.021 MBq/μA·h) on a held-out test set. It outperformed all other models, particularly simple linear models (R<sup>2</sup> < 0.5) which failed to capture the non-linear nature of the yield curves. The model closely reproduced the TTY curves for four individual reactions in trained data this study. The observed large relative errors were confined to physically insignificant, near-threshold energy regions where absolute errors were negligible. This work presents a successful application of machine learning for the prediction of thick-target yields. The results establish that data-driven models, particularly those developed through AutoML, show promise as a complementary tool for nuclear data evaluation, supporting the optimization of radionuclide production for medical applications.</div></div>","PeriodicalId":20861,"journal":{"name":"Radiation Physics and Chemistry","volume":"239 ","pages":"Article 113281"},"PeriodicalIF":2.8000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of thick-target yields for medical radionuclide production based on automated machine learning\",\"authors\":\"YanBang Tang\",\"doi\":\"10.1016/j.radphyschem.2025.113281\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate knowledge of thick-target yields (TTY) is critical for the efficient and reliable production of medical radionuclides. In this study, we developed and evaluated a suite of machine learning models to perform prediction of the TTY for four medically relevant (p,n) reactions: <sup>167</sup>Er(p,n)<sup>167</sup>Tm, <sup>58</sup>Fe(p,n)<sup>58m</sup>Co, <sup>119</sup>Sn(p,n)<sup>119</sup>Sb, and <sup>186</sup>W(p,n)<sup>186g</sup>Re. A unified dataset was curated from the IAEA's evaluated data library. Physics-informed features, including the proton, neutron, and mass numbers of both the target and product nuclides, were engineered to provide a physical context for the models. The performance of fourteen algorithms, including ensemble methods, kernel-based models, and a Automated Machine Learning (AutoML) framework, Autogluon, was systematically evaluated. The Autogluon model demonstrated strong predictive performance, achieving a high coefficient of determination (R<sup>2</sup> = 0.999995) and a low root mean squared error (RMSE = 0.021 MBq/μA·h) on a held-out test set. It outperformed all other models, particularly simple linear models (R<sup>2</sup> < 0.5) which failed to capture the non-linear nature of the yield curves. The model closely reproduced the TTY curves for four individual reactions in trained data this study. The observed large relative errors were confined to physically insignificant, near-threshold energy regions where absolute errors were negligible. This work presents a successful application of machine learning for the prediction of thick-target yields. The results establish that data-driven models, particularly those developed through AutoML, show promise as a complementary tool for nuclear data evaluation, supporting the optimization of radionuclide production for medical applications.</div></div>\",\"PeriodicalId\":20861,\"journal\":{\"name\":\"Radiation Physics and Chemistry\",\"volume\":\"239 \",\"pages\":\"Article 113281\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Radiation Physics and Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0969806X2500773X\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Radiation Physics and Chemistry","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0969806X2500773X","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
Prediction of thick-target yields for medical radionuclide production based on automated machine learning
Accurate knowledge of thick-target yields (TTY) is critical for the efficient and reliable production of medical radionuclides. In this study, we developed and evaluated a suite of machine learning models to perform prediction of the TTY for four medically relevant (p,n) reactions: 167Er(p,n)167Tm, 58Fe(p,n)58mCo, 119Sn(p,n)119Sb, and 186W(p,n)186gRe. A unified dataset was curated from the IAEA's evaluated data library. Physics-informed features, including the proton, neutron, and mass numbers of both the target and product nuclides, were engineered to provide a physical context for the models. The performance of fourteen algorithms, including ensemble methods, kernel-based models, and a Automated Machine Learning (AutoML) framework, Autogluon, was systematically evaluated. The Autogluon model demonstrated strong predictive performance, achieving a high coefficient of determination (R2 = 0.999995) and a low root mean squared error (RMSE = 0.021 MBq/μA·h) on a held-out test set. It outperformed all other models, particularly simple linear models (R2 < 0.5) which failed to capture the non-linear nature of the yield curves. The model closely reproduced the TTY curves for four individual reactions in trained data this study. The observed large relative errors were confined to physically insignificant, near-threshold energy regions where absolute errors were negligible. This work presents a successful application of machine learning for the prediction of thick-target yields. The results establish that data-driven models, particularly those developed through AutoML, show promise as a complementary tool for nuclear data evaluation, supporting the optimization of radionuclide production for medical applications.
期刊介绍:
Radiation Physics and Chemistry is a multidisciplinary journal that provides a medium for publication of substantial and original papers, reviews, and short communications which focus on research and developments involving ionizing radiation in radiation physics, radiation chemistry and radiation processing.
The journal aims to publish papers with significance to an international audience, containing substantial novelty and scientific impact. The Editors reserve the rights to reject, with or without external review, papers that do not meet these criteria. This could include papers that are very similar to previous publications, only with changed target substrates, employed materials, analyzed sites and experimental methods, report results without presenting new insights and/or hypothesis testing, or do not focus on the radiation effects.