Anna Drynda, Jacek Podlewski, Karolina Kucharczyk, Grzegorz Sokołowski, Anna Sowa-Staszczak, Alicja Hubalewska-Dydejczyk, Małgorzata Trofimiuk-Müldner
{"title":"Evaluation of multiple machine learning models predicting the results of hybrid imaging in primary hyperparathyroidism.","authors":"Anna Drynda, Jacek Podlewski, Karolina Kucharczyk, Grzegorz Sokołowski, Anna Sowa-Staszczak, Alicja Hubalewska-Dydejczyk, Małgorzata Trofimiuk-Müldner","doi":"10.5603/nmr.105377","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Primary hyperparathyroidism (PHP) diagnosis is based on abnormalities in biochemical blood tests. Preoperative localization of the affected gland with imaging may increase the effectiveness of the surgical treatment. The aim of this study is to evaluate predictive strategies for the assessment of radiotracer uptake in pre-operative [99mTc]Tc-sestamibi scintigraphy ([99mTc] Tc-MIBI SPECT-CT) among PHP patients to identify individuals with a high probability of negative results, and to develop clinical decision-making tools.</p><p><strong>Material and methods: </strong>Development and evaluation of logistic regression (LR), classification trees utilizing the classification and regression trees (CART) algorithm, random forest (RF), and boosted trees employing XGBoost (XGB) predictive models. All models were constructed using data obtained from 499 patients diagnosed with PHP who underwent [99mTc]Tc-MIBI SPECT-CT imaging between 2010 and 2022 at the University Hospital in Cracow, Poland.</p><p><strong>Results: </strong>The LR model demonstrated the best out-of-sample performance, achieving a specificity of 81.3% and an accuracy of 69.3%, with a sensitivity of 55.7%. Along with CART and XGB, LR performed well when using only 5 predictors: concentrations of parathormone (PTH), serum calcium, serum phosphates, total serum vitamin D, and maximal lesion diameter measured in ultrasound. Random forest (RF) exhibited higher sensitivity (62.7%), but lower specificity (74.2%) and accuracy (68.6%). Other models demonstrated subpar performance.</p><p><strong>Conclusions: </strong>Logistic regression and RF models were the most effective in predicting radiotracer uptake in pre-operative hybrid imaging of the parathyroids, suggesting their suitability as the foundation for software to be used in clinical settings. However, opting for the CART model, despite its easier interpretation, would come at the expense of performance.</p>","PeriodicalId":520725,"journal":{"name":"Nuclear medicine review. Central & Eastern Europe","volume":"28 0","pages":"47-54"},"PeriodicalIF":0.7000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nuclear medicine review. Central & Eastern Europe","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5603/nmr.105377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Primary hyperparathyroidism (PHP) diagnosis is based on abnormalities in biochemical blood tests. Preoperative localization of the affected gland with imaging may increase the effectiveness of the surgical treatment. The aim of this study is to evaluate predictive strategies for the assessment of radiotracer uptake in pre-operative [99mTc]Tc-sestamibi scintigraphy ([99mTc] Tc-MIBI SPECT-CT) among PHP patients to identify individuals with a high probability of negative results, and to develop clinical decision-making tools.
Material and methods: Development and evaluation of logistic regression (LR), classification trees utilizing the classification and regression trees (CART) algorithm, random forest (RF), and boosted trees employing XGBoost (XGB) predictive models. All models were constructed using data obtained from 499 patients diagnosed with PHP who underwent [99mTc]Tc-MIBI SPECT-CT imaging between 2010 and 2022 at the University Hospital in Cracow, Poland.
Results: The LR model demonstrated the best out-of-sample performance, achieving a specificity of 81.3% and an accuracy of 69.3%, with a sensitivity of 55.7%. Along with CART and XGB, LR performed well when using only 5 predictors: concentrations of parathormone (PTH), serum calcium, serum phosphates, total serum vitamin D, and maximal lesion diameter measured in ultrasound. Random forest (RF) exhibited higher sensitivity (62.7%), but lower specificity (74.2%) and accuracy (68.6%). Other models demonstrated subpar performance.
Conclusions: Logistic regression and RF models were the most effective in predicting radiotracer uptake in pre-operative hybrid imaging of the parathyroids, suggesting their suitability as the foundation for software to be used in clinical settings. However, opting for the CART model, despite its easier interpretation, would come at the expense of performance.