Tran Thi Ngan, Dang Huong Tra, Ngo Thi Quynh Mai, Hoang Van Dung, Nguyen Van Khai, Pham Van Linh, Nguyen Thi Thu Phuong
Title: Developing a machine learning-based predictive model for levothyroxine dosage estimation in hypothyroid patients: a retrospective study
Journal: Frontiers in Endocrinology, volume 16, article 1415206
Publication date: 2025-03-14
Publication type: Journal Article
DOI: 10.3389/fendo.2025.1415206 (https://doi.org/10.3389/fendo.2025.1415206)
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11949781/pdf/
Abstract
Hypothyroidism is a common endocrine disorder whose incidence is higher in women and increases with age. Levothyroxine (LT4) is the standard therapy; however, achieving clinical and biochemical euthyroidism is challenging, so an accurate model for predicting LT4 dosage is needed. This retrospective study aimed to identify factors affecting the daily LT4 dose and to develop a dose-estimation model for hypothyroidism, based on an analysis of electronic medical records from a cohort of 1,864 patients. Univariate analysis was conducted to explore the relationships between clinical and non-clinical variables, including weight, sex, age, body mass index (BMI), diastolic blood pressure, comorbidities, food effects, drug-drug interactions, liver function, serum albumin, and thyroid-stimulating hormone (TSH) levels. Among the models tested, the Extra Trees Regressor (ETR) demonstrated the highest predictive accuracy, achieving an R² of 87.37% and the lowest mean absolute error, 9.4 mcg (95% CI: 7.7-11.2), in the test set. Other ensemble models, including Random Forest and Gradient Boosting, also performed well (R² > 80%). Feature importance analysis identified BMI (0.516 ± 0.015) as the most influential predictor, followed by comorbidities (0.120 ± 0.010) and age (0.080 ± 0.005). The findings underscore the potential of machine learning to refine LT4 dose estimation by incorporating diverse clinical factors beyond traditional weight-based approaches. The model provides a foundation for personalized LT4 dosing, which could improve treatment precision and reduce the risk of under- or over-medication. Further validation in external cohorts is essential to confirm its clinical applicability.
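The two test-set metrics quoted in the abstract, R² and mean absolute error (MAE) with a 95% confidence interval, can be illustrated with a minimal Python sketch. This is not the authors' pipeline: the dose values below are invented for illustration, the function names are hypothetical, and the percentile bootstrap is only one assumed way to obtain an interval for MAE, since the abstract does not state how its interval was computed. Note that an R² of 87.37% corresponds to 0.8737 on the 0-1 scale used here.

```python
import random
from statistics import mean


def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between observed and predicted doses (mcg)."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def bootstrap_mae_ci(y_true, y_pred, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for MAE.

    Assumption: the abstract does not say how its 95% CI was derived;
    resampling patients with replacement is one common choice.
    """
    rng = random.Random(seed)
    n = len(y_true)
    stats = []
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]
        stats.append(mean(abs(y_true[i] - y_pred[i]) for i in idx))
    stats.sort()
    lower = stats[int((alpha / 2) * n_resamples)]
    upper = stats[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


# Invented daily doses in mcg, for illustration only (not study data).
observed = [50.0, 75.0, 100.0, 125.0, 88.0, 112.0, 62.5, 150.0]
predicted = [55.0, 70.0, 104.0, 118.0, 90.0, 120.0, 60.0, 141.0]

mae = mean_absolute_error(observed, predicted)
r2 = r_squared(observed, predicted)
ci_low, ci_high = bootstrap_mae_ci(observed, predicted)
print(f"MAE = {mae:.2f} mcg, R^2 = {r2:.4f}, "
      f"95% CI for MAE = ({ci_low:.2f}, {ci_high:.2f})")
```

With a fitted regressor, `predicted` would instead hold the model's outputs for held-out patients; the fixed `seed` keeps the bootstrap interval reproducible between runs.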
About the journal:
Frontiers in Endocrinology is a field journal of the "Frontiers in" journal series.
In today’s world, endocrinology is becoming increasingly important as it underlies many of the challenges societies face - from obesity and diabetes to reproduction, population control and aging. Endocrinology covers a broad field from basic molecular and cellular communication through to clinical care and some of the most crucial public health issues. The journal, thus, welcomes outstanding contributions in any domain of endocrinology.
Frontiers in Endocrinology publishes articles on the most outstanding discoveries across a wide research spectrum of Endocrinology. The mission of Frontiers in Endocrinology is to bring all relevant Endocrinology areas together on a single platform.