{"title":"Machine learning-based online web calculator predicts the risk of sarcopenic obesity in older adults.","authors":"Jiale Guo, Qionghan He, Yehai Li","doi":"10.1007/s40520-025-03024-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Sarcopenic obesity (SO) has a higher risk of adverse health events compared to having obesity or sarcopenia alone due to the dual burden of both muscle loss and fat gain. The prevalence of SO is progressively increasing as the population ages and the obesity epidemic progresses. Currently, there are no tools for predicting the risk of SO, and this study aimed to construct a well-performing prediction tool based on machine learning.</p><p><strong>Methods: </strong>The National Health and Nutritional Examination Surveys (NHANES) 1999-2004 dataset was used for the analysis, and the included data were randomly divided into training and validation sets in the ratio of 70:30. Missing data is processed using multiple interpolation technique. A 5-fold cross-validated recursive feature elimination algorithm is used to rank the importance of features, and the top three important features from each algorithm are used as the features of the model for constructing the machine learning model. Six machine learning methods, CART, GBM, KNN, LR, NNet, XGBoost, were used to develop models and evaluated for discrimination, calibration, clinical utility, and robustness. The combined best-performing model was further developed into an online web calculator for clinical applications.</p><p><strong>Results: </strong>The study had 5607 participants, and 1139 (20.3%) of them had SO, with a prevalence of 21.2% among females and 19.4% among males. Combining all the performance evaluations, the GBM-based model has the best performance, which uses age, race, and BMI as the features of the model, and its AUC values in the training and validation sets are 0.820 and 0.832, and it has good calibration, clinical utility, and robustness.</p><p><strong>Conclusion: </strong>In this study, the GBM-based model performed well, and an online web calculator constructed on the basis of the model was used to identify the risk of SO in the US community for those over the age of 60.</p>","PeriodicalId":7720,"journal":{"name":"Aging Clinical and Experimental Research","volume":"37 1","pages":"210"},"PeriodicalIF":3.4000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12241294/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Aging Clinical and Experimental Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40520-025-03024-5","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Sarcopenic obesity (SO) has a higher risk of adverse health events compared to having obesity or sarcopenia alone due to the dual burden of both muscle loss and fat gain. The prevalence of SO is progressively increasing as the population ages and the obesity epidemic progresses. Currently, there are no tools for predicting the risk of SO, and this study aimed to construct a well-performing prediction tool based on machine learning.
Methods: The National Health and Nutritional Examination Surveys (NHANES) 1999-2004 dataset was used for the analysis, and the included data were randomly divided into training and validation sets in the ratio of 70:30. Missing data is processed using multiple interpolation technique. A 5-fold cross-validated recursive feature elimination algorithm is used to rank the importance of features, and the top three important features from each algorithm are used as the features of the model for constructing the machine learning model. Six machine learning methods, CART, GBM, KNN, LR, NNet, XGBoost, were used to develop models and evaluated for discrimination, calibration, clinical utility, and robustness. The combined best-performing model was further developed into an online web calculator for clinical applications.
Results: The study had 5607 participants, and 1139 (20.3%) of them had SO, with a prevalence of 21.2% among females and 19.4% among males. Combining all the performance evaluations, the GBM-based model has the best performance, which uses age, race, and BMI as the features of the model, and its AUC values in the training and validation sets are 0.820 and 0.832, and it has good calibration, clinical utility, and robustness.
Conclusion: In this study, the GBM-based model performed well, and an online web calculator constructed on the basis of the model was used to identify the risk of SO in the US community for those over the age of 60.
期刊介绍:
Aging clinical and experimental research offers a multidisciplinary forum on the progressing field of gerontology and geriatrics. The areas covered by the journal include: biogerontology, neurosciences, epidemiology, clinical gerontology and geriatric assessment, social, economical and behavioral gerontology. “Aging clinical and experimental research” appears bimonthly and publishes review articles, original papers and case reports.