Intelligent predictive risk assessment and management of sarcopenia in chronic disease patients using machine learning and a web-based tool.

IF 2.8 3区医学 Q2 MEDICINE, RESEARCH & EXPERIMENTAL

European Journal of Medical Research Pub Date : 2025-04-29 DOI:10.1186/s40001-025-02606-3

Ke Rong, Gu Li Jiang Yi Ke Ran, Changgui Zhou, Xinglin Yi

{"title":"Intelligent predictive risk assessment and management of sarcopenia in chronic disease patients using machine learning and a web-based tool.","authors":"Ke Rong, Gu Li Jiang Yi Ke Ran, Changgui Zhou, Xinglin Yi","doi":"10.1186/s40001-025-02606-3","DOIUrl":null,"url":null,"abstract":"Background: Individuals with chronic diseases are at higher risk of sarcopenia, and precise prediction is essential for its prevention. This study aims to develop a risk scoring model using longitudinal data to predict the probability of sarcopenia in this population over next 3-5 years, thereby enabling early warning and intervention.Methods: Using data from a nationwide survey initiated in 2011, we selected patient data records from wave 1 (2011-2012) and follow-up data from wave 3 (2015-2016) as the study cohort. Retrospective data collection included demographic information, health conditions, and biochemical markers. After excluding records with missing values, a total of 2891 adults with chronic conditions were enrolled. Sarcopenia was assessed based on the Asian Working Group for Sarcopenia (AWGS) 2019 guidelines. A generalized linear mixed model (GLMM) with random effects and diverse machine learning models were utilized to explore feature contributions to sarcopenia risk. The Recursive Feature Elimination (RFE) algorithm was employed to optimize the full Multilayer Perceptron (MLP) model and develop an online application tool.Results: Among total population, 580 (20.1%) individuals were diagnosed with sarcopenia in wave 1 (2011-2012), and 638 (22.1%) were diagnosed in wave 3 (2015-2016), while 2165 (74.9%) individuals were not diagnosed with sarcopenia across the study period. MLP model, performed better than other three classic machine learning models, demonstrated a ROC AUC of 0.912, a PR AUC of 0.401, a sensitivity of 0.875, a specificity of 0.844, a Kappa value of 0.376, and an F1 score of 0.44. According to MLP model-based SHapley Additive exPlanations (SHAP) scoring, weight, age, BMI, height, total cholesterol, PEF, and gender were identified as the most important features of chronic disease individuals for sarcopenia. Using the RFE algorithm, we selected six key variables-weight, age, BMI, height, total cholesterol, and gender-achieving an ROC AUC of about 0.9 for the online application tool.Conclusion: We developed an MLP machine learning model that incorporates only six easily accessible variables, enabling the prediction of sarcopenia risk in individuals with chronic diseases. Additionally, we created a practical online application tool to assist in decision-making and streamline clinical assessments.","PeriodicalId":11949,"journal":{"name":"European Journal of Medical Research","volume":"30 1","pages":"345"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12039279/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s40001-025-02606-3","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Individuals with chronic diseases are at higher risk of sarcopenia, and precise prediction is essential for its prevention. This study aims to develop a risk scoring model using longitudinal data to predict the probability of sarcopenia in this population over next 3-5 years, thereby enabling early warning and intervention.

Methods: Using data from a nationwide survey initiated in 2011, we selected patient data records from wave 1 (2011-2012) and follow-up data from wave 3 (2015-2016) as the study cohort. Retrospective data collection included demographic information, health conditions, and biochemical markers. After excluding records with missing values, a total of 2891 adults with chronic conditions were enrolled. Sarcopenia was assessed based on the Asian Working Group for Sarcopenia (AWGS) 2019 guidelines. A generalized linear mixed model (GLMM) with random effects and diverse machine learning models were utilized to explore feature contributions to sarcopenia risk. The Recursive Feature Elimination (RFE) algorithm was employed to optimize the full Multilayer Perceptron (MLP) model and develop an online application tool.

Results: Among total population, 580 (20.1%) individuals were diagnosed with sarcopenia in wave 1 (2011-2012), and 638 (22.1%) were diagnosed in wave 3 (2015-2016), while 2165 (74.9%) individuals were not diagnosed with sarcopenia across the study period. MLP model, performed better than other three classic machine learning models, demonstrated a ROC AUC of 0.912, a PR AUC of 0.401, a sensitivity of 0.875, a specificity of 0.844, a Kappa value of 0.376, and an F1 score of 0.44. According to MLP model-based SHapley Additive exPlanations (SHAP) scoring, weight, age, BMI, height, total cholesterol, PEF, and gender were identified as the most important features of chronic disease individuals for sarcopenia. Using the RFE algorithm, we selected six key variables-weight, age, BMI, height, total cholesterol, and gender-achieving an ROC AUC of about 0.9 for the online application tool.

Conclusion: We developed an MLP machine learning model that incorporates only six easily accessible variables, enabling the prediction of sarcopenia risk in individuals with chronic diseases. Additionally, we created a practical online application tool to assist in decision-making and streamline clinical assessments.

查看原文本刊更多论文

使用机器学习和基于网络的工具对慢性疾病患者肌肉减少症进行智能预测风险评估和管理。

背景：患有慢性疾病的个体发生肌肉减少症的风险较高，准确的预测对其预防至关重要。本研究旨在利用纵向数据建立风险评分模型，预测该人群在未来3-5年内发生肌肉减少症的概率，从而实现早期预警和干预。方法：采用2011年开始的全国调查数据，选取第1期（2011-2012年）患者资料记录和第3期（2015-2016年）随访资料作为研究队列。回顾性数据收集包括人口统计信息、健康状况和生化指标。在排除缺失值的记录后，共有2891名患有慢性疾病的成年人被纳入研究。肌肉减少症是根据亚洲肌肉减少症工作组（AWGS） 2019指南进行评估的。利用具有随机效应和多种机器学习模型的广义线性混合模型（GLMM）来探索特征对肌肉减少症风险的贡献。采用递归特征消除（RFE）算法对全多层感知器（MLP）模型进行了优化，并开发了在线应用工具。结果：在总人口中，580人（20.1%）在第1波（2011-2012年）被诊断为肌肉减少症，638人（22.1%）在第3波（2015-2016年）被诊断为肌肉减少症，而2165人（74.9%）在整个研究期间未被诊断为肌肉减少症。MLP模型的ROC AUC为0.912，PR AUC为0.401，灵敏度为0.875，特异性为0.844，Kappa值为0.376，F1评分为0.44，优于其他三种经典机器学习模型。根据基于MLP模型的SHapley加性解释（SHAP）评分，体重、年龄、BMI、身高、总胆固醇、PEF和性别被确定为慢性肌肉减少症患者的最重要特征。使用RFE算法，我们选择了六个关键变量——体重、年龄、BMI、身高、总胆固醇和性别，为在线应用工具实现了约0.9的ROC AUC。结论：我们开发了一个MLP机器学习模型，该模型仅包含6个易于获取的变量，能够预测慢性疾病患者肌肉减少症的风险。此外，我们创建了一个实用的在线应用工具，以协助决策和简化临床评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

European Journal of Medical Research 医学-医学：研究与实验

CiteScore

3.20

自引率

0.00%

发文量

247

审稿时长

>12 weeks

期刊介绍： European Journal of Medical Research publishes translational and clinical research of international interest across all medical disciplines, enabling clinicians and other researchers to learn about developments and innovations within these disciplines and across the boundaries between disciplines. The journal publishes high quality research and reviews and aims to ensure that the results of all well-conducted research are published, regardless of their outcome.