Jie Zhou, Hao Wu, Linge Zhang, Qiaona Zhang, Jie Wang, Hang Zhao, Yongqi Dang, Shiyu Zhang, Lu Li
{"title":"用于评估透析患者透析充分性的机器学习预测模型的开发和验证。","authors":"Jie Zhou, Hao Wu, Linge Zhang, Qiaona Zhang, Jie Wang, Hang Zhao, Yongqi Dang, Shiyu Zhang, Lu Li","doi":"10.1177/03913988251355082","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The assessment of dialysis adequacy is of great clinical importance. However, it depends on the nonlinear effects of numerous confounding factors and is therefore difficult to predict using traditional statistical methods. In this study, we used Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator Regression (LASSO) to assess dialysis adequacy.</p><p><strong>Methods: </strong>A training set (70%) and a test set (30%) were randomly selected from the 264 dialysis patient case records collected for this study. We compared the machine learning models with statistical logistic regression prediction models. In addition, we performed fivefold cross internal validation and external validation.</p><p><strong>Results: </strong>The test dataset showed sensitivity values of 0.86 (95% CI = 0.75-0.96), 0.81 (95% CI = 0.69-0.93), and 0.72 (95% CI = 0.58-0.85) for the RF, XGBoost, and LASSO models, respectively. The matched specificity was 0.73 (95% CI = 0.58-0.87), 0.81 (95% CI = 0.67-0.93), and 0.83 (95% CI = 0.71-0.95). Accuracy was 0.80 (95% CI = 0.71-0.89), 0.81 (95% CI = 0.72-0.90), and 0.77 (95% CI = 0.68-0.86). <i>F</i>1 scores were 0.83 (95% CI = 0.72-0.90), 0.82 (95% CI = 0.73-0.91), and 0.78 (95% CI = 0.67-0.87). The receiver operating characteristic curves (AUROC) were 0.88 (<i>p</i> < 0.05, 95% CI = 0.70-0.88), 0.86 (<i>p</i> = 0.12, 95% CI = 0.72-0.90), and 0.88 (<i>p</i> < 0.05, 95% CI = 0.69-0.86). The mean absolute errors (MAE) of the calibration curves were 0.15, 0.11, and 0.07. In addition, the decision curve analysis (DCA) showed wide intervals of net clinical benefit for the models.</p><p><strong>Conclusion: </strong>Machine learning can be used to predict dialysis adequacy for optimal RF performance.</p>","PeriodicalId":13932,"journal":{"name":"International Journal of Artificial Organs","volume":" ","pages":"557-565"},"PeriodicalIF":1.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Development and validation of machine learning predictive models for assessing dialysis adequacy in dialysis patients.\",\"authors\":\"Jie Zhou, Hao Wu, Linge Zhang, Qiaona Zhang, Jie Wang, Hang Zhao, Yongqi Dang, Shiyu Zhang, Lu Li\",\"doi\":\"10.1177/03913988251355082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>The assessment of dialysis adequacy is of great clinical importance. However, it depends on the nonlinear effects of numerous confounding factors and is therefore difficult to predict using traditional statistical methods. In this study, we used Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator Regression (LASSO) to assess dialysis adequacy.</p><p><strong>Methods: </strong>A training set (70%) and a test set (30%) were randomly selected from the 264 dialysis patient case records collected for this study. We compared the machine learning models with statistical logistic regression prediction models. In addition, we performed fivefold cross internal validation and external validation.</p><p><strong>Results: </strong>The test dataset showed sensitivity values of 0.86 (95% CI = 0.75-0.96), 0.81 (95% CI = 0.69-0.93), and 0.72 (95% CI = 0.58-0.85) for the RF, XGBoost, and LASSO models, respectively. The matched specificity was 0.73 (95% CI = 0.58-0.87), 0.81 (95% CI = 0.67-0.93), and 0.83 (95% CI = 0.71-0.95). Accuracy was 0.80 (95% CI = 0.71-0.89), 0.81 (95% CI = 0.72-0.90), and 0.77 (95% CI = 0.68-0.86). <i>F</i>1 scores were 0.83 (95% CI = 0.72-0.90), 0.82 (95% CI = 0.73-0.91), and 0.78 (95% CI = 0.67-0.87). The receiver operating characteristic curves (AUROC) were 0.88 (<i>p</i> < 0.05, 95% CI = 0.70-0.88), 0.86 (<i>p</i> = 0.12, 95% CI = 0.72-0.90), and 0.88 (<i>p</i> < 0.05, 95% CI = 0.69-0.86). The mean absolute errors (MAE) of the calibration curves were 0.15, 0.11, and 0.07. In addition, the decision curve analysis (DCA) showed wide intervals of net clinical benefit for the models.</p><p><strong>Conclusion: </strong>Machine learning can be used to predict dialysis adequacy for optimal RF performance.</p>\",\"PeriodicalId\":13932,\"journal\":{\"name\":\"International Journal of Artificial Organs\",\"volume\":\" \",\"pages\":\"557-565\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Artificial Organs\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1177/03913988251355082\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/7/28 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"ENGINEERING, BIOMEDICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Artificial Organs","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/03913988251355082","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/28 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
摘要
目的:透析充分性评价具有重要的临床意义。然而,它依赖于许多混杂因素的非线性影响,因此很难用传统的统计方法来预测。在本研究中,我们使用随机森林(RF)、极端梯度增强(XGBoost)和最小绝对收缩和选择算子回归(LASSO)来评估透析充分性。方法:从本研究收集的264例透析患者病例记录中随机抽取一个训练集(70%)和一个测试集(30%)。我们将机器学习模型与统计逻辑回归预测模型进行了比较。此外,我们进行了五重交叉内部验证和外部验证。结果:测试数据集显示RF、XGBoost和LASSO模型的灵敏度值分别为0.86 (95% CI = 0.75-0.96)、0.81 (95% CI = 0.69-0.93)和0.72 (95% CI = 0.58-0.85)。匹配特异性分别为0.73 (95% CI = 0.58-0.87)、0.81 (95% CI = 0.67-0.93)和0.83 (95% CI = 0.71-0.95)。精度为0.80 (95% CI = 0.71 - -0.89), 0.81 (95% CI = 0.72 - -0.90),和0.77 (95% CI -0.86 = 0.68)。F1得分分别为0.83 (95% CI = 0.72 - -0.90), 0.82 (95% CI = 0.73 - -0.91),和0.78 (95% CI -0.87 = 0.67)。受试者工作特征曲线(AUROC)分别为0.88 (p = 0.12, 95% CI = 0.72-0.90)和0.88 (p)。结论:机器学习可用于预测透析充分性以获得最佳射频性能。
Development and validation of machine learning predictive models for assessing dialysis adequacy in dialysis patients.
Purpose: The assessment of dialysis adequacy is of great clinical importance. However, it depends on the nonlinear effects of numerous confounding factors and is therefore difficult to predict using traditional statistical methods. In this study, we used Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator Regression (LASSO) to assess dialysis adequacy.
Methods: A training set (70%) and a test set (30%) were randomly selected from the 264 dialysis patient case records collected for this study. We compared the machine learning models with statistical logistic regression prediction models. In addition, we performed fivefold cross internal validation and external validation.
Results: The test dataset showed sensitivity values of 0.86 (95% CI = 0.75-0.96), 0.81 (95% CI = 0.69-0.93), and 0.72 (95% CI = 0.58-0.85) for the RF, XGBoost, and LASSO models, respectively. The matched specificity was 0.73 (95% CI = 0.58-0.87), 0.81 (95% CI = 0.67-0.93), and 0.83 (95% CI = 0.71-0.95). Accuracy was 0.80 (95% CI = 0.71-0.89), 0.81 (95% CI = 0.72-0.90), and 0.77 (95% CI = 0.68-0.86). F1 scores were 0.83 (95% CI = 0.72-0.90), 0.82 (95% CI = 0.73-0.91), and 0.78 (95% CI = 0.67-0.87). The receiver operating characteristic curves (AUROC) were 0.88 (p < 0.05, 95% CI = 0.70-0.88), 0.86 (p = 0.12, 95% CI = 0.72-0.90), and 0.88 (p < 0.05, 95% CI = 0.69-0.86). The mean absolute errors (MAE) of the calibration curves were 0.15, 0.11, and 0.07. In addition, the decision curve analysis (DCA) showed wide intervals of net clinical benefit for the models.
Conclusion: Machine learning can be used to predict dialysis adequacy for optimal RF performance.
期刊介绍:
The International Journal of Artificial Organs (IJAO) publishes peer-reviewed research and clinical, experimental and theoretical, contributions to the field of artificial, bioartificial and tissue-engineered organs. The mission of the IJAO is to foster the development and optimization of artificial, bioartificial and tissue-engineered organs, for implantation or use in procedures, to treat functional deficits of all human tissues and organs.