Jie Zhou, Hao Wu, Linge Zhang, Qiaona Zhang, Jie Wang, Hang Zhao, Yongqi Dang, Shiyu Zhang, Lu Li
{"title":"Development and validation of machine learning predictive models for assessing dialysis adequacy in dialysis patients.","authors":"Jie Zhou, Hao Wu, Linge Zhang, Qiaona Zhang, Jie Wang, Hang Zhao, Yongqi Dang, Shiyu Zhang, Lu Li","doi":"10.1177/03913988251355082","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>The assessment of dialysis adequacy is of great clinical importance. However, it depends on the nonlinear effects of numerous confounding factors and is therefore difficult to predict using traditional statistical methods. In this study, we used Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator Regression (LASSO) to assess dialysis adequacy.</p><p><strong>Methods: </strong>A training set (70%) and a test set (30%) were randomly selected from the 264 dialysis patient case records collected for this study. We compared the machine learning models with statistical logistic regression prediction models. In addition, we performed fivefold cross internal validation and external validation.</p><p><strong>Results: </strong>The test dataset showed sensitivity values of 0.86 (95% CI = 0.75-0.96), 0.81 (95% CI = 0.69-0.93), and 0.72 (95% CI = 0.58-0.85) for the RF, XGBoost, and LASSO models, respectively. The matched specificity was 0.73 (95% CI = 0.58-0.87), 0.81 (95% CI = 0.67-0.93), and 0.83 (95% CI = 0.71-0.95). Accuracy was 0.80 (95% CI = 0.71-0.89), 0.81 (95% CI = 0.72-0.90), and 0.77 (95% CI = 0.68-0.86). <i>F</i>1 scores were 0.83 (95% CI = 0.72-0.90), 0.82 (95% CI = 0.73-0.91), and 0.78 (95% CI = 0.67-0.87). The receiver operating characteristic curves (AUROC) were 0.88 (<i>p</i> < 0.05, 95% CI = 0.70-0.88), 0.86 (<i>p</i> = 0.12, 95% CI = 0.72-0.90), and 0.88 (<i>p</i> < 0.05, 95% CI = 0.69-0.86). The mean absolute errors (MAE) of the calibration curves were 0.15, 0.11, and 0.07. In addition, the decision curve analysis (DCA) showed wide intervals of net clinical benefit for the models.</p><p><strong>Conclusion: </strong>Machine learning can be used to predict dialysis adequacy for optimal RF performance.</p>","PeriodicalId":13932,"journal":{"name":"International Journal of Artificial Organs","volume":" ","pages":"557-565"},"PeriodicalIF":1.3000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Artificial Organs","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/03913988251355082","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/28 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: The assessment of dialysis adequacy is of great clinical importance. However, it depends on the nonlinear effects of numerous confounding factors and is therefore difficult to predict using traditional statistical methods. In this study, we used Random Forest (RF), Extreme Gradient Boosting (XGBoost), and Least Absolute Shrinkage and Selection Operator Regression (LASSO) to assess dialysis adequacy.
Methods: A training set (70%) and a test set (30%) were randomly selected from the 264 dialysis patient case records collected for this study. We compared the machine learning models with statistical logistic regression prediction models. In addition, we performed fivefold cross internal validation and external validation.
Results: The test dataset showed sensitivity values of 0.86 (95% CI = 0.75-0.96), 0.81 (95% CI = 0.69-0.93), and 0.72 (95% CI = 0.58-0.85) for the RF, XGBoost, and LASSO models, respectively. The matched specificity was 0.73 (95% CI = 0.58-0.87), 0.81 (95% CI = 0.67-0.93), and 0.83 (95% CI = 0.71-0.95). Accuracy was 0.80 (95% CI = 0.71-0.89), 0.81 (95% CI = 0.72-0.90), and 0.77 (95% CI = 0.68-0.86). F1 scores were 0.83 (95% CI = 0.72-0.90), 0.82 (95% CI = 0.73-0.91), and 0.78 (95% CI = 0.67-0.87). The receiver operating characteristic curves (AUROC) were 0.88 (p < 0.05, 95% CI = 0.70-0.88), 0.86 (p = 0.12, 95% CI = 0.72-0.90), and 0.88 (p < 0.05, 95% CI = 0.69-0.86). The mean absolute errors (MAE) of the calibration curves were 0.15, 0.11, and 0.07. In addition, the decision curve analysis (DCA) showed wide intervals of net clinical benefit for the models.
Conclusion: Machine learning can be used to predict dialysis adequacy for optimal RF performance.
期刊介绍:
The International Journal of Artificial Organs (IJAO) publishes peer-reviewed research and clinical, experimental and theoretical, contributions to the field of artificial, bioartificial and tissue-engineered organs. The mission of the IJAO is to foster the development and optimization of artificial, bioartificial and tissue-engineered organs, for implantation or use in procedures, to treat functional deficits of all human tissues and organs.