Yuchen Gao, Chunrong Wang, Wenhao Dong, Bianfang Li, Jianhui Wang, Jun Li, Yu Tian, Jia Liu, Yuefu Wang
{"title":"An Explainable Machine Learning Model to Predict Acute Kidney Injury After Cardiac Surgery: A Retrospective Cohort Study","authors":"Yuchen Gao, Chunrong Wang, Wenhao Dong, Bianfang Li, Jianhui Wang, Jun Li, Yu Tian, Jia Liu, Yuefu Wang","doi":"10.2147/clep.s404580","DOIUrl":null,"url":null,"abstract":"<strong>Background:</strong> To derive and validate a machine learning (ML) prediction model of acute kidney injury (AKI) that could be used for AKI surveillance and management to improve clinical outcomes.<br/><strong>Methods:</strong> This retrospective cohort study was conducted in Fuwai Hospital, including patients aged 18 years and above undergoing cardiac surgery admitted between January 1, 2017, and December 31, 2018. Seventy percent of the observations were randomly selected for training and the remaining 30% for testing. The demographics, comorbidities, laboratory examination parameters, and operation details were used to construct a prediction model for AKI by logistic regression and eXtreme gradient boosting (Xgboost). The discrimination of each model was assessed on the test cohort by the area under the receiver operator characteristic (AUROC) curve, while calibration was performed by the calibration plot.<br/><strong>Results:</strong> A total of 15,880 patients were enrolled in this study, and 4845 (30.5%) had developed AKI. Xgboost model had the higher discriminative ability compared with logistic regression (AUROC, 0.849 [95% CI, 0.837– 0.861] vs 0.803[95% CI 0.790– 0.817], <em>P</em>< 0.001) in the test dataset. The estimated glomerular filtration (eGFR) and creatine on intensive care unit (ICU) arrival are the two most important prediction parameters. A SHAP summary plot was used to illustrate the effects of the top 15 features attributed to the Xgboost model.<br/><strong>Conclusion:</strong> ML models can provide clinical decision support to determine which patients should focus on perioperative preventive treatment to preemptively reduce acute kidney injury by predicting which patients are not at risk.<br/><br/>","PeriodicalId":10362,"journal":{"name":"Clinical Epidemiology","volume":"64 1","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2147/clep.s404580","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Background: To derive and validate a machine learning (ML) prediction model of acute kidney injury (AKI) that could be used for AKI surveillance and management to improve clinical outcomes. Methods: This retrospective cohort study was conducted in Fuwai Hospital, including patients aged 18 years and above undergoing cardiac surgery admitted between January 1, 2017, and December 31, 2018. Seventy percent of the observations were randomly selected for training and the remaining 30% for testing. The demographics, comorbidities, laboratory examination parameters, and operation details were used to construct a prediction model for AKI by logistic regression and eXtreme gradient boosting (Xgboost). The discrimination of each model was assessed on the test cohort by the area under the receiver operator characteristic (AUROC) curve, while calibration was performed by the calibration plot. Results: A total of 15,880 patients were enrolled in this study, and 4845 (30.5%) had developed AKI. Xgboost model had the higher discriminative ability compared with logistic regression (AUROC, 0.849 [95% CI, 0.837– 0.861] vs 0.803[95% CI 0.790– 0.817], P< 0.001) in the test dataset. The estimated glomerular filtration (eGFR) and creatine on intensive care unit (ICU) arrival are the two most important prediction parameters. A SHAP summary plot was used to illustrate the effects of the top 15 features attributed to the Xgboost model. Conclusion: ML models can provide clinical decision support to determine which patients should focus on perioperative preventive treatment to preemptively reduce acute kidney injury by predicting which patients are not at risk.
期刊介绍:
Clinical Epidemiology is an international, peer reviewed, open access journal. Clinical Epidemiology focuses on the application of epidemiological principles and questions relating to patients and clinical care in terms of prevention, diagnosis, prognosis, and treatment.
Clinical Epidemiology welcomes papers covering these topics in form of original research and systematic reviews.
Clinical Epidemiology has a special interest in international electronic medical patient records and other routine health care data, especially as applied to safety of medical interventions, clinical utility of diagnostic procedures, understanding short- and long-term clinical course of diseases, clinical epidemiological and biostatistical methods, and systematic reviews.
When considering submission of a paper utilizing publicly-available data, authors should ensure that such studies add significantly to the body of knowledge and that they use appropriate validated methods for identifying health outcomes.
The journal has launched special series describing existing data sources for clinical epidemiology, international health care systems and validation studies of algorithms based on databases and registries.