Prediction of gastrointestinal bleeding hospitalization risk in hemodialysis using machine learning
John W Larkin, Suman Lama, Sheetal Chaudhuri, Joanna Willetts, Anke C Winter, Yue Jiao, Manuela Stauss-Grabo, Len A Usvyat, Jeffrey L Hymes, Franklin W Maddux, David C Wheeler, Peter Stenvinkel, Jürgen Floege
BMC Nephrology. Published 2024-10-19. DOI: https://doi.org/10.1186/s12882-024-03809-2
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11490046/pdf/
Citation count: 0
Abstract
Background: Gastrointestinal bleeding (GIB) is a clinical challenge in kidney failure. The INSPIRE group assessed whether machine learning could determine a hemodialysis (HD) patient's 180-day GIB hospitalization risk.
Methods: An eXtreme Gradient Boosting (XGBoost) model and a logistic regression model were developed using an HD dataset from the United States (2017-2020). Patient data were randomly split (50% training, 30% validation, and 20% testing). HD treatments ≤ 180 days before a GIB hospitalization were classified as positive observations; all others were negative. The models considered 1,303 exposures/covariates. Performance was measured using unseen testing data.
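The split and labeling rules in the Methods can be illustrated with a minimal Python sketch. It is not the authors' code: the function names `split_patients` and `label_treatment` are hypothetical, the split is assumed to be made per patient (so all of a patient's treatments land in one partition), and a treatment on the admission day itself is assumed to count as positive. The abstract does not specify either detail.

```python
import random
from datetime import date


def split_patients(patient_ids, seed=0, fractions=(0.5, 0.3, 0.2)):
    """Randomly assign patients to training, validation, and testing sets.

    Assumption: the split is done per patient, not per treatment.
    """
    ids = sorted(set(patient_ids))
    rng = random.Random(seed)  # seeded so the split is reproducible
    rng.shuffle(ids)
    n_train = round(fractions[0] * len(ids))
    n_val = round(fractions[1] * len(ids))
    return {
        "train": set(ids[:n_train]),
        "validation": set(ids[n_train:n_train + n_val]),
        "test": set(ids[n_train + n_val:]),  # remainder, about 20%
    }


def label_treatment(treatment_date, gib_admission_dates, window_days=180):
    """Return 1 if the treatment is <= window_days before any GIB admission."""
    for admission in gib_admission_dates:
        days_before = (admission - treatment_date).days
        # Assumption: day 0 (same-day admission) counts as positive.
        if 0 <= days_before <= window_days:
            return 1
    return 0


if __name__ == "__main__":
    splits = split_patients(range(1000), seed=42)
    print({name: len(members) for name, members in splits.items()})
    print(label_treatment(date(2019, 1, 1), [date(2019, 6, 1)]))
```

Splitting by patient rather than by treatment keeps repeated observations of the same person out of both the training and testing sets, which would otherwise inflate the apparent performance.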
Results: The incidence of 180-day GIB hospitalization was 1.18% in the HD population (n = 451,579) and 1.12% in the testing dataset (n = 38,853). XGBoost showed an area under the receiver operating characteristic curve (AUROC) of 0.74 (95% confidence interval (CI) 0.72, 0.76), versus 0.68 (95% CI 0.66, 0.71) for logistic regression. Sensitivity and specificity were 65.3% (60.9, 69.7) and 68.0% (67.6, 68.5) for XGBoost versus 68.9% (64.7, 73.0) and 57.0% (56.5, 57.5) for logistic regression, respectively. Associations with exposures were consistent between the models for many factors. Both models showed that GIB hospitalization risk was associated with older age, disturbances in anemia/iron indices, recent all-cause hospitalizations, and bone mineral metabolism markers. XGBoost assigned high importance in outcome prediction to serum 25-hydroxy (25OH) vitamin D levels, while logistic regression assigned high importance to parathyroid hormone (PTH) levels.
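For readers unfamiliar with the reported metrics, the sketch below shows one common way to compute AUROC, sensitivity, and specificity from predicted risk scores. It is an illustration only: the abstract does not state the classification threshold or how the confidence intervals were derived, so the Wald (normal-approximation) interval and the function names here are assumptions, and the toy labels and scores are made up.

```python
import math


def auroc(labels, scores):
    """AUROC from the Mann-Whitney U statistic, using average ranks for ties."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tied group
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    rank_sum_pos = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def proportion_with_wald_ci(successes, total, z=1.96):
    """Proportion with a Wald 95% CI (assumed method), clipped to [0, 1]."""
    p = successes / total
    half_width = z * math.sqrt(p * (1 - p) / total)
    return p, max(0.0, p - half_width), min(1.0, p + half_width)


def sensitivity_specificity(labels, scores, threshold):
    """Sensitivity and specificity, each as (estimate, lower, upper)."""
    tp = sum(1 for y, s in zip(labels, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(labels, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(labels, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(labels, scores) if y == 0 and s >= threshold)
    return (proportion_with_wald_ci(tp, tp + fn),
            proportion_with_wald_ci(tn, tn + fp))


if __name__ == "__main__":
    # Toy data, not from the study.
    y_true = [0, 0, 1, 1]
    y_score = [0.1, 0.4, 0.35, 0.8]
    print(auroc(y_true, y_score))
    print(sensitivity_specificity(y_true, y_score, threshold=0.5))
```

With an outcome incidence near 1%, the number of positive observations is small relative to negatives, which is consistent with the sensitivity intervals in the Results being much wider than the specificity intervals.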
Conclusions: Machine learning can be considered for early detection of GIB event risk in HD. XGBoost outperformed logistic regression, yet both appear suitable. External and prospective validation of these models is needed. The association between bone mineral metabolism markers and GIB events was unexpected and warrants investigation.
Trial registration: This retrospective analysis of real-world data was not a prospective clinical trial and registration is not applicable.
About the journal:
BMC Nephrology is an open access journal publishing original peer-reviewed research articles in all aspects of the prevention, diagnosis and management of kidney and associated disorders, as well as related molecular genetics, pathophysiology, and epidemiology.