Predictive models for posttransplant diabetes mellitus in kidney transplant recipients using machine learning and deep learning approach: a nationwide cohort study from South Korea.
Seoyoung Choi, Mi Ryung Pyo, Sangwoong Kim, Jong Cheol Jeong, Yu Ho Lee, Hyejin Mo, Jeong-Hoon Lee, Jaeseok Yang, Myoung Soo Kim, Hye Eun Yoon, Sejoong Kim
{"title":"Predictive models for posttransplant diabetes mellitus in kidney transplant recipients using machine learning and deep learning approach: a nationwide cohort study from South Korea.","authors":"Seoyoung Choi, Mi Ryung Pyo, Sangwoong Kim, Jong Cheol Jeong, Yu Ho Lee, Hyejin Mo, Jeong-Hoon Lee, Jaeseok Yang, Myoung Soo Kim, Hye Eun Yoon, Sejoong Kim","doi":"10.23876/j.krcp.24.113","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Posttransplant diabetes mellitus (PTDM) complicates kidney transplant recipients (KTRs) in morbidity and mortality. This study aimed to predict PTDM risk in KTRs using machine learning and deep learning models.</p><p><strong>Methods: </strong>Data were obtained from the Korea Organ Transplantation Registry, a nationwide cohort study of KTRs. Four machine learning algorithms, including eXtreme Gradient Boosting (XGBoost), CatBoost, light gradient boosting machine and logistic regression, and deep learning were implemented on 41 pretransplant and 31 posttransplant variables to predict PTDM. Model performance was assessed using the area under the curve (AUC) of the receiver operating characteristic curve, accuracy, precision, recall, and F1 score.</p><p><strong>Results: </strong>Among 3,213 KTRs, 497 patients (15.5%) developed PTDM within 1 year. The PTDM group had higher age, body mass index (BMI), triglyceride level, and prevalence of hypertension and cardiovascular disease, and lower total cholesterol level at baseline than the No-PTDM group. The XGBoost model showed the highest AUC (0.738) and F1 score (0.42), and modest accuracy (0.86), while the CatBoost model exhibited the highest accuracy (0.87) and precision (0.79). Feature importance in XGBoost was highest for recipient age, followed by baseline BMI, triglyceride level at posttransplant 6 months, baseline glycated hemoglobin and high-density lipoprotein cholesterol level, white blood cell (WBC) count and serum uric acid level at 6 months, baseline WBC count, and tacrolimus trough level at discharge.</p><p><strong>Conclusion: </strong>The XGBoost model demonstrated the best performance for predicting PTDM within 1 year, offering an accurate tool for early identification and personalized care of high-risk KTRs for PTDM.</p>","PeriodicalId":17716,"journal":{"name":"Kidney Research and Clinical Practice","volume":" ","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Kidney Research and Clinical Practice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.23876/j.krcp.24.113","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Posttransplant diabetes mellitus (PTDM) complicates kidney transplant recipients (KTRs) in morbidity and mortality. This study aimed to predict PTDM risk in KTRs using machine learning and deep learning models.
Methods: Data were obtained from the Korea Organ Transplantation Registry, a nationwide cohort study of KTRs. Four machine learning algorithms, including eXtreme Gradient Boosting (XGBoost), CatBoost, light gradient boosting machine and logistic regression, and deep learning were implemented on 41 pretransplant and 31 posttransplant variables to predict PTDM. Model performance was assessed using the area under the curve (AUC) of the receiver operating characteristic curve, accuracy, precision, recall, and F1 score.
Results: Among 3,213 KTRs, 497 patients (15.5%) developed PTDM within 1 year. The PTDM group had higher age, body mass index (BMI), triglyceride level, and prevalence of hypertension and cardiovascular disease, and lower total cholesterol level at baseline than the No-PTDM group. The XGBoost model showed the highest AUC (0.738) and F1 score (0.42), and modest accuracy (0.86), while the CatBoost model exhibited the highest accuracy (0.87) and precision (0.79). Feature importance in XGBoost was highest for recipient age, followed by baseline BMI, triglyceride level at posttransplant 6 months, baseline glycated hemoglobin and high-density lipoprotein cholesterol level, white blood cell (WBC) count and serum uric acid level at 6 months, baseline WBC count, and tacrolimus trough level at discharge.
Conclusion: The XGBoost model demonstrated the best performance for predicting PTDM within 1 year, offering an accurate tool for early identification and personalized care of high-risk KTRs for PTDM.
期刊介绍:
Kidney Research and Clinical Practice (formerly The Korean Journal of Nephrology; ISSN 1975-9460, launched in 1982), the official journal of the Korean Society of Nephrology, is an international, peer-reviewed journal published in English. Its ISO abbreviation is Kidney Res Clin Pract. To provide an efficient venue for dissemination of knowledge and discussion of topics related to basic renal science and clinical practice, the journal offers open access (free submission and free access) and considers articles on all aspects of clinical nephrology and hypertension as well as related molecular genetics, anatomy, pathology, physiology, pharmacology, and immunology. In particular, the journal focuses on translational renal research that helps bridging laboratory discovery with the diagnosis and treatment of human kidney disease. Topics covered include basic science with possible clinical applicability and papers on the pathophysiological basis of disease processes of the kidney. Original researches from areas of intervention nephrology or dialysis access are also welcomed. Major article types considered for publication include original research and reviews on current topics of interest. Accepted manuscripts are granted free online open-access immediately after publication, which permits its users to read, download, copy, distribute, print, search, or link to the full texts of its articles to facilitate access to a broad readership. Circulation number of print copies is 1,600.