Md Nakib Hayat Chowdhury , Mamun Bin Ibne Reaz , Sawal Hamid Md Ali , María Liz Crespo , Shamim Ahmad , Ghassan Maan Salim , Fahmida Haque , Luis Guillermo García Ordóñez , Md. Johirul Islam , Taher Muhammad Mahdee , Kh Shahriya Zaman , Md Shahriar Khan Hemel , Mohammad Arif Sobhan Bhuiyan
{"title":"Deep learning for early detection of chronic kidney disease stages in diabetes patients: A TabNet approach","authors":"Md Nakib Hayat Chowdhury , Mamun Bin Ibne Reaz , Sawal Hamid Md Ali , María Liz Crespo , Shamim Ahmad , Ghassan Maan Salim , Fahmida Haque , Luis Guillermo García Ordóñez , Md. Johirul Islam , Taher Muhammad Mahdee , Kh Shahriya Zaman , Md Shahriar Khan Hemel , Mohammad Arif Sobhan Bhuiyan","doi":"10.1016/j.artmed.2025.103153","DOIUrl":null,"url":null,"abstract":"<div><div>Chronic kidney disease (CKD) poses a significant risk for diabetes patients, often leading to severe complications. Early and accurate CKD stage detection is crucial for timely intervention. However, it remains challenging due to its asymptomatic progression, the oversight of routine CKD tests during diabetes checkups, and limited access to nephrologists. This study aimed to address these challenges by developing a multiclass CKD stage prediction model for diabetes patients using longitudinal data from the Chronic Renal Insufficiency Cohort (CRIC) study. A novel iterative backward feature selection strategy was employed to determine key predictors of the CKD stage. TabNet, an attention-based deep learning architecture, was used to build classification models in complete and simplified categories. The complete model used 31 features, including complex kidney biomarkers, while the simplified model used 15 features readily available from routine checkups. The performance of TabNet was compared against traditional tree-based ensemble methods (XGBoost, random forest, AdaBoost) and a multi-layer perceptron. Model-specific and model-agnostic explainable AI (XAI) techniques were applied to interpret model decisions, enhancing the transparency and clinical applicability of the proposed approach. The TabNet models demonstrated superior performance, achieving 94.06 % and 92.71 % accuracy in cross-validation for the complete and simplified models, respectively, and 91.00 % and 88.00 % accuracy on test sets. XAI analysis identified serum creatinine, cystatin C, sex, and age as the most influential factors in CKD stage classification. The proposed TabNet models offer a robust approach for early CKD severity detection in diabetes patients, potentially improving clinical decision-making and patient outcomes.</div></div>","PeriodicalId":55458,"journal":{"name":"Artificial Intelligence in Medicine","volume":"166 ","pages":"Article 103153"},"PeriodicalIF":6.2000,"publicationDate":"2025-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0933365725000880","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Chronic kidney disease (CKD) poses a significant risk for diabetes patients, often leading to severe complications. Early and accurate CKD stage detection is crucial for timely intervention. However, it remains challenging due to its asymptomatic progression, the oversight of routine CKD tests during diabetes checkups, and limited access to nephrologists. This study aimed to address these challenges by developing a multiclass CKD stage prediction model for diabetes patients using longitudinal data from the Chronic Renal Insufficiency Cohort (CRIC) study. A novel iterative backward feature selection strategy was employed to determine key predictors of the CKD stage. TabNet, an attention-based deep learning architecture, was used to build classification models in complete and simplified categories. The complete model used 31 features, including complex kidney biomarkers, while the simplified model used 15 features readily available from routine checkups. The performance of TabNet was compared against traditional tree-based ensemble methods (XGBoost, random forest, AdaBoost) and a multi-layer perceptron. Model-specific and model-agnostic explainable AI (XAI) techniques were applied to interpret model decisions, enhancing the transparency and clinical applicability of the proposed approach. The TabNet models demonstrated superior performance, achieving 94.06 % and 92.71 % accuracy in cross-validation for the complete and simplified models, respectively, and 91.00 % and 88.00 % accuracy on test sets. XAI analysis identified serum creatinine, cystatin C, sex, and age as the most influential factors in CKD stage classification. The proposed TabNet models offer a robust approach for early CKD severity detection in diabetes patients, potentially improving clinical decision-making and patient outcomes.
期刊介绍:
Artificial Intelligence in Medicine publishes original articles from a wide variety of interdisciplinary perspectives concerning the theory and practice of artificial intelligence (AI) in medicine, medically-oriented human biology, and health care.
Artificial intelligence in medicine may be characterized as the scientific discipline pertaining to research studies, projects, and applications that aim at supporting decision-based medical tasks through knowledge- and/or data-intensive computer-based solutions that ultimately support and improve the performance of a human care provider.