{"title":"Application of machine learning algorithm for the prediction of lupus nephritis using SNP data, polygenic risk score, and electronic health record.","authors":"Chih-Wei Chung, Seng-Cho Chou, Chung-Mao Kao, Yen-Ju Chen, Tzu-Hung Hsiao, Yi-Ming Chen","doi":"10.1177/14604582251363510","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Lupus nephritis (LN) flares raise the risks of renal failure and mortality in systemic lupus erythematosus (SLE) patients, making risk stratification and individualized care crucial. Our goal was to develop machine learning (ML) models to predict LN flares.</p><p><strong>Methods: </strong>A total of 1546 SLE patients were enrolled from a hospital-based cohort. Electronic health record (EHR), single nucleotide polymorphism (SNP), and polygenic risk score (PRS) were combined to construct ML models. SHapley Additive exPlanation (SHAP) values were calculated to assess each feature's contribution.</p><p><strong>Results: </strong>Within 5 years, 448 patients developed LN. Of the 686,354 SNPs, 375 were used for PRS computation. The model combining EHR, SNP, and PRS achieved the highest AUROC of 0.9512 and AUPRC of 0.8902 in validation, while the XGB-based hybrid model reached an AUPRC of 0.9021 in testing. The SHAP summary plot highlighted the top 20 features predicting LN flares.</p><p><strong>Conclusions: </strong>This hybrid model combining SNP, PRS, and EHR predicts active LN and requires validation.</p>","PeriodicalId":55069,"journal":{"name":"Health Informatics Journal","volume":"31 3","pages":"14604582251363510"},"PeriodicalIF":2.3000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Informatics Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/14604582251363510","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/6 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Lupus nephritis (LN) flares raise the risks of renal failure and mortality in systemic lupus erythematosus (SLE) patients, making risk stratification and individualized care crucial. Our goal was to develop machine learning (ML) models to predict LN flares.
Methods: A total of 1546 SLE patients were enrolled from a hospital-based cohort. Electronic health record (EHR), single nucleotide polymorphism (SNP), and polygenic risk score (PRS) were combined to construct ML models. SHapley Additive exPlanation (SHAP) values were calculated to assess each feature's contribution.
Results: Within 5 years, 448 patients developed LN. Of the 686,354 SNPs, 375 were used for PRS computation. The model combining EHR, SNP, and PRS achieved the highest AUROC of 0.9512 and AUPRC of 0.8902 in validation, while the XGB-based hybrid model reached an AUPRC of 0.9021 in testing. The SHAP summary plot highlighted the top 20 features predicting LN flares.
Conclusions: This hybrid model combining SNP, PRS, and EHR predicts active LN and requires validation.
期刊介绍:
Health Informatics Journal is an international peer-reviewed journal. All papers submitted to Health Informatics Journal are subject to peer review by members of a carefully appointed editorial board. The journal operates a conventional single-blind reviewing policy in which the reviewer’s name is always concealed from the submitting author.