{"title":"利用SNP数据、多基因风险评分和电子健康记录,应用机器学习算法预测狼疮性肾炎。","authors":"Chih-Wei Chung, Seng-Cho Chou, Chung-Mao Kao, Yen-Ju Chen, Tzu-Hung Hsiao, Yi-Ming Chen","doi":"10.1177/14604582251363510","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Lupus nephritis (LN) flares raise the risks of renal failure and mortality in systemic lupus erythematosus (SLE) patients, making risk stratification and individualized care crucial. Our goal was to develop machine learning (ML) models to predict LN flares.</p><p><strong>Methods: </strong>A total of 1546 SLE patients were enrolled from a hospital-based cohort. Electronic health record (EHR), single nucleotide polymorphism (SNP), and polygenic risk score (PRS) were combined to construct ML models. SHapley Additive exPlanation (SHAP) values were calculated to assess each feature's contribution.</p><p><strong>Results: </strong>Within 5 years, 448 patients developed LN. Of the 686,354 SNPs, 375 were used for PRS computation. The model combining EHR, SNP, and PRS achieved the highest AUROC of 0.9512 and AUPRC of 0.8902 in validation, while the XGB-based hybrid model reached an AUPRC of 0.9021 in testing. The SHAP summary plot highlighted the top 20 features predicting LN flares.</p><p><strong>Conclusions: </strong>This hybrid model combining SNP, PRS, and EHR predicts active LN and requires validation.</p>","PeriodicalId":55069,"journal":{"name":"Health Informatics Journal","volume":"31 3","pages":"14604582251363510"},"PeriodicalIF":2.3000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Application of machine learning algorithm for the prediction of lupus nephritis using SNP data, polygenic risk score, and electronic health record.\",\"authors\":\"Chih-Wei Chung, Seng-Cho Chou, Chung-Mao Kao, Yen-Ju Chen, Tzu-Hung Hsiao, Yi-Ming Chen\",\"doi\":\"10.1177/14604582251363510\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Lupus nephritis (LN) flares raise the risks of renal failure and mortality in systemic lupus erythematosus (SLE) patients, making risk stratification and individualized care crucial. Our goal was to develop machine learning (ML) models to predict LN flares.</p><p><strong>Methods: </strong>A total of 1546 SLE patients were enrolled from a hospital-based cohort. Electronic health record (EHR), single nucleotide polymorphism (SNP), and polygenic risk score (PRS) were combined to construct ML models. SHapley Additive exPlanation (SHAP) values were calculated to assess each feature's contribution.</p><p><strong>Results: </strong>Within 5 years, 448 patients developed LN. Of the 686,354 SNPs, 375 were used for PRS computation. The model combining EHR, SNP, and PRS achieved the highest AUROC of 0.9512 and AUPRC of 0.8902 in validation, while the XGB-based hybrid model reached an AUPRC of 0.9021 in testing. The SHAP summary plot highlighted the top 20 features predicting LN flares.</p><p><strong>Conclusions: </strong>This hybrid model combining SNP, PRS, and EHR predicts active LN and requires validation.</p>\",\"PeriodicalId\":55069,\"journal\":{\"name\":\"Health Informatics Journal\",\"volume\":\"31 3\",\"pages\":\"14604582251363510\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Informatics Journal\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/14604582251363510\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/8/6 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Informatics Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/14604582251363510","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/8/6 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
Application of machine learning algorithm for the prediction of lupus nephritis using SNP data, polygenic risk score, and electronic health record.
Background: Lupus nephritis (LN) flares raise the risks of renal failure and mortality in systemic lupus erythematosus (SLE) patients, making risk stratification and individualized care crucial. Our goal was to develop machine learning (ML) models to predict LN flares.
Methods: A total of 1546 SLE patients were enrolled from a hospital-based cohort. Electronic health record (EHR), single nucleotide polymorphism (SNP), and polygenic risk score (PRS) were combined to construct ML models. SHapley Additive exPlanation (SHAP) values were calculated to assess each feature's contribution.
Results: Within 5 years, 448 patients developed LN. Of the 686,354 SNPs, 375 were used for PRS computation. The model combining EHR, SNP, and PRS achieved the highest AUROC of 0.9512 and AUPRC of 0.8902 in validation, while the XGB-based hybrid model reached an AUPRC of 0.9021 in testing. The SHAP summary plot highlighted the top 20 features predicting LN flares.
Conclusions: This hybrid model combining SNP, PRS, and EHR predicts active LN and requires validation.
期刊介绍:
Health Informatics Journal is an international peer-reviewed journal. All papers submitted to Health Informatics Journal are subject to peer review by members of a carefully appointed editorial board. The journal operates a conventional single-blind reviewing policy in which the reviewer’s name is always concealed from the submitting author.