Zhengbiao Hu, Shuangshan Bu, Kai Wang, Qianqian Cao, Huanhuan Zheng, Jie Yang, Shanshan Chen, Yuemeng Wu, Wenkai Ren, Chenlei He
{"title":"使用机器学习模型对原发性肾小球肾炎进行分类:重点关注IgA肾病预测。","authors":"Zhengbiao Hu, Shuangshan Bu, Kai Wang, Qianqian Cao, Huanhuan Zheng, Jie Yang, Shanshan Chen, Yuemeng Wu, Wenkai Ren, Chenlei He","doi":"10.1186/s12882-025-04253-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>IgA nephropathy (IgAN) is the most common form of glomerulonephritis worldwide, characterized by immune complex deposition in the glomerular mesangium, leading to mesangial hypercellularity, persistent microhematuria, proteinuria, and progressive renal impairment. Given its common occurrence, diagnosis normally involves renal biopsy, with its accompanying risks of bleeding and infection. In this study, multiple machine learning algorithms were used to develop a non-invasive and improved model for the diagnosis of IgAN.</p><p><strong>Materials and methods: </strong>This retrospective study included 292 patients with IgAN and 310 individuals with different nephropathies, utilizing 82 clinical variables, with kidney pathology results serving as ML labels. A random forest (RF) regression model addressed missing values. Subjects were divided into a development set (n = 542) and a test set (n = 60). The RF method was applied to select 17 key features for building diagnostic models, including the RF model, support vector machine (SVM), adaptive boosting (ADB), and traditional doctor judgment. Performance was evaluated using accuracy, sensitivity, specificity, and area under the curve (AUC) from receiver operating characteristic (ROC) analyses.</p><p><strong>Results: </strong>The random forest model performed best with an accuracy of 82.3% and an AUC of 0.89 on the test set, outstripping SVM with an AUC of 0.82 and ADB with an AUC of 0.88. High urinary protein, low serum albumin, and elevated IgG levels were the primary features correlated with IgAN.</p><p><strong>Conclusion: </strong>In this study, a non-invasive diagnostic model for IgAN was developed, with RF line and superior accuracy and clinical applicability. This further highlights the potential of ML-based approaches in reducing reliance on invasive procedures and providing opportunities for early IgAN diagnosis.</p>","PeriodicalId":9089,"journal":{"name":"BMC Nephrology","volume":"26 1","pages":"289"},"PeriodicalIF":2.2000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12186348/pdf/","citationCount":"0","resultStr":"{\"title\":\"Classification of primary glomerulonephritis using machine learning models: a focus on IgA nephropathy prediction.\",\"authors\":\"Zhengbiao Hu, Shuangshan Bu, Kai Wang, Qianqian Cao, Huanhuan Zheng, Jie Yang, Shanshan Chen, Yuemeng Wu, Wenkai Ren, Chenlei He\",\"doi\":\"10.1186/s12882-025-04253-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>IgA nephropathy (IgAN) is the most common form of glomerulonephritis worldwide, characterized by immune complex deposition in the glomerular mesangium, leading to mesangial hypercellularity, persistent microhematuria, proteinuria, and progressive renal impairment. Given its common occurrence, diagnosis normally involves renal biopsy, with its accompanying risks of bleeding and infection. In this study, multiple machine learning algorithms were used to develop a non-invasive and improved model for the diagnosis of IgAN.</p><p><strong>Materials and methods: </strong>This retrospective study included 292 patients with IgAN and 310 individuals with different nephropathies, utilizing 82 clinical variables, with kidney pathology results serving as ML labels. A random forest (RF) regression model addressed missing values. Subjects were divided into a development set (n = 542) and a test set (n = 60). The RF method was applied to select 17 key features for building diagnostic models, including the RF model, support vector machine (SVM), adaptive boosting (ADB), and traditional doctor judgment. Performance was evaluated using accuracy, sensitivity, specificity, and area under the curve (AUC) from receiver operating characteristic (ROC) analyses.</p><p><strong>Results: </strong>The random forest model performed best with an accuracy of 82.3% and an AUC of 0.89 on the test set, outstripping SVM with an AUC of 0.82 and ADB with an AUC of 0.88. High urinary protein, low serum albumin, and elevated IgG levels were the primary features correlated with IgAN.</p><p><strong>Conclusion: </strong>In this study, a non-invasive diagnostic model for IgAN was developed, with RF line and superior accuracy and clinical applicability. This further highlights the potential of ML-based approaches in reducing reliance on invasive procedures and providing opportunities for early IgAN diagnosis.</p>\",\"PeriodicalId\":9089,\"journal\":{\"name\":\"BMC Nephrology\",\"volume\":\"26 1\",\"pages\":\"289\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12186348/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Nephrology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12882-025-04253-6\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"UROLOGY & NEPHROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Nephrology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12882-025-04253-6","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"UROLOGY & NEPHROLOGY","Score":null,"Total":0}
Classification of primary glomerulonephritis using machine learning models: a focus on IgA nephropathy prediction.
Objective: IgA nephropathy (IgAN) is the most common form of glomerulonephritis worldwide, characterized by immune complex deposition in the glomerular mesangium, leading to mesangial hypercellularity, persistent microhematuria, proteinuria, and progressive renal impairment. Given its common occurrence, diagnosis normally involves renal biopsy, with its accompanying risks of bleeding and infection. In this study, multiple machine learning algorithms were used to develop a non-invasive and improved model for the diagnosis of IgAN.
Materials and methods: This retrospective study included 292 patients with IgAN and 310 individuals with different nephropathies, utilizing 82 clinical variables, with kidney pathology results serving as ML labels. A random forest (RF) regression model addressed missing values. Subjects were divided into a development set (n = 542) and a test set (n = 60). The RF method was applied to select 17 key features for building diagnostic models, including the RF model, support vector machine (SVM), adaptive boosting (ADB), and traditional doctor judgment. Performance was evaluated using accuracy, sensitivity, specificity, and area under the curve (AUC) from receiver operating characteristic (ROC) analyses.
Results: The random forest model performed best with an accuracy of 82.3% and an AUC of 0.89 on the test set, outstripping SVM with an AUC of 0.82 and ADB with an AUC of 0.88. High urinary protein, low serum albumin, and elevated IgG levels were the primary features correlated with IgAN.
Conclusion: In this study, a non-invasive diagnostic model for IgAN was developed, with RF line and superior accuracy and clinical applicability. This further highlights the potential of ML-based approaches in reducing reliance on invasive procedures and providing opportunities for early IgAN diagnosis.
期刊介绍:
BMC Nephrology is an open access journal publishing original peer-reviewed research articles in all aspects of the prevention, diagnosis and management of kidney and associated disorders, as well as related molecular genetics, pathophysiology, and epidemiology.