L. Muflikhah, Amira G. Nurfansepta, Fitra A. Bachtiar, Dian E. Ratnawati
{"title":"利用堆叠回归的集合学习法高效预测糖尿病肾病","authors":"L. Muflikhah, Amira G. Nurfansepta, Fitra A. Bachtiar, Dian E. Ratnawati","doi":"10.3991/ijoe.v20i08.48387","DOIUrl":null,"url":null,"abstract":"Diabetes may lead to several problems, one of the most prevalent and deadly of which is diabetic nephropathy. Therefore, the condition represents a significant threat to one’s health since it has the potential to cause irreversible harm to the kidneys’ ability to operate. A significant portion of the research that is being conducted now is focused on determining how accurately diabetic people may be predicted to develop kidney illness. Considering this, the research suggests a regression stacking approach for predicting albumin levels. These albumin values will serve as a reference for the incidence of diabetic nephropathy disease. They will be derived from the medical records of patients. The utilization of stacking regression from three different ensemble approaches, using Random Forest and CatBoost regressors, while the Huber algorithm is used as a meta-learner. The accuracy with which the combination of parameters that are employed is determined is a significant factor. It contributes to the high degree of performance that the ensemble approach achieves. Therefore, in this investigation, a grid search was carried out to tune the hyperparameters of both regressor models. We evaluated the performance of the proposed model using accuracy, MAPE, RMSE, and MSE values. The experimental findings demonstrate great performance. Three selected variables including quantitative UACR, semi-quantitative UACR, and urinary creatinine, achieved high performance. Overall, the performance obtained an accuracy rate of more than 98% with an error rate (MAPE, RMSE, and MSE values) of less than 1%. In conclusion, the stack regressor model can be implemented to predict diabetic nephropathy using clinical datasets.","PeriodicalId":507997,"journal":{"name":"International Journal of Online and Biomedical Engineering (iJOE)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"High Performance for Predicting Diabetic Nephropathy Using Stacking Regression of Ensemble Learning Method\",\"authors\":\"L. Muflikhah, Amira G. Nurfansepta, Fitra A. Bachtiar, Dian E. Ratnawati\",\"doi\":\"10.3991/ijoe.v20i08.48387\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diabetes may lead to several problems, one of the most prevalent and deadly of which is diabetic nephropathy. Therefore, the condition represents a significant threat to one’s health since it has the potential to cause irreversible harm to the kidneys’ ability to operate. A significant portion of the research that is being conducted now is focused on determining how accurately diabetic people may be predicted to develop kidney illness. Considering this, the research suggests a regression stacking approach for predicting albumin levels. These albumin values will serve as a reference for the incidence of diabetic nephropathy disease. They will be derived from the medical records of patients. The utilization of stacking regression from three different ensemble approaches, using Random Forest and CatBoost regressors, while the Huber algorithm is used as a meta-learner. The accuracy with which the combination of parameters that are employed is determined is a significant factor. It contributes to the high degree of performance that the ensemble approach achieves. Therefore, in this investigation, a grid search was carried out to tune the hyperparameters of both regressor models. We evaluated the performance of the proposed model using accuracy, MAPE, RMSE, and MSE values. The experimental findings demonstrate great performance. Three selected variables including quantitative UACR, semi-quantitative UACR, and urinary creatinine, achieved high performance. Overall, the performance obtained an accuracy rate of more than 98% with an error rate (MAPE, RMSE, and MSE values) of less than 1%. In conclusion, the stack regressor model can be implemented to predict diabetic nephropathy using clinical datasets.\",\"PeriodicalId\":507997,\"journal\":{\"name\":\"International Journal of Online and Biomedical Engineering (iJOE)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-05-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Online and Biomedical Engineering (iJOE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3991/ijoe.v20i08.48387\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Online and Biomedical Engineering (iJOE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3991/ijoe.v20i08.48387","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
High Performance for Predicting Diabetic Nephropathy Using Stacking Regression of Ensemble Learning Method
Diabetes may lead to several problems, one of the most prevalent and deadly of which is diabetic nephropathy. Therefore, the condition represents a significant threat to one’s health since it has the potential to cause irreversible harm to the kidneys’ ability to operate. A significant portion of the research that is being conducted now is focused on determining how accurately diabetic people may be predicted to develop kidney illness. Considering this, the research suggests a regression stacking approach for predicting albumin levels. These albumin values will serve as a reference for the incidence of diabetic nephropathy disease. They will be derived from the medical records of patients. The utilization of stacking regression from three different ensemble approaches, using Random Forest and CatBoost regressors, while the Huber algorithm is used as a meta-learner. The accuracy with which the combination of parameters that are employed is determined is a significant factor. It contributes to the high degree of performance that the ensemble approach achieves. Therefore, in this investigation, a grid search was carried out to tune the hyperparameters of both regressor models. We evaluated the performance of the proposed model using accuracy, MAPE, RMSE, and MSE values. The experimental findings demonstrate great performance. Three selected variables including quantitative UACR, semi-quantitative UACR, and urinary creatinine, achieved high performance. Overall, the performance obtained an accuracy rate of more than 98% with an error rate (MAPE, RMSE, and MSE values) of less than 1%. In conclusion, the stack regressor model can be implemented to predict diabetic nephropathy using clinical datasets.