使用机器学习方法检测慢性肾脏疾病

Mohammed Gollapalli, B. Saad, Jomana Alabdulkarim, Razan Sendi, Reema Alsabt, Sarah Alsharif
{"title":"使用机器学习方法检测慢性肾脏疾病","authors":"Mohammed Gollapalli, B. Saad, Jomana Alabdulkarim, Razan Sendi, Reema Alsabt, Sarah Alsharif","doi":"10.1109/CICN56167.2022.10008293","DOIUrl":null,"url":null,"abstract":"The slow progression of chronic kidney disease (CKD) makes early detection and effective treatment the only ways to prevent the mortality rates. In this study, an amalgamation of ensemble machine learning (ML) models has been leveraged in an effort to support clinicians in their goal of faster, more accurate CKD recognition and detection. By detecting and assessing the risk variables early on, patients could limit the ramifications of this disease on their health. Consequently, binary categorization is the basis of this proposed ML technique. The CKD dataset, obtained from the UCI machine learning repository was utilized in this research consisting of 400 instances and 24 attributes, which is comprised of indicators, symptoms, and risk factors. 80% of the data was used to train the model, while the remaining 20% was used for testing. While utilizing the entire set of 25 features, the CatBoost and Random Forest models outperformed and outmatched the remaining algorithms with an accuracy of 99%. The Decision Tree, Ada Boost, and SVM algorithms were then used, with a constructive accuracy rate of 98%, 98%, and 95%, respectively. Furthermore, ROC curve for the five chosen ML models was used as a significant evaluation metric to help improve and supplement our understanding of the performance of the CKD categorization challenges. The results showed that the CatBoost model is more efficient and competent in successfully and accurately classifying a patient's CKD status, with an accuracy of 99.9% when critical attributes were used.","PeriodicalId":287589,"journal":{"name":"2022 14th International Conference on Computational Intelligence and Communication Networks (CICN)","volume":"48 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Detection of Chronic Kidney Disease Using Machine Learning Approach\",\"authors\":\"Mohammed Gollapalli, B. Saad, Jomana Alabdulkarim, Razan Sendi, Reema Alsabt, Sarah Alsharif\",\"doi\":\"10.1109/CICN56167.2022.10008293\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The slow progression of chronic kidney disease (CKD) makes early detection and effective treatment the only ways to prevent the mortality rates. In this study, an amalgamation of ensemble machine learning (ML) models has been leveraged in an effort to support clinicians in their goal of faster, more accurate CKD recognition and detection. By detecting and assessing the risk variables early on, patients could limit the ramifications of this disease on their health. Consequently, binary categorization is the basis of this proposed ML technique. The CKD dataset, obtained from the UCI machine learning repository was utilized in this research consisting of 400 instances and 24 attributes, which is comprised of indicators, symptoms, and risk factors. 80% of the data was used to train the model, while the remaining 20% was used for testing. While utilizing the entire set of 25 features, the CatBoost and Random Forest models outperformed and outmatched the remaining algorithms with an accuracy of 99%. The Decision Tree, Ada Boost, and SVM algorithms were then used, with a constructive accuracy rate of 98%, 98%, and 95%, respectively. Furthermore, ROC curve for the five chosen ML models was used as a significant evaluation metric to help improve and supplement our understanding of the performance of the CKD categorization challenges. The results showed that the CatBoost model is more efficient and competent in successfully and accurately classifying a patient's CKD status, with an accuracy of 99.9% when critical attributes were used.\",\"PeriodicalId\":287589,\"journal\":{\"name\":\"2022 14th International Conference on Computational Intelligence and Communication Networks (CICN)\",\"volume\":\"48 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 14th International Conference on Computational Intelligence and Communication Networks (CICN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CICN56167.2022.10008293\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Computational Intelligence and Communication Networks (CICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICN56167.2022.10008293","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

慢性肾脏疾病(CKD)进展缓慢,早期发现和有效治疗是预防死亡率的唯一途径。在这项研究中,集成机器学习(ML)模型的合并被用来支持临床医生更快、更准确地识别和检测CKD。通过及早发现和评估风险变量,患者可以限制这种疾病对其健康的影响。因此,二元分类是该ML技术的基础。本研究使用的CKD数据集来自UCI机器学习存储库,由400个实例和24个属性组成,包括指标、症状和风险因素。80%的数据用于训练模型,剩下的20%用于测试。在利用全部25个特征时,CatBoost和Random Forest模型的表现优于其他算法,准确率达到99%。然后使用Decision Tree、Ada Boost和SVM算法,构造准确率分别为98%、98%和95%。此外,五种ML模型的ROC曲线被用作重要的评估指标,以帮助改进和补充我们对CKD分类挑战性能的理解。结果表明,CatBoost模型在成功、准确地分类患者CKD状态方面更有效、更有能力,当使用关键属性时,准确率达到99.9%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Detection of Chronic Kidney Disease Using Machine Learning Approach
The slow progression of chronic kidney disease (CKD) makes early detection and effective treatment the only ways to prevent the mortality rates. In this study, an amalgamation of ensemble machine learning (ML) models has been leveraged in an effort to support clinicians in their goal of faster, more accurate CKD recognition and detection. By detecting and assessing the risk variables early on, patients could limit the ramifications of this disease on their health. Consequently, binary categorization is the basis of this proposed ML technique. The CKD dataset, obtained from the UCI machine learning repository was utilized in this research consisting of 400 instances and 24 attributes, which is comprised of indicators, symptoms, and risk factors. 80% of the data was used to train the model, while the remaining 20% was used for testing. While utilizing the entire set of 25 features, the CatBoost and Random Forest models outperformed and outmatched the remaining algorithms with an accuracy of 99%. The Decision Tree, Ada Boost, and SVM algorithms were then used, with a constructive accuracy rate of 98%, 98%, and 95%, respectively. Furthermore, ROC curve for the five chosen ML models was used as a significant evaluation metric to help improve and supplement our understanding of the performance of the CKD categorization challenges. The results showed that the CatBoost model is more efficient and competent in successfully and accurately classifying a patient's CKD status, with an accuracy of 99.9% when critical attributes were used.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信