{"title":"Chronic kidney disease prediction using different machine learning models","authors":"Apoorva Pravin Datir, Snehal Shivaji Funde, Nikita Tanaii Bhore, S. Gawande, Pallavi Dhade","doi":"10.1109/ICETET-SIP-2254415.2022.9791838","DOIUrl":null,"url":null,"abstract":"A kidney's major purpose is to eliminate waste materials and excess fluids from the body through urine, which helps to maintain a stable chemical equilibrium in the body. Chronic kidney disease (CKD) is a serious global concern that is defined by a steady decline of kidney function over time. CKD affects over 14% of the world's population and is difficult to identify in its early stages. This disease is usually detected at the final or most critical stage in the human body, posing a significant risk to the human body and often resulting in the person's death. If the condition is identified early on, the patient's kidney function may be saved, allowing him or her to live a longer life. Machine learning has progressed to the point that we can now examine the medical records of individuals and detect chronic kidney disease in its early stages. On the CKD dataset from the UCI machine learning repository, this research examines the occurrence of CKD by creating ML models with 6 distinct classification algorithms. Before we can use machine learning techniques on the raw dataset, we must first process it and remove any duplicated or null variables before sending it to the models. After running the data through all of the models, it was observed that Random Forest and Extra Trees Classifier proved the highest accuracy of 98.33. The literature survey conducted before execution offered valuable insights and helped to shorten the execution time because we only chose algorithms with good accuracy.","PeriodicalId":117229,"journal":{"name":"2022 10th International Conference on Emerging Trends in Engineering and Technology - Signal and Information Processing (ICETET-SIP-22)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th International Conference on Emerging Trends in Engineering and Technology - Signal and Information Processing (ICETET-SIP-22)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICETET-SIP-2254415.2022.9791838","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
A kidney's major purpose is to eliminate waste materials and excess fluids from the body through urine, which helps to maintain a stable chemical equilibrium in the body. Chronic kidney disease (CKD) is a serious global concern that is defined by a steady decline of kidney function over time. CKD affects over 14% of the world's population and is difficult to identify in its early stages. This disease is usually detected at the final or most critical stage in the human body, posing a significant risk to the human body and often resulting in the person's death. If the condition is identified early on, the patient's kidney function may be saved, allowing him or her to live a longer life. Machine learning has progressed to the point that we can now examine the medical records of individuals and detect chronic kidney disease in its early stages. On the CKD dataset from the UCI machine learning repository, this research examines the occurrence of CKD by creating ML models with 6 distinct classification algorithms. Before we can use machine learning techniques on the raw dataset, we must first process it and remove any duplicated or null variables before sending it to the models. After running the data through all of the models, it was observed that Random Forest and Extra Trees Classifier proved the highest accuracy of 98.33. The literature survey conducted before execution offered valuable insights and helped to shorten the execution time because we only chose algorithms with good accuracy.
肾脏的主要作用是通过尿液排出体内的废物和多余的液体,这有助于维持体内稳定的化学平衡。慢性肾脏疾病(CKD)是一个严重的全球性问题,其定义是肾功能随着时间的推移而稳步下降。慢性肾病影响了世界上14%以上的人口,在早期阶段很难确诊。这种疾病通常在人体的最后或最关键阶段被发现,对人体构成重大风险,往往导致人死亡。如果病情及早发现,病人的肾脏功能可能会得到挽救,从而使他或她活得更长。机器学习已经发展到我们现在可以检查个人的医疗记录并在早期发现慢性肾脏疾病的程度。在UCI机器学习存储库的CKD数据集上,本研究通过创建具有6种不同分类算法的ML模型来检查CKD的发生。在我们可以在原始数据集上使用机器学习技术之前,我们必须首先处理它,并在将其发送到模型之前删除任何重复或空变量。在所有模型中运行数据后,观察到Random Forest和Extra Trees Classifier的准确率最高,为98.33。在执行前进行的文献调查提供了有价值的见解,并有助于缩短执行时间,因为我们只选择精度好的算法。