分析特征选择技术对早期预测慢性肾功能衰竭的影响

K Hema , K. Meena , Ramaraj Pandian
{"title":"分析特征选择技术对早期预测慢性肾功能衰竭的影响","authors":"K Hema ,&nbsp;K. Meena ,&nbsp;Ramaraj Pandian","doi":"10.1016/j.ijcce.2023.12.002","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Chronic renal disease, often known as Chronic Kidney Disease (CKD), is an illness that causes a steady decline in kidney function. As per the World Health Organization survey, the incidence of CKD may increase from 10% to 13% by 2030. Because of the lack of symptoms in the initial phase, diagnosing CKD early on may be difficult. The key objective of this study is to develop a forecasting model for the early detection of chronic renal disease.</p></div><div><h3>Methods</h3><p>In medical science, Machine Learning (ML) Techniques play a significant role in disease prediction despite numerous studies conducted to categorize CKD in patients using machine learning tools. Most researchers need to analyze the impact of feature selection techniques, yielding high-quality and reliable results. The efficiency of any Techniques/Algorithms depends on feature selection, feature extraction, and classifiers. In this work, the impact of feature selection is experimented with using the Exhaustive Feature Selection (EFS) method. For the early prediction of CKD, a comparative examination of machine learning classifiers, including Gradient Boost (GB), XGBoost, Decision Tree (DT), Random Forest (RF), and KNN (k-nearest neighbors), are utilized.</p></div><div><h3>Results</h3><p>Two types of datasets, standard (New Model) &amp; real-time data sets collected from the dialysis unit of a reputed hospital in Chennai, are used to carry out extensive experiment analysis. Various metrics, including Accuracy, Precision, Recall, and F1-score, are used to tabulate the results of experiments conducted to measure the performance of the proposed approach for various combinations of test and training data.</p></div><div><h3>Conclusion</h3><p>CKD is an irreversible and silent disease; it might have a high impact on many people and begin to manifest themselves at an early age in life. This research paper analyses the effect of feature selection techniques on early CKD prediction.</p></div>","PeriodicalId":100694,"journal":{"name":"International Journal of Cognitive Computing in Engineering","volume":"5 ","pages":"Pages 66-77"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666307423000426/pdfft?md5=137c3649d2841b5bf30476563074d493&pid=1-s2.0-S2666307423000426-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Analyze the impact of feature selection techniques in the early prediction of CKD\",\"authors\":\"K Hema ,&nbsp;K. Meena ,&nbsp;Ramaraj Pandian\",\"doi\":\"10.1016/j.ijcce.2023.12.002\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><p>Chronic renal disease, often known as Chronic Kidney Disease (CKD), is an illness that causes a steady decline in kidney function. As per the World Health Organization survey, the incidence of CKD may increase from 10% to 13% by 2030. Because of the lack of symptoms in the initial phase, diagnosing CKD early on may be difficult. The key objective of this study is to develop a forecasting model for the early detection of chronic renal disease.</p></div><div><h3>Methods</h3><p>In medical science, Machine Learning (ML) Techniques play a significant role in disease prediction despite numerous studies conducted to categorize CKD in patients using machine learning tools. Most researchers need to analyze the impact of feature selection techniques, yielding high-quality and reliable results. The efficiency of any Techniques/Algorithms depends on feature selection, feature extraction, and classifiers. In this work, the impact of feature selection is experimented with using the Exhaustive Feature Selection (EFS) method. For the early prediction of CKD, a comparative examination of machine learning classifiers, including Gradient Boost (GB), XGBoost, Decision Tree (DT), Random Forest (RF), and KNN (k-nearest neighbors), are utilized.</p></div><div><h3>Results</h3><p>Two types of datasets, standard (New Model) &amp; real-time data sets collected from the dialysis unit of a reputed hospital in Chennai, are used to carry out extensive experiment analysis. Various metrics, including Accuracy, Precision, Recall, and F1-score, are used to tabulate the results of experiments conducted to measure the performance of the proposed approach for various combinations of test and training data.</p></div><div><h3>Conclusion</h3><p>CKD is an irreversible and silent disease; it might have a high impact on many people and begin to manifest themselves at an early age in life. This research paper analyses the effect of feature selection techniques on early CKD prediction.</p></div>\",\"PeriodicalId\":100694,\"journal\":{\"name\":\"International Journal of Cognitive Computing in Engineering\",\"volume\":\"5 \",\"pages\":\"Pages 66-77\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666307423000426/pdfft?md5=137c3649d2841b5bf30476563074d493&pid=1-s2.0-S2666307423000426-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Cognitive Computing in Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666307423000426\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Cognitive Computing in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666307423000426","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

背景 慢性肾脏病,又称慢性肾脏病(CKD),是一种导致肾功能持续衰退的疾病。根据世界卫生组织的调查,到 2030 年,慢性肾脏病的发病率可能会从 10%增至 13%。由于初期缺乏症状,早期诊断 CKD 可能比较困难。本研究的主要目的是开发一种用于早期检测慢性肾脏病的预测模型。方法在医学科学中,尽管有大量研究利用机器学习工具对患者的慢性肾脏病进行分类,但机器学习(ML)技术在疾病预测中发挥着重要作用。大多数研究人员需要分析特征选择技术的影响,以获得高质量和可靠的结果。任何技术/算法的效率都取决于特征选择、特征提取和分类器。在这项工作中,使用穷举特征选择(EFS)方法对特征选择的影响进行了实验。对于 CKD 的早期预测,采用了机器学习分类器的比较检查,包括梯度提升(GB)、XGBoost、决策树(DT)、随机森林(RF)和 KNN(k-近邻)。结果两种类型的数据集,标准(新模型)&;从钦奈一家知名医院透析室收集的实时数据集,用于进行广泛的实验分析。各种指标,包括准确率、精确度、召回率和 F1 分数,都被用来统计实验结果,以衡量所提方法在不同测试和训练数据组合下的性能。本研究论文分析了特征选择技术对早期 CKD 预测的影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analyze the impact of feature selection techniques in the early prediction of CKD

Background

Chronic renal disease, often known as Chronic Kidney Disease (CKD), is an illness that causes a steady decline in kidney function. As per the World Health Organization survey, the incidence of CKD may increase from 10% to 13% by 2030. Because of the lack of symptoms in the initial phase, diagnosing CKD early on may be difficult. The key objective of this study is to develop a forecasting model for the early detection of chronic renal disease.

Methods

In medical science, Machine Learning (ML) Techniques play a significant role in disease prediction despite numerous studies conducted to categorize CKD in patients using machine learning tools. Most researchers need to analyze the impact of feature selection techniques, yielding high-quality and reliable results. The efficiency of any Techniques/Algorithms depends on feature selection, feature extraction, and classifiers. In this work, the impact of feature selection is experimented with using the Exhaustive Feature Selection (EFS) method. For the early prediction of CKD, a comparative examination of machine learning classifiers, including Gradient Boost (GB), XGBoost, Decision Tree (DT), Random Forest (RF), and KNN (k-nearest neighbors), are utilized.

Results

Two types of datasets, standard (New Model) & real-time data sets collected from the dialysis unit of a reputed hospital in Chennai, are used to carry out extensive experiment analysis. Various metrics, including Accuracy, Precision, Recall, and F1-score, are used to tabulate the results of experiments conducted to measure the performance of the proposed approach for various combinations of test and training data.

Conclusion

CKD is an irreversible and silent disease; it might have a high impact on many people and begin to manifest themselves at an early age in life. This research paper analyses the effect of feature selection techniques on early CKD prediction.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
13.80
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信