Comparative anatomization of data mining and fuzzy logic techniques used in diabetes prognosis

Harshil Thakkar , Vaishnavi Shah , Hiteshri Yagnik , Manan Shah
{"title":"Comparative anatomization of data mining and fuzzy logic techniques used in diabetes prognosis","authors":"Harshil Thakkar ,&nbsp;Vaishnavi Shah ,&nbsp;Hiteshri Yagnik ,&nbsp;Manan Shah","doi":"10.1016/j.ceh.2020.11.001","DOIUrl":null,"url":null,"abstract":"<div><p>Diabetes is an ailment in which glucose level increase in at high rates in blood due to body’s inability to metabolize it. This happens when body does not produce sufficient amount of insulin or it does not respond to it properly. Critical and long-term health issues arise if diabetes is not handled or properly treated which includes: heart problems, disorders of the lungs, skin and liver complications, nerve damage, etc. With increasing number of diabetic patients, its early detection becomes essential. In this paper, our major focus areas are data mining and fuzzy logic techniques used in diabetes diagnosis. Data mining is used for locating patterns in huge datasets using a composition of different methods of machine learning, database manipulations and statistics. Data mining offers a lot of methods to inspect large data considering the expected outcome to find the hidden knowledge. Fuzzy logic is similar to human reasoning system and hence it can handle the uncertainties found in the data of medical diagnosis. These systems are called expert systems. The fuzzy expert systems (FES) analyze the knowledge from the available data which might be vague and suggests linguistic concept with huge approximation as its core to medical texts. In this paper, the methodology section delivers the pipeline of various tasks such as selecting the dataset, preprocessing the data by applying numerous methods such as standardization, normalization etc. After that, feature extraction technique is implemented on the dataset for improving the accuracy and finally dataset worked on data mining and fuzzy logic various classification algorithms. While analyzing different data mining methods, the accuracy computed through random forest classifiers as high as 99.7% and in case of numerous fuzzy logic approaches, high precision and low complexity was found to contribute a fairly high accuracy of 96%.</p></div>","PeriodicalId":100268,"journal":{"name":"Clinical eHealth","volume":"4 ","pages":"Pages 12-23"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.ceh.2020.11.001","citationCount":"48","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical eHealth","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2588914120300113","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 48

Abstract

Diabetes is an ailment in which glucose level increase in at high rates in blood due to body’s inability to metabolize it. This happens when body does not produce sufficient amount of insulin or it does not respond to it properly. Critical and long-term health issues arise if diabetes is not handled or properly treated which includes: heart problems, disorders of the lungs, skin and liver complications, nerve damage, etc. With increasing number of diabetic patients, its early detection becomes essential. In this paper, our major focus areas are data mining and fuzzy logic techniques used in diabetes diagnosis. Data mining is used for locating patterns in huge datasets using a composition of different methods of machine learning, database manipulations and statistics. Data mining offers a lot of methods to inspect large data considering the expected outcome to find the hidden knowledge. Fuzzy logic is similar to human reasoning system and hence it can handle the uncertainties found in the data of medical diagnosis. These systems are called expert systems. The fuzzy expert systems (FES) analyze the knowledge from the available data which might be vague and suggests linguistic concept with huge approximation as its core to medical texts. In this paper, the methodology section delivers the pipeline of various tasks such as selecting the dataset, preprocessing the data by applying numerous methods such as standardization, normalization etc. After that, feature extraction technique is implemented on the dataset for improving the accuracy and finally dataset worked on data mining and fuzzy logic various classification algorithms. While analyzing different data mining methods, the accuracy computed through random forest classifiers as high as 99.7% and in case of numerous fuzzy logic approaches, high precision and low complexity was found to contribute a fairly high accuracy of 96%.

数据挖掘与模糊逻辑技术在糖尿病预后中的比较分析
糖尿病是一种由于身体无法代谢葡萄糖而导致血液中葡萄糖水平快速升高的疾病。当身体不能产生足够数量的胰岛素或不能对胰岛素做出适当的反应时,就会发生这种情况。如果糖尿病得不到适当处理或治疗,就会出现严重和长期的健康问题,包括:心脏问题、肺部疾病、皮肤和肝脏并发症、神经损伤等。随着糖尿病患者数量的不断增加,早期发现糖尿病变得至关重要。在本文中,我们主要关注的领域是数据挖掘和模糊逻辑技术在糖尿病诊断中的应用。数据挖掘是利用机器学习、数据库操作和统计等不同方法的组合,在庞大的数据集中定位模式。数据挖掘提供了多种方法,可以根据预期结果对大数据进行检查,从而发现隐藏的知识。模糊逻辑类似于人类的推理系统,因此它可以处理医疗诊断数据中的不确定性。这些系统被称为专家系统。模糊专家系统(FES)从现有的数据中对模糊的知识进行分析,提出具有巨大近似值的语言概念作为医学文本的核心。在本文中,方法论部分提供了各种任务的流水线,例如选择数据集,通过应用多种方法(如标准化,规范化等)对数据进行预处理。然后对数据集进行特征提取技术以提高准确率,最后对数据集进行数据挖掘和模糊逻辑各种分类算法的处理。在分析不同的数据挖掘方法时,通过随机森林分类器计算出的准确率高达99.7%,在模糊逻辑方法众多的情况下,高精度和低复杂度贡献了96%的相当高的准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.10
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信