Data Mining for Predicting Pre-diabetes: Comparing Two Approaches

K. Farahmand, Guangjing You, Jing Shi, S. S. Wadhwa
{"title":"Data Mining for Predicting Pre-diabetes: Comparing Two Approaches","authors":"K. Farahmand, Guangjing You, Jing Shi, S. S. Wadhwa","doi":"10.4018/IJUDH.2015070103","DOIUrl":null,"url":null,"abstract":"Many individuals who are at risk for type 2 diabetes do not experience symptoms of diabetes, and therefore are not aware of this condition. Screening for type 2 diabetes can identify individuals at risk for type 2 diabetes, and prevent or delay complications. A total of 13 risk factors, out of 17 variables of NHANES', were selected as predictors. In this study, a comparison of two data mining methodology showed that Decision Tree has a higher ROC index than Logistic Regression modeling. All ROC indexes for two data mining models were greater than 77% indicating both methods present a good prediction for pre-diabetes. The final results of comparison indicated Decision Tree modeling is a better indicator to predict pre-diabetes.","PeriodicalId":211533,"journal":{"name":"International Journal of User-Driven Healthcare","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2015-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of User-Driven Healthcare","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/IJUDH.2015070103","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Many individuals who are at risk for type 2 diabetes do not experience symptoms of diabetes, and therefore are not aware of this condition. Screening for type 2 diabetes can identify individuals at risk for type 2 diabetes, and prevent or delay complications. A total of 13 risk factors, out of 17 variables of NHANES', were selected as predictors. In this study, a comparison of two data mining methodology showed that Decision Tree has a higher ROC index than Logistic Regression modeling. All ROC indexes for two data mining models were greater than 77% indicating both methods present a good prediction for pre-diabetes. The final results of comparison indicated Decision Tree modeling is a better indicator to predict pre-diabetes.
预测糖尿病前期的数据挖掘:两种方法的比较
许多有2型糖尿病风险的人并没有经历糖尿病的症状,因此没有意识到这种情况。2型糖尿病筛查可以识别有2型糖尿病风险的个体,预防或延缓并发症的发生。从NHANES的17个变量中选取13个危险因素作为预测因子。在本研究中,两种数据挖掘方法的比较表明,决策树模型比Logistic回归模型具有更高的ROC指数。两种数据挖掘模型的ROC指标均大于77%,表明两种方法均能较好地预测糖尿病前期。比较结果表明决策树模型是预测糖尿病前期的较好指标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信