{"title":"Cart与朴素贝叶斯算法诊断糖尿病的性能比较","authors":"Irfan Santiko, Pungkas Subarkah","doi":"10.47738/ijiis.v2i1.9","DOIUrl":null,"url":null,"abstract":"Based on Indonesia's health profile in 2008, Diabetes Mellitus is the cause of the ranking of six for all ages in Indonesia with the proportion of deaths of 5.7% under stroke, TB, hypertension, injury and perinatal. This is reinforced by WHO (2003), Diabetes Mellitus disease reached 194 million people or 5.1 percent of the world's adult population and in 2025 is expected to increase to 333 million inhabitants. In particular, in Indonesia, people with Diabetes Mellitus are increasing. In 2000, Diabetes Mellitus sufferers have reached 8.4 million people and it is estimated that the prevalence of Diabetes Mellitus in 2030 in Indonesia reaches 21.3 million people.This allows researchers and practitioners to focus their attention on detecting/diagnosing diabetes mellitus and to prevent it because the disease can cause complications. The method used in this research was problem identification, data collection, pre-processing stage, classification method, validation and evaluation and conclusion. The algorithm used in this research was CART and Naïve Bayes using dataset taken from UCI Indian Pima database repository consisting of clinical data ofpatients who detected positive and negative diabetes mellitus. Validation and evaluation method used was 10-crossvalidation and confusion Matrix for the assessment of precision, recall and F-Measure. The result of calculation has been done, got the accuracy result on CART algorithm equaled to 76.9337% with precision 0.764%, recall 0.769%, and F-Measure 0.765%. Whilethe diabetes dataset was tested with the Naïve Bayes algorithm, got an accuracy of 73.7569% with precision 0.732%, recall 0.738%, and F-Measure 0.734%. From these results it can be concluded that to diagnose diabetes mellitus disease it is suggested to use CART algorithm.","PeriodicalId":229613,"journal":{"name":"IJIIS: International Journal of Informatics and Information Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Comparison of Cart and Naive Bayesian Algorithm Performance to Diagnose Diabetes Mellitus\",\"authors\":\"Irfan Santiko, Pungkas Subarkah\",\"doi\":\"10.47738/ijiis.v2i1.9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Based on Indonesia's health profile in 2008, Diabetes Mellitus is the cause of the ranking of six for all ages in Indonesia with the proportion of deaths of 5.7% under stroke, TB, hypertension, injury and perinatal. This is reinforced by WHO (2003), Diabetes Mellitus disease reached 194 million people or 5.1 percent of the world's adult population and in 2025 is expected to increase to 333 million inhabitants. In particular, in Indonesia, people with Diabetes Mellitus are increasing. In 2000, Diabetes Mellitus sufferers have reached 8.4 million people and it is estimated that the prevalence of Diabetes Mellitus in 2030 in Indonesia reaches 21.3 million people.This allows researchers and practitioners to focus their attention on detecting/diagnosing diabetes mellitus and to prevent it because the disease can cause complications. The method used in this research was problem identification, data collection, pre-processing stage, classification method, validation and evaluation and conclusion. The algorithm used in this research was CART and Naïve Bayes using dataset taken from UCI Indian Pima database repository consisting of clinical data ofpatients who detected positive and negative diabetes mellitus. Validation and evaluation method used was 10-crossvalidation and confusion Matrix for the assessment of precision, recall and F-Measure. The result of calculation has been done, got the accuracy result on CART algorithm equaled to 76.9337% with precision 0.764%, recall 0.769%, and F-Measure 0.765%. Whilethe diabetes dataset was tested with the Naïve Bayes algorithm, got an accuracy of 73.7569% with precision 0.732%, recall 0.738%, and F-Measure 0.734%. From these results it can be concluded that to diagnose diabetes mellitus disease it is suggested to use CART algorithm.\",\"PeriodicalId\":229613,\"journal\":{\"name\":\"IJIIS: International Journal of Informatics and Information Systems\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IJIIS: International Journal of Informatics and Information Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.47738/ijiis.v2i1.9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IJIIS: International Journal of Informatics and Information Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.47738/ijiis.v2i1.9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
摘要
根据印度尼西亚2008年的健康状况,糖尿病是印度尼西亚所有年龄段中排名第六的原因,中风、结核病、高血压、伤害和围产期死亡所占比例为5.7%。世界卫生组织(2003年)进一步证实了这一点,糖尿病患者达到1.94亿人,占世界成年人口的5.1%,预计到2025年将增加到3.33亿居民。特别是在印度尼西亚,糖尿病患者正在增加。2000年,糖尿病患者已达840万人,预计到2030年,印度尼西亚的糖尿病患病率将达到2130万人。这使研究人员和从业人员能够将注意力集中在检测/诊断糖尿病和预防糖尿病上,因为这种疾病会引起并发症。本研究采用的方法为问题识别、数据收集、预处理阶段、分类方法、验证评价和结论。本研究使用的算法是CART和Naïve Bayes,使用的数据集来自UCI Indian Pima数据库存储库,包括检测到阳性和阴性糖尿病患者的临床数据。采用10交叉验证和混淆矩阵评价精密度、召回率和F-Measure。计算结果表明,CART算法的准确率为76.9337%,精密度为0.764%,召回率为0.769%,F-Measure为0.765%。而使用Naïve Bayes算法对糖尿病数据集进行测试,准确率为73.7569%,精密度为0.732%,召回率为0.738%,F-Measure为0.734%。从这些结果可以得出结论,建议使用CART算法诊断糖尿病疾病。
Comparison of Cart and Naive Bayesian Algorithm Performance to Diagnose Diabetes Mellitus
Based on Indonesia's health profile in 2008, Diabetes Mellitus is the cause of the ranking of six for all ages in Indonesia with the proportion of deaths of 5.7% under stroke, TB, hypertension, injury and perinatal. This is reinforced by WHO (2003), Diabetes Mellitus disease reached 194 million people or 5.1 percent of the world's adult population and in 2025 is expected to increase to 333 million inhabitants. In particular, in Indonesia, people with Diabetes Mellitus are increasing. In 2000, Diabetes Mellitus sufferers have reached 8.4 million people and it is estimated that the prevalence of Diabetes Mellitus in 2030 in Indonesia reaches 21.3 million people.This allows researchers and practitioners to focus their attention on detecting/diagnosing diabetes mellitus and to prevent it because the disease can cause complications. The method used in this research was problem identification, data collection, pre-processing stage, classification method, validation and evaluation and conclusion. The algorithm used in this research was CART and Naïve Bayes using dataset taken from UCI Indian Pima database repository consisting of clinical data ofpatients who detected positive and negative diabetes mellitus. Validation and evaluation method used was 10-crossvalidation and confusion Matrix for the assessment of precision, recall and F-Measure. The result of calculation has been done, got the accuracy result on CART algorithm equaled to 76.9337% with precision 0.764%, recall 0.769%, and F-Measure 0.765%. Whilethe diabetes dataset was tested with the Naïve Bayes algorithm, got an accuracy of 73.7569% with precision 0.732%, recall 0.738%, and F-Measure 0.734%. From these results it can be concluded that to diagnose diabetes mellitus disease it is suggested to use CART algorithm.