{"title":"糖尿病患者使用CFS和ROS方法与J48的Adaboost算法进行分类","authors":"Dian Pramadhana","doi":"10.29408/edumatic.v5i1.3336","DOIUrl":null,"url":null,"abstract":"Diabetes was a disease that occurs due to high blood-glucose levels. Researchers tried to prevent complications from developing by using data mining techniques. One of the techniques used in data mining was classification. The purpose of this study improves the accuracy of the classification of diabetes for a better and optimal result. The method in this study is Correlation Feature Selection (CFS) as attribute selection, Random Over Sampling to handle unbalanced data and AdaBoost to improve the performance of the J48 algorithm so the result obtained best. Based on the result of this study, showed that Correlation Feature Selection for attribute selection and Random Over Sampling to handle imbalance's class with the Adaboost-based J48 algorithm proved can increase the results of the diabetes classification with an accuracy of 92.3%. For the further research recommended to apply other methods so that the accuracy results obtained are more optimal for comparison.","PeriodicalId":314771,"journal":{"name":"Edumatic: Jurnal Pendidikan Informatika","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Klasifikasi Penyakit Diabetes Menggunakan Metode CFS dan ROS dengan Algoritma J48 Berbasis Adaboost\",\"authors\":\"Dian Pramadhana\",\"doi\":\"10.29408/edumatic.v5i1.3336\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Diabetes was a disease that occurs due to high blood-glucose levels. Researchers tried to prevent complications from developing by using data mining techniques. One of the techniques used in data mining was classification. The purpose of this study improves the accuracy of the classification of diabetes for a better and optimal result. The method in this study is Correlation Feature Selection (CFS) as attribute selection, Random Over Sampling to handle unbalanced data and AdaBoost to improve the performance of the J48 algorithm so the result obtained best. Based on the result of this study, showed that Correlation Feature Selection for attribute selection and Random Over Sampling to handle imbalance's class with the Adaboost-based J48 algorithm proved can increase the results of the diabetes classification with an accuracy of 92.3%. For the further research recommended to apply other methods so that the accuracy results obtained are more optimal for comparison.\",\"PeriodicalId\":314771,\"journal\":{\"name\":\"Edumatic: Jurnal Pendidikan Informatika\",\"volume\":\"41 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-06-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Edumatic: Jurnal Pendidikan Informatika\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.29408/edumatic.v5i1.3336\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Edumatic: Jurnal Pendidikan Informatika","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29408/edumatic.v5i1.3336","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
摘要
糖尿病是一种由高血糖引起的疾病。研究人员试图通过使用数据挖掘技术来防止并发症的发展。数据挖掘中使用的技术之一是分类。本研究的目的是提高糖尿病分类的准确性,以获得更好的最佳结果。本研究采用相关特征选择(Correlation Feature Selection, CFS)作为属性选择,随机过采样(Random Over Sampling)处理不平衡数据,AdaBoost提高J48算法的性能,从而获得最佳的结果。基于本研究的结果表明,使用基于adaboost的J48算法进行属性选择的相关特征选择和处理不平衡类的随机过采样可以提高糖尿病分类的结果,准确率达到92.3%。对于进一步的研究,建议采用其他方法,以获得更优的精度结果进行比较。
Klasifikasi Penyakit Diabetes Menggunakan Metode CFS dan ROS dengan Algoritma J48 Berbasis Adaboost
Diabetes was a disease that occurs due to high blood-glucose levels. Researchers tried to prevent complications from developing by using data mining techniques. One of the techniques used in data mining was classification. The purpose of this study improves the accuracy of the classification of diabetes for a better and optimal result. The method in this study is Correlation Feature Selection (CFS) as attribute selection, Random Over Sampling to handle unbalanced data and AdaBoost to improve the performance of the J48 algorithm so the result obtained best. Based on the result of this study, showed that Correlation Feature Selection for attribute selection and Random Over Sampling to handle imbalance's class with the Adaboost-based J48 algorithm proved can increase the results of the diabetes classification with an accuracy of 92.3%. For the further research recommended to apply other methods so that the accuracy results obtained are more optimal for comparison.