Comparison of Naïve Bayes, C4.5 and K-Nearest Neighbor for Covid-19 Data Classification

Umairah Rizkya Gurning, Mustakim, Said Thaufik Rizaldi, Hamdi Syukron
{"title":"Comparison of Naïve Bayes, C4.5 and K-Nearest Neighbor for Covid-19 Data Classification","authors":"Umairah Rizkya Gurning, Mustakim, Said Thaufik Rizaldi, Hamdi Syukron","doi":"10.1109/ISITDI55734.2022.9944507","DOIUrl":null,"url":null,"abstract":"Covid-19 is a new virus that appeared in the city of Wuhan in 2019. This virus spreads very quickly even to Indonesia. One effort that can be done to detect the presence of this virus is the PCR and antigen test. Increasing this case resulted in a medical team having difficulty detecting suspects exposed to viruses. This research was conducted to find the best classification algorithm in predicting and classifying status on the suspected Covid-19 both exposed or not exposed. The method used in this study is Naïve Bayes, C4.5 and K-Nearest Neighbor which have very high accuracy using secondary data from the Dumai City Health Agency. From this study it was found that the algorithm C4.5 as the best algorithm in predicting the status of COVID-19 patients, especially in the city of Dumai with an accuracy of 86.54%, recall 71.51%and precision 85.14%. This study has implications for further researchers in choosing an algorithm to predict the COVID-19 case.","PeriodicalId":312644,"journal":{"name":"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)","volume":"125 41","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Symposium on Information Technology and Digital Innovation (ISITDI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISITDI55734.2022.9944507","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Covid-19 is a new virus that appeared in the city of Wuhan in 2019. This virus spreads very quickly even to Indonesia. One effort that can be done to detect the presence of this virus is the PCR and antigen test. Increasing this case resulted in a medical team having difficulty detecting suspects exposed to viruses. This research was conducted to find the best classification algorithm in predicting and classifying status on the suspected Covid-19 both exposed or not exposed. The method used in this study is Naïve Bayes, C4.5 and K-Nearest Neighbor which have very high accuracy using secondary data from the Dumai City Health Agency. From this study it was found that the algorithm C4.5 as the best algorithm in predicting the status of COVID-19 patients, especially in the city of Dumai with an accuracy of 86.54%, recall 71.51%and precision 85.14%. This study has implications for further researchers in choosing an algorithm to predict the COVID-19 case.
Naïve贝叶斯、C4.5和k近邻算法在Covid-19数据分类中的比较
Covid-19是2019年在武汉市出现的一种新病毒。这种病毒传播得非常快,甚至到了印度尼西亚。检测这种病毒存在的一种方法是聚合酶链反应和抗原试验。这一病例的增加导致医疗小组难以发现感染病毒的嫌疑人。本研究旨在寻找对疑似Covid-19暴露或未暴露状态进行预测和分类的最佳分类算法。本研究使用的方法是Naïve贝叶斯,C4.5和k近邻,使用杜麦市卫生局的二手数据,具有很高的准确性。本研究发现C4.5算法是预测COVID-19患者状态的最佳算法,特别是在杜麦市,准确率为86.54%,召回率为71.51%,精密度为85.14%。这项研究对进一步研究人员选择预测COVID-19病例的算法具有启示意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信