{"title":"Prediction for Diagnosing Liver Disease in Patients using KNN and Naïve Bayes Algorithms","authors":"H. Hartatik, Mohammad Badri Tamam, A. Setyanto","doi":"10.1109/ICORIS50180.2020.9320797","DOIUrl":null,"url":null,"abstract":"There is a lot of data on patients who undergo medication or medical examinations at the hospital and this is information that must be extracted so that it can provide information for future improvement conditions, meaning that past data can be used as a prediction basis for liver disease in patients. This is very beneficial for medical personnel and also for patients if they experience symptoms that match the symptoms felt by a patient. This project uses machine learning because it involves big data and past data is used to predict future data. Referring to previous research in reference that the results of the evaluation are varied. So in this study, The proposed strategy to performance optimization is carried out based on training data and variables that affect the model. Based on the results of calculations and analysis, it was found that the performance evaluation values were area under the curve(AUC) for naïve Bayes algorithm is 72.5% and k-nearest neighbour (KNN) of 63.19%.","PeriodicalId":280589,"journal":{"name":"2020 2nd International Conference on Cybernetics and Intelligent System (ICORIS)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd International Conference on Cybernetics and Intelligent System (ICORIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICORIS50180.2020.9320797","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
There is a lot of data on patients who undergo medication or medical examinations at the hospital and this is information that must be extracted so that it can provide information for future improvement conditions, meaning that past data can be used as a prediction basis for liver disease in patients. This is very beneficial for medical personnel and also for patients if they experience symptoms that match the symptoms felt by a patient. This project uses machine learning because it involves big data and past data is used to predict future data. Referring to previous research in reference that the results of the evaluation are varied. So in this study, The proposed strategy to performance optimization is carried out based on training data and variables that affect the model. Based on the results of calculations and analysis, it was found that the performance evaluation values were area under the curve(AUC) for naïve Bayes algorithm is 72.5% and k-nearest neighbour (KNN) of 63.19%.
有很多关于在医院接受药物治疗或医学检查的患者的数据,这些信息必须被提取出来,以便为未来的改善状况提供信息,这意味着过去的数据可以用作预测患者肝脏疾病的基础。这对医务人员和患者非常有益,如果他们经历的症状与患者感觉到的症状相匹配。这个项目使用机器学习,因为它涉及到大数据,过去的数据被用来预测未来的数据。参考前人的研究,得出的评价结果各不相同。因此,在本研究中,提出的性能优化策略是基于训练数据和影响模型的变量进行的。计算分析结果表明,naïve贝叶斯算法的性能评价值为曲线下面积(area under the curve, AUC)为72.5%,k近邻(k-nearest neighbour, KNN)为63.19%。