Implementation of K-Nearest Neighbors, Naïve Bayes Classifier, Support Vector Machine and Decision Tree Algorithms for Obesity Risk Prediction

Public Research Journal of Engineering, Data Technology and Computer Science Pub Date : 2024-04-21 DOI:10.57152/predatecs.v2i1.1110

Amanda Iksanul Putri, Nur Alfa Husna, Neha Mella Cia, Muhammad Abdillah Arba, Nasywa Rihadatul Aisyi, Chintya Harum Pramesthi, Abidaharbya Salsa Irdayusman

{"title":"Implementation of K-Nearest Neighbors, Naïve Bayes Classifier, Support Vector Machine and Decision Tree Algorithms for Obesity Risk Prediction","authors":"Amanda Iksanul Putri, Nur Alfa Husna, Neha Mella Cia, Muhammad Abdillah Arba, Nasywa Rihadatul Aisyi, Chintya Harum Pramesthi, Abidaharbya Salsa Irdayusman","doi":"10.57152/predatecs.v2i1.1110","DOIUrl":null,"url":null,"abstract":"An abnormal or excessive build-up of fat that can negatively impact one's health as a result of an imbalance in energy between calories consumed and burnt is known as obesity. The majority of ailments, such as diabetes, heart disease, cancer, osteoarthritis, chronic renal disease, stroke, hypertension, and other fatal conditions, are linked to obesity. Information technology has therefore been the subject of several studies aimed at diagnosing and treating obesity. Because there is a wealth of information on obesity, data mining techniques such as the K-Nearest Neighbors (K-NN) algorithm, Naïve Bayes Classifier, Support Vector Machine (SVM), and Decision Tree can be used to classify the data. The 2111 records and 17 characteristics of obesity data that were received from Kaggle will be used in this study. The four algorithms are to be compared in this study. In other words, using the dataset used in this study, the Decision Tree algorithm's accuracy outperforms that of the other three algorithms K-NN, Naïve Bayes, and SVM. Using the Decision Tree algorithm, the accuracy was 84.98%; the K-NN algorithm came in second with an accuracy value of 83.55%; the Naïve Bayes algorithm came in third with an accuracy rate of 77.48%; and the SVM algorithm came in last with the lowest accuracy value in this study, at 77.32%.","PeriodicalId":516904,"journal":{"name":"Public Research Journal of Engineering, Data Technology and Computer Science","volume":"104 30","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Public Research Journal of Engineering, Data Technology and Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.57152/predatecs.v2i1.1110","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

An abnormal or excessive build-up of fat that can negatively impact one's health as a result of an imbalance in energy between calories consumed and burnt is known as obesity. The majority of ailments, such as diabetes, heart disease, cancer, osteoarthritis, chronic renal disease, stroke, hypertension, and other fatal conditions, are linked to obesity. Information technology has therefore been the subject of several studies aimed at diagnosing and treating obesity. Because there is a wealth of information on obesity, data mining techniques such as the K-Nearest Neighbors (K-NN) algorithm, Naïve Bayes Classifier, Support Vector Machine (SVM), and Decision Tree can be used to classify the data. The 2111 records and 17 characteristics of obesity data that were received from Kaggle will be used in this study. The four algorithms are to be compared in this study. In other words, using the dataset used in this study, the Decision Tree algorithm's accuracy outperforms that of the other three algorithms K-NN, Naïve Bayes, and SVM. Using the Decision Tree algorithm, the accuracy was 84.98%; the K-NN algorithm came in second with an accuracy value of 83.55%; the Naïve Bayes algorithm came in third with an accuracy rate of 77.48%; and the SVM algorithm came in last with the lowest accuracy value in this study, at 77.32%.

查看原文本刊更多论文

实施 K-近邻、奈夫贝叶斯分类器、支持向量机和决策树算法预测肥胖风险

由于摄入和消耗的热量不平衡，导致脂肪异常或过度堆积，从而对人体健康产生负面影响，这就是肥胖症。大多数疾病，如糖尿病、心脏病、癌症、骨关节炎、慢性肾病、中风、高血压和其他致命疾病，都与肥胖有关。因此，信息技术已成为旨在诊断和治疗肥胖症的多项研究的主题。由于有关肥胖症的信息非常丰富，因此可以使用 K-近邻（K-NN）算法、奈夫贝叶斯分类器、支持向量机（SVM）和决策树等数据挖掘技术对数据进行分类。本研究将使用从 Kaggle 收到的 2111 条记录和 17 种肥胖特征数据。本研究将对这四种算法进行比较。换句话说，使用本研究中使用的数据集，决策树算法的准确性优于其他三种算法 K-NN、Naïve Bayes 和 SVM。使用决策树算法的准确率为 84.98%；K-NN 算法位居第二，准确率为 83.55%；Naïve Bayes 算法位居第三，准确率为 77.48%；SVM 算法位居最后，准确率为 77.32%，是本研究中准确率最低的算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Public Research Journal of Engineering, Data Technology and Computer Science

自引率

0.00%

发文量