评价 K-Nearest Neighbors 和 Naïve Bayes 在 Dringking Water Potability 分类中的作用

Anisa Rahmawati, Muhamad Fatchan, Wahyu Hadikristanto, Universitas Pelita Bangsa
{"title":"评价 K-Nearest Neighbors 和 Naïve Bayes 在 Dringking Water Potability 分类中的作用","authors":"Anisa Rahmawati, Muhamad Fatchan, Wahyu Hadikristanto, Universitas Pelita Bangsa","doi":"10.59890/ijsas.v2i5.1864","DOIUrl":null,"url":null,"abstract":"The availability of drinking water that is safe and suitable for consumption is important to support health and development. This research emphasises the importance of handling the clean water crisis through the evaluation of drinking water quality using data mining algorithms.  The dringking water quality evaluation method was selected using the K-Nearest Neighbors and Naive Bayes algorithms, replacing the manual method which is less responsive in predicting. The experimental process was conducted by utilising Kaggle website data by applying data processing and oversampling techniques to handle class imbalance in the dataset used. Bases on the research results, the accurancy of the K-Nearest Neighbors Algorithm reaches 65%, which is higher than the accuracy od the Naive Bayes Algorithm which is 64%. So it can be concluded that the K-Nearest Neighbors Algorithm is more effective in predicting the quality of water suitable for consumption. This research provides an in-depth insight into the use of technology and data analysis in dealing with the crisis in the availability of water suitable for consumption and offers suggestions for further research using more diverse methods and the use of more datasets to improve accuracy in evaluating the quality of potable water.","PeriodicalId":512625,"journal":{"name":"International Journal of Sustainable Applied Sciences","volume":"70 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Valuation K-Nearest Neighbors and Naïve Bayes for Dringking Water Potability Classification\",\"authors\":\"Anisa Rahmawati, Muhamad Fatchan, Wahyu Hadikristanto, Universitas Pelita Bangsa\",\"doi\":\"10.59890/ijsas.v2i5.1864\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The availability of drinking water that is safe and suitable for consumption is important to support health and development. This research emphasises the importance of handling the clean water crisis through the evaluation of drinking water quality using data mining algorithms.  The dringking water quality evaluation method was selected using the K-Nearest Neighbors and Naive Bayes algorithms, replacing the manual method which is less responsive in predicting. The experimental process was conducted by utilising Kaggle website data by applying data processing and oversampling techniques to handle class imbalance in the dataset used. Bases on the research results, the accurancy of the K-Nearest Neighbors Algorithm reaches 65%, which is higher than the accuracy od the Naive Bayes Algorithm which is 64%. So it can be concluded that the K-Nearest Neighbors Algorithm is more effective in predicting the quality of water suitable for consumption. This research provides an in-depth insight into the use of technology and data analysis in dealing with the crisis in the availability of water suitable for consumption and offers suggestions for further research using more diverse methods and the use of more datasets to improve accuracy in evaluating the quality of potable water.\",\"PeriodicalId\":512625,\"journal\":{\"name\":\"International Journal of Sustainable Applied Sciences\",\"volume\":\"70 5\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Sustainable Applied Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.59890/ijsas.v2i5.1864\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Sustainable Applied Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59890/ijsas.v2i5.1864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

提供安全、适合饮用的饮用水对于促进健康和发展非常重要。这项研究强调了通过使用数据挖掘算法评估饮用水水质来应对清洁水危机的重要性。 使用 K-Nearest Neighbors 和 Naive Bayes 算法选择了饮用水水质评价方法,以取代预测反应较慢的人工方法。实验过程利用了 Kaggle 网站的数据,通过应用数据处理和超采样技术来处理数据集中的类不平衡问题。根据研究结果,K-近邻算法的准确率达到 65%,高于 Naive Bayes 算法 64% 的准确率。因此可以得出结论,K-近邻算法在预测适合饮用的水质方面更为有效。这项研究深入探讨了如何利用技术和数据分析来应对饮用水供应危机,并提出了进一步研究的建议,即使用更多样化的方法和更多的数据集来提高饮用水质量评估的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Valuation K-Nearest Neighbors and Naïve Bayes for Dringking Water Potability Classification
The availability of drinking water that is safe and suitable for consumption is important to support health and development. This research emphasises the importance of handling the clean water crisis through the evaluation of drinking water quality using data mining algorithms.  The dringking water quality evaluation method was selected using the K-Nearest Neighbors and Naive Bayes algorithms, replacing the manual method which is less responsive in predicting. The experimental process was conducted by utilising Kaggle website data by applying data processing and oversampling techniques to handle class imbalance in the dataset used. Bases on the research results, the accurancy of the K-Nearest Neighbors Algorithm reaches 65%, which is higher than the accuracy od the Naive Bayes Algorithm which is 64%. So it can be concluded that the K-Nearest Neighbors Algorithm is more effective in predicting the quality of water suitable for consumption. This research provides an in-depth insight into the use of technology and data analysis in dealing with the crisis in the availability of water suitable for consumption and offers suggestions for further research using more diverse methods and the use of more datasets to improve accuracy in evaluating the quality of potable water.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信