评价 K-Nearest Neighbors 和 Naïve Bayes 在 Dringking Water Potability 分类中的作用

International Journal of Sustainable Applied Sciences Pub Date : 2024-06-06 DOI:10.59890/ijsas.v2i5.1864

Anisa Rahmawati, Muhamad Fatchan, Wahyu Hadikristanto, Universitas Pelita Bangsa

{"title":"评价 K-Nearest Neighbors 和 Naïve Bayes 在 Dringking Water Potability 分类中的作用","authors":"Anisa Rahmawati, Muhamad Fatchan, Wahyu Hadikristanto, Universitas Pelita Bangsa","doi":"10.59890/ijsas.v2i5.1864","DOIUrl":null,"url":null,"abstract":"The availability of drinking water that is safe and suitable for consumption is important to support health and development. This research emphasises the importance of handling the clean water crisis through the evaluation of drinking water quality using data mining algorithms. The dringking water quality evaluation method was selected using the K-Nearest Neighbors and Naive Bayes algorithms, replacing the manual method which is less responsive in predicting. The experimental process was conducted by utilising Kaggle website data by applying data processing and oversampling techniques to handle class imbalance in the dataset used. Bases on the research results, the accurancy of the K-Nearest Neighbors Algorithm reaches 65%, which is higher than the accuracy od the Naive Bayes Algorithm which is 64%. So it can be concluded that the K-Nearest Neighbors Algorithm is more effective in predicting the quality of water suitable for consumption. This research provides an in-depth insight into the use of technology and data analysis in dealing with the crisis in the availability of water suitable for consumption and offers suggestions for further research using more diverse methods and the use of more datasets to improve accuracy in evaluating the quality of potable water.","PeriodicalId":512625,"journal":{"name":"International Journal of Sustainable Applied Sciences","volume":"70 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Valuation K-Nearest Neighbors and Naïve Bayes for Dringking Water Potability Classification\",\"authors\":\"Anisa Rahmawati, Muhamad Fatchan, Wahyu Hadikristanto, Universitas Pelita Bangsa\",\"doi\":\"10.59890/ijsas.v2i5.1864\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The availability of drinking water that is safe and suitable for consumption is important to support health and development. This research emphasises the importance of handling the clean water crisis through the evaluation of drinking water quality using data mining algorithms. The dringking water quality evaluation method was selected using the K-Nearest Neighbors and Naive Bayes algorithms, replacing the manual method which is less responsive in predicting. The experimental process was conducted by utilising Kaggle website data by applying data processing and oversampling techniques to handle class imbalance in the dataset used. Bases on the research results, the accurancy of the K-Nearest Neighbors Algorithm reaches 65%, which is higher than the accuracy od the Naive Bayes Algorithm which is 64%. So it can be concluded that the K-Nearest Neighbors Algorithm is more effective in predicting the quality of water suitable for consumption. This research provides an in-depth insight into the use of technology and data analysis in dealing with the crisis in the availability of water suitable for consumption and offers suggestions for further research using more diverse methods and the use of more datasets to improve accuracy in evaluating the quality of potable water.\",\"PeriodicalId\":512625,\"journal\":{\"name\":\"International Journal of Sustainable Applied Sciences\",\"volume\":\"70 5\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Sustainable Applied Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.59890/ijsas.v2i5.1864\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Sustainable Applied Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59890/ijsas.v2i5.1864","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

提供安全、适合饮用的饮用水对于促进健康和发展非常重要。这项研究强调了通过使用数据挖掘算法评估饮用水水质来应对清洁水危机的重要性。使用 K-Nearest Neighbors 和 Naive Bayes 算法选择了饮用水水质评价方法，以取代预测反应较慢的人工方法。实验过程利用了 Kaggle 网站的数据，通过应用数据处理和超采样技术来处理数据集中的类不平衡问题。根据研究结果，K-近邻算法的准确率达到 65%，高于 Naive Bayes 算法 64% 的准确率。因此可以得出结论，K-近邻算法在预测适合饮用的水质方面更为有效。这项研究深入探讨了如何利用技术和数据分析来应对饮用水供应危机，并提出了进一步研究的建议，即使用更多样化的方法和更多的数据集来提高饮用水质量评估的准确性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Valuation K-Nearest Neighbors and Naïve Bayes for Dringking Water Potability Classification

The availability of drinking water that is safe and suitable for consumption is important to support health and development. This research emphasises the importance of handling the clean water crisis through the evaluation of drinking water quality using data mining algorithms. The dringking water quality evaluation method was selected using the K-Nearest Neighbors and Naive Bayes algorithms, replacing the manual method which is less responsive in predicting. The experimental process was conducted by utilising Kaggle website data by applying data processing and oversampling techniques to handle class imbalance in the dataset used. Bases on the research results, the accurancy of the K-Nearest Neighbors Algorithm reaches 65%, which is higher than the accuracy od the Naive Bayes Algorithm which is 64%. So it can be concluded that the K-Nearest Neighbors Algorithm is more effective in predicting the quality of water suitable for consumption. This research provides an in-depth insight into the use of technology and data analysis in dealing with the crisis in the availability of water suitable for consumption and offers suggestions for further research using more diverse methods and the use of more datasets to improve accuracy in evaluating the quality of potable water.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Journal of Sustainable Applied Sciences

自引率

0.00%

发文量