Predicting water potability using a machine learning approach

Q2 Environmental Science
El-Bacha Rachid , Salhi Abderrahim , Abderrafia Hafid , Rabi Souad
{"title":"Predicting water potability using a machine learning approach","authors":"El-Bacha Rachid ,&nbsp;Salhi Abderrahim ,&nbsp;Abderrafia Hafid ,&nbsp;Rabi Souad","doi":"10.1016/j.envc.2025.101131","DOIUrl":null,"url":null,"abstract":"<div><div>In order to improve the prevention of risks associated with water quality, the precise and rapid assessment of this later is necessary. Nevertheless, conventional techniques often encounter difficulties due to the complexity and inconsistency of water quality data, resulting in time-consuming and arduous analyses. This research introduces a novel application of machine learning (ML) to predict water potability. In this context, support vector machine (SVM) and random forest (RF) models were trained, validated and tested using publicly available dataset that include records of water potability as a function of various physical and chemical parameters. Results indicated that random forest model outperformed the support vector machine model by achieving better results, with 70 % accuracy, 72 % precision, and 75 % receiver operating characteristic area under the curve (ROC-AUC). These findings indicated that machine learning, specifically random forest, can be effective method for evaluating water potability, leading to enhanced water resource management and public health safety. Additionally, water treatment plants can utilize these machine learning algorithms for immediate data analysis. This enables the anticipation of contamination events before water reaches consumers. This model can play an important role in developing early alert systems by encouraging prevention measures rather than just reactive actions.</div></div>","PeriodicalId":34794,"journal":{"name":"Environmental Challenges","volume":"19 ","pages":"Article 101131"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Challenges","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667010025000496","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Environmental Science","Score":null,"Total":0}
引用次数: 0

Abstract

In order to improve the prevention of risks associated with water quality, the precise and rapid assessment of this later is necessary. Nevertheless, conventional techniques often encounter difficulties due to the complexity and inconsistency of water quality data, resulting in time-consuming and arduous analyses. This research introduces a novel application of machine learning (ML) to predict water potability. In this context, support vector machine (SVM) and random forest (RF) models were trained, validated and tested using publicly available dataset that include records of water potability as a function of various physical and chemical parameters. Results indicated that random forest model outperformed the support vector machine model by achieving better results, with 70 % accuracy, 72 % precision, and 75 % receiver operating characteristic area under the curve (ROC-AUC). These findings indicated that machine learning, specifically random forest, can be effective method for evaluating water potability, leading to enhanced water resource management and public health safety. Additionally, water treatment plants can utilize these machine learning algorithms for immediate data analysis. This enables the anticipation of contamination events before water reaches consumers. This model can play an important role in developing early alert systems by encouraging prevention measures rather than just reactive actions.
使用机器学习方法预测水的可饮用性
为了提高对与水质有关的风险的预防,有必要在后期对此进行精确和快速的评估。然而,由于水质数据的复杂性和不一致性,常规技术经常遇到困难,导致耗时和繁重的分析。本研究介绍了一种新的机器学习(ML)应用于预测水的可饮用性。在这种情况下,支持向量机(SVM)和随机森林(RF)模型使用公开可用的数据集进行训练、验证和测试,这些数据集包括各种物理和化学参数对饮用水的影响记录。结果表明,随机森林模型的准确率为70%,精密度为72%,曲线下接收者工作特征面积(ROC-AUC)为75%,优于支持向量机模型。这些发现表明,机器学习,特别是随机森林,可以成为评估饮用水的有效方法,从而加强水资源管理和公共卫生安全。此外,水处理厂可以利用这些机器学习算法进行即时数据分析。这样可以在水到达消费者之前预测污染事件。这种模式可以通过鼓励预防措施而不仅仅是反应性行动,在开发早期预警系统方面发挥重要作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Challenges
Environmental Challenges Environmental Science-Environmental Engineering
CiteScore
8.00
自引率
0.00%
发文量
249
审稿时长
8 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信