Design of machine learning-based water quality prediction system with recursive feature elimination cross-validation

James Julian, Annastya Bagas Dewantara, F. Wahyuni
{"title":"Design of machine learning-based water quality prediction system with recursive feature elimination cross-validation","authors":"James Julian, Annastya Bagas Dewantara, F. Wahyuni","doi":"10.20895/infotel.v15i3.977","DOIUrl":null,"url":null,"abstract":"Lack of clean water has become a problem in the world, and it is estimated that by 2025 there will be 2.8 billion people who will experience a shortage of clean water. The high demand for clean water and the limited water sources with proper potency is one of the main reasons for the need for a device capable of measuring the potability level of water that is flexible to carry and does not require high costs in the manufacturing process. In this paper, the design of machine learning-based potability devices with recursive feature elimination with cross-validation (RFECV) is carried out as a guide in making the design of a water potability detection system, and the results obtained from RFECV with the Random Forest (RF) algorithm have a higher accuracy value. 15.71% better than the RF model, 6.85% better than the Support Vector Machine (SVM) model, and 8.57% better than the Artificial Neural Network (ANN) model trained without RFECV. The water potability prediction system's design selection is based on feature elimination results in the RFECV process. It is based on a literature review on device selection. The proposed water potability detection system consists of ESP32 as the primary computing device, electrochemical spectroscopy-based Al/PET sensor to detect sulfate values with a sensitivity of 0.874 Ω/ppm, PH4502C as a pH measuring instrument with an accuracy of up to 98.10%, WD-35802-49 electrode. as a device for measuring hardness in water with a measurement range of 0.4 – 40,000 ppm, a total dissolved solids sensor to determine the solids content in water with an accuracy of up to 97.80%, as well as a carbon-based sensor for measuring chloramines with a reading capacity of 186 nA/ppm.","PeriodicalId":30672,"journal":{"name":"Jurnal Infotel","volume":"2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Infotel","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20895/infotel.v15i3.977","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Lack of clean water has become a problem in the world, and it is estimated that by 2025 there will be 2.8 billion people who will experience a shortage of clean water. The high demand for clean water and the limited water sources with proper potency is one of the main reasons for the need for a device capable of measuring the potability level of water that is flexible to carry and does not require high costs in the manufacturing process. In this paper, the design of machine learning-based potability devices with recursive feature elimination with cross-validation (RFECV) is carried out as a guide in making the design of a water potability detection system, and the results obtained from RFECV with the Random Forest (RF) algorithm have a higher accuracy value. 15.71% better than the RF model, 6.85% better than the Support Vector Machine (SVM) model, and 8.57% better than the Artificial Neural Network (ANN) model trained without RFECV. The water potability prediction system's design selection is based on feature elimination results in the RFECV process. It is based on a literature review on device selection. The proposed water potability detection system consists of ESP32 as the primary computing device, electrochemical spectroscopy-based Al/PET sensor to detect sulfate values with a sensitivity of 0.874 Ω/ppm, PH4502C as a pH measuring instrument with an accuracy of up to 98.10%, WD-35802-49 electrode. as a device for measuring hardness in water with a measurement range of 0.4 – 40,000 ppm, a total dissolved solids sensor to determine the solids content in water with an accuracy of up to 97.80%, as well as a carbon-based sensor for measuring chloramines with a reading capacity of 186 nA/ppm.
利用递归特征消除交叉验证设计基于机器学习的水质预测系统
清洁水的缺乏已成为世界性问题,据估计,到 2025 年,将有 28 亿人面临清洁水短缺的问题。人们对清洁水的需求量很大,而具有适当水效的水源却很有限,这就是需要一种能够测量水的水效等级、携带方便且制造成本不高的设备的主要原因之一。本文以基于机器学习的可饮用性装置的设计为指导,采用递归特征消除与交叉验证(RFECV)方法进行水的可饮用性检测系统的设计,RFECV 与随机森林(RF)算法得到的结果具有更高的准确度值。与 RF 模型相比,准确率提高了 15.71%;与支持向量机(SVM)模型相比,准确率提高了 6.85%;与未使用 RFECV 的人工神经网络(ANN)模型相比,准确率提高了 8.57%。水可饮用性预测系统的设计选择基于 RFECV 过程中的特征消除结果。该系统的设计选择是基于 RFECV 过程中的特征消除结果,并以设备选择方面的文献综述为基础。拟议的水可饮用性检测系统包括:作为主要计算设备的 ESP32、基于电化学光谱的 Al/PET 传感器(灵敏度为 0.874 Ω/ppm)、PH4502C(精度高达 98.10%的 pH 测量仪器)、WD-35802-49 电极。此外,还有用于测量氯胺的碳基传感器(读数能力为 186 nA/ppm)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
47
审稿时长
6 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信