在尼泊尔使用机器学习预测自来水中大肠杆菌的存在

IF 1.7 4区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES
So Kuroki, Ryuji Ogata, M. Sakamoto
{"title":"在尼泊尔使用机器学习预测自来水中大肠杆菌的存在","authors":"So Kuroki, Ryuji Ogata, M. Sakamoto","doi":"10.1111/wej.12844","DOIUrl":null,"url":null,"abstract":"Within developing countries, a multitude of problems that affect the water supply process can result in the contamination of water taps. While machine learning applications have become popular for attaining efficient water quality predictions, acquiring the necessary data for modelling for developing countries is challenging. This study constructs water quality prediction models by machine learning with a pseudo‐pipeline network to complement the missing data of the water supply process. Using both water source and water tap quality information measured by the Government of Nepal, we apply the three machine learning models: support vector machine (SVM), random forest (RF) and LightGBM. Furthermore, we also apply a traditional statistical method—logistic regression (LR)—to the prediction of the Escherichia coli (E. coli) contamination in water taps. With some input variables (such as the length from the nearest sources) obtained from the pseudo‐pipeline network, the results show that SVM has stable and high accuracy for both the 26 cities (70%) and for the 25 cities except for Kathmandu (79%). LR performed a significantly lower accuracy for all cities (61%) than for 25 cities (79%). Additionally, we show that our method can be applied to other regions where a water quality survey has not yet been conducted.","PeriodicalId":23753,"journal":{"name":"Water and Environment Journal","volume":"37 1","pages":"402 - 411"},"PeriodicalIF":1.7000,"publicationDate":"2023-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting the presence of E. coli in tap water using machine learning in Nepal\",\"authors\":\"So Kuroki, Ryuji Ogata, M. Sakamoto\",\"doi\":\"10.1111/wej.12844\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Within developing countries, a multitude of problems that affect the water supply process can result in the contamination of water taps. While machine learning applications have become popular for attaining efficient water quality predictions, acquiring the necessary data for modelling for developing countries is challenging. This study constructs water quality prediction models by machine learning with a pseudo‐pipeline network to complement the missing data of the water supply process. Using both water source and water tap quality information measured by the Government of Nepal, we apply the three machine learning models: support vector machine (SVM), random forest (RF) and LightGBM. Furthermore, we also apply a traditional statistical method—logistic regression (LR)—to the prediction of the Escherichia coli (E. coli) contamination in water taps. With some input variables (such as the length from the nearest sources) obtained from the pseudo‐pipeline network, the results show that SVM has stable and high accuracy for both the 26 cities (70%) and for the 25 cities except for Kathmandu (79%). LR performed a significantly lower accuracy for all cities (61%) than for 25 cities (79%). Additionally, we show that our method can be applied to other regions where a water quality survey has not yet been conducted.\",\"PeriodicalId\":23753,\"journal\":{\"name\":\"Water and Environment Journal\",\"volume\":\"37 1\",\"pages\":\"402 - 411\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2023-01-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Water and Environment Journal\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1111/wej.12844\",\"RegionNum\":4,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water and Environment Journal","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1111/wej.12844","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

在发展中国家,影响供水过程的许多问题可能导致水龙头受到污染。虽然机器学习应用程序在实现高效的水质预测方面变得很受欢迎,但为发展中国家获取建模所需的数据是一项挑战。本研究通过机器学习和伪管网构建水质预测模型,以补充供水过程中缺失的数据。利用尼泊尔政府测量的水源和水龙头质量信息,我们应用了三个机器学习模型:支持向量机(SVM)、随机森林(RF)和LightGBM。此外,我们还将传统的统计方法——逻辑回归(LR)——应用于水龙头中大肠杆菌污染的预测。利用从伪管网中获得的一些输入变量(如最近源的长度),结果表明,SVM对26个城市(70%)和除加德满都外的25个城市(79%)都具有稳定和高精度。LR在所有城市(61%)的准确率明显低于25个城市(79%)。此外,我们还表明,我们的方法可以应用于尚未进行水质调查的其他地区。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting the presence of E. coli in tap water using machine learning in Nepal
Within developing countries, a multitude of problems that affect the water supply process can result in the contamination of water taps. While machine learning applications have become popular for attaining efficient water quality predictions, acquiring the necessary data for modelling for developing countries is challenging. This study constructs water quality prediction models by machine learning with a pseudo‐pipeline network to complement the missing data of the water supply process. Using both water source and water tap quality information measured by the Government of Nepal, we apply the three machine learning models: support vector machine (SVM), random forest (RF) and LightGBM. Furthermore, we also apply a traditional statistical method—logistic regression (LR)—to the prediction of the Escherichia coli (E. coli) contamination in water taps. With some input variables (such as the length from the nearest sources) obtained from the pseudo‐pipeline network, the results show that SVM has stable and high accuracy for both the 26 cities (70%) and for the 25 cities except for Kathmandu (79%). LR performed a significantly lower accuracy for all cities (61%) than for 25 cities (79%). Additionally, we show that our method can be applied to other regions where a water quality survey has not yet been conducted.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Water and Environment Journal
Water and Environment Journal 环境科学-湖沼学
CiteScore
4.80
自引率
0.00%
发文量
67
审稿时长
18-36 weeks
期刊介绍: Water and Environment Journal is an internationally recognised peer reviewed Journal for the dissemination of innovations and solutions focussed on enhancing water management best practice. Water and Environment Journal is available to over 12,000 institutions with a further 7,000 copies physically distributed to the Chartered Institution of Water and Environmental Management (CIWEM) membership, comprised of environment sector professionals based across the value chain (utilities, consultancy, technology suppliers, regulators, government and NGOs). As such, the journal provides a conduit between academics and practitioners. We therefore particularly encourage contributions focussed at the interface between academia and industry, which deliver industrially impactful applied research underpinned by scientific evidence. We are keen to attract papers on a broad range of subjects including: -Water and wastewater treatment for agricultural, municipal and industrial applications -Sludge treatment including processing, storage and management -Water recycling -Urban and stormwater management -Integrated water management strategies -Water infrastructure and distribution -Climate change mitigation including management of impacts on agriculture, urban areas and infrastructure
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信