使用机器学习预测沿海含水层中砷的形态:泰国春武里和罗勇地下水盆地的案例研究

IF 4.3 Q1 ENVIRONMENTAL SCIENCES
Pongsathorn Thunyawatcharakul, Kyung Hwa Cho and Srilert Chotpantarat*, 
{"title":"使用机器学习预测沿海含水层中砷的形态:泰国春武里和罗勇地下水盆地的案例研究","authors":"Pongsathorn Thunyawatcharakul,&nbsp;Kyung Hwa Cho and Srilert Chotpantarat*,&nbsp;","doi":"10.1021/acsestwater.4c01082","DOIUrl":null,"url":null,"abstract":"<p >This study developed machine learning models to predict arsenic speciation, focusing on As(III), in contaminated groundwater systems. Two input sets were considered: a full set containing comprehensive hydrochemical variables for high-accuracy prediction and a reduced on-site set including only field-measurable parameters and total arsenic, designed for rapid and cost-effective screening. Due to limited As(III) data, models were trained to estimate As(V) instead. Three algorithms: random forest (RF), support vector regression (SVR), and artificial neural network (ANN), were evaluated using 5-fold cross-validation. RF achieved the highest accuracy under the full set, while SVR showed the most robust performance across both input sets. ANN underperformed due to overfitting caused by a scarcity of high-concentration samples. Margin-based learning of SVR allowed the model to maintain stability despite fewer inputs, and outliers were included, suggesting its suitability for fast screening monitoring. The proposed SVR model can reduce arsenic speciation analysis costs by minimizing laboratory requirements while maintaining reliable accuracy, with only total As concentration required. These findings support the integration of SVR-based models into groundwater monitoring frameworks and public health policies, particularly in arsenic-affected regions with limited resources, contributing to more accessible and efficient arsenic risk assessment.</p><p >Machine learning models predict arsenic speciation in groundwater using both annual hydrochemical datasets and limited on-site field measurements.</p>","PeriodicalId":93847,"journal":{"name":"ACS ES&T water","volume":"5 9","pages":"5011–5024"},"PeriodicalIF":4.3000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/pdf/10.1021/acsestwater.4c01082","citationCount":"0","resultStr":"{\"title\":\"Predicting Arsenic Speciation in Coastal Aquifers Using Machine Learning: A Case Study of the Chonburi and Rayong Groundwater Basins, Thailand\",\"authors\":\"Pongsathorn Thunyawatcharakul,&nbsp;Kyung Hwa Cho and Srilert Chotpantarat*,&nbsp;\",\"doi\":\"10.1021/acsestwater.4c01082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >This study developed machine learning models to predict arsenic speciation, focusing on As(III), in contaminated groundwater systems. Two input sets were considered: a full set containing comprehensive hydrochemical variables for high-accuracy prediction and a reduced on-site set including only field-measurable parameters and total arsenic, designed for rapid and cost-effective screening. Due to limited As(III) data, models were trained to estimate As(V) instead. Three algorithms: random forest (RF), support vector regression (SVR), and artificial neural network (ANN), were evaluated using 5-fold cross-validation. RF achieved the highest accuracy under the full set, while SVR showed the most robust performance across both input sets. ANN underperformed due to overfitting caused by a scarcity of high-concentration samples. Margin-based learning of SVR allowed the model to maintain stability despite fewer inputs, and outliers were included, suggesting its suitability for fast screening monitoring. The proposed SVR model can reduce arsenic speciation analysis costs by minimizing laboratory requirements while maintaining reliable accuracy, with only total As concentration required. These findings support the integration of SVR-based models into groundwater monitoring frameworks and public health policies, particularly in arsenic-affected regions with limited resources, contributing to more accessible and efficient arsenic risk assessment.</p><p >Machine learning models predict arsenic speciation in groundwater using both annual hydrochemical datasets and limited on-site field measurements.</p>\",\"PeriodicalId\":93847,\"journal\":{\"name\":\"ACS ES&T water\",\"volume\":\"5 9\",\"pages\":\"5011–5024\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2025-07-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/pdf/10.1021/acsestwater.4c01082\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS ES&T water\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acsestwater.4c01082\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS ES&T water","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsestwater.4c01082","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

本研究开发了机器学习模型来预测受污染地下水系统中砷的形态,重点是As(III)。考虑了两个输入集:一个包含全面的水化学变量的完整集,用于高精度预测;一个减少的现场集,仅包括现场可测量的参数和总砷,用于快速和具有成本效益的筛选。由于有限的As(III)数据,模型被训练来估计As(V)。三种算法:随机森林(RF)、支持向量回归(SVR)和人工神经网络(ANN),采用5倍交叉验证进行评估。RF在完整的输入集下达到最高的精度,而SVR在两个输入集上都表现出最稳健的性能。由于缺乏高浓度样本导致的过拟合,人工神经网络表现不佳。基于边际的SVR学习允许模型在输入较少的情况下保持稳定,并且包含异常值,表明其适合快速筛选监测。所提出的SVR模型可以通过最大限度地减少实验室要求,同时保持可靠的准确性,从而降低砷形态分析成本,只需要总砷浓度。这些发现支持将基于svr的模型纳入地下水监测框架和公共卫生政策,特别是在资源有限的砷影响地区,有助于更容易获得和更有效的砷风险评估。机器学习模型使用年度水化学数据集和有限的现场测量来预测地下水中的砷形态。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting Arsenic Speciation in Coastal Aquifers Using Machine Learning: A Case Study of the Chonburi and Rayong Groundwater Basins, Thailand

This study developed machine learning models to predict arsenic speciation, focusing on As(III), in contaminated groundwater systems. Two input sets were considered: a full set containing comprehensive hydrochemical variables for high-accuracy prediction and a reduced on-site set including only field-measurable parameters and total arsenic, designed for rapid and cost-effective screening. Due to limited As(III) data, models were trained to estimate As(V) instead. Three algorithms: random forest (RF), support vector regression (SVR), and artificial neural network (ANN), were evaluated using 5-fold cross-validation. RF achieved the highest accuracy under the full set, while SVR showed the most robust performance across both input sets. ANN underperformed due to overfitting caused by a scarcity of high-concentration samples. Margin-based learning of SVR allowed the model to maintain stability despite fewer inputs, and outliers were included, suggesting its suitability for fast screening monitoring. The proposed SVR model can reduce arsenic speciation analysis costs by minimizing laboratory requirements while maintaining reliable accuracy, with only total As concentration required. These findings support the integration of SVR-based models into groundwater monitoring frameworks and public health policies, particularly in arsenic-affected regions with limited resources, contributing to more accessible and efficient arsenic risk assessment.

Machine learning models predict arsenic speciation in groundwater using both annual hydrochemical datasets and limited on-site field measurements.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
5.40
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信