Multiple Data Imputation Methods Advance Risk Analysis and Treatability of Co-occurring Inorganic Chemicals in Groundwater

IF 11.3 1区 环境科学与生态学 Q1 ENGINEERING, ENVIRONMENTAL
Akhlak U. Mahmood, Minhazul Islam, Alexey V. Gulyuk, Emily Briese, Carmen A. Velasco, Mohit Malu, Naushita Sharma, Andreas Spanias, Yaroslava G. Yingling, Paul Westerhoff
{"title":"Multiple Data Imputation Methods Advance Risk Analysis and Treatability of Co-occurring Inorganic Chemicals in Groundwater","authors":"Akhlak U. Mahmood, Minhazul Islam, Alexey V. Gulyuk, Emily Briese, Carmen A. Velasco, Mohit Malu, Naushita Sharma, Andreas Spanias, Yaroslava G. Yingling, Paul Westerhoff","doi":"10.1021/acs.est.4c05203","DOIUrl":null,"url":null,"abstract":"Accurately assessing and managing risks associated with inorganic pollutants in groundwater is imperative. Historic water quality databases are often sparse due to rationale or financial budgets for sample collection and analysis, posing challenges in evaluating exposure or water treatment effectiveness. We utilized and compared two advanced multiple data imputation techniques, AMELIA and MICE algorithms, to fill gaps in sparse groundwater quality data sets. AMELIA outperformed MICE in handling missing values, as MICE tended to overestimate certain values, resulting in more outliers. Field data sets revealed that 75% to 80% of samples exhibited no co-occurring regulated pollutants surpassing MCL values, whereas imputed values showed only 15% to 55% of the samples posed no health risks. Imputed data unveiled a significant increase, ranging from 2 to 5 times, in the number of sampling locations predicted to potentially exceed health-based limits and identified samples where 2 to 6 co-occurring chemicals may occur and surpass health-based levels. Linking imputed data to sampling locations can pinpoint potential hotspots of elevated chemical levels and guide optimal resource allocation for additional field sampling and chemical analysis. With this approach, further analysis of complete data sets allows state agencies authorized to conduct groundwater monitoring, often with limited financial resources, to prioritize sampling locations and chemicals to be tested. Given existing data and time constraints, it is crucial to identify the most strategic use of the available resources to address data gaps effectively. This work establishes a framework to enhance the beneficial impact of funding groundwater data collection by reducing uncertainty in prioritizing future sampling locations and chemical analyses.","PeriodicalId":36,"journal":{"name":"环境科学与技术","volume":"95 1","pages":""},"PeriodicalIF":11.3000,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"环境科学与技术","FirstCategoryId":"1","ListUrlMain":"https://doi.org/10.1021/acs.est.4c05203","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Accurately assessing and managing risks associated with inorganic pollutants in groundwater is imperative. Historic water quality databases are often sparse due to rationale or financial budgets for sample collection and analysis, posing challenges in evaluating exposure or water treatment effectiveness. We utilized and compared two advanced multiple data imputation techniques, AMELIA and MICE algorithms, to fill gaps in sparse groundwater quality data sets. AMELIA outperformed MICE in handling missing values, as MICE tended to overestimate certain values, resulting in more outliers. Field data sets revealed that 75% to 80% of samples exhibited no co-occurring regulated pollutants surpassing MCL values, whereas imputed values showed only 15% to 55% of the samples posed no health risks. Imputed data unveiled a significant increase, ranging from 2 to 5 times, in the number of sampling locations predicted to potentially exceed health-based limits and identified samples where 2 to 6 co-occurring chemicals may occur and surpass health-based levels. Linking imputed data to sampling locations can pinpoint potential hotspots of elevated chemical levels and guide optimal resource allocation for additional field sampling and chemical analysis. With this approach, further analysis of complete data sets allows state agencies authorized to conduct groundwater monitoring, often with limited financial resources, to prioritize sampling locations and chemicals to be tested. Given existing data and time constraints, it is crucial to identify the most strategic use of the available resources to address data gaps effectively. This work establishes a framework to enhance the beneficial impact of funding groundwater data collection by reducing uncertainty in prioritizing future sampling locations and chemical analyses.

Abstract Image

多重数据推算方法推进地下水中共生无机化学品的风险分析和可处理性
准确评估和管理与地下水中无机污染物相关的风险势在必行。由于样本收集和分析的合理性或资金预算问题,历史水质数据库通常很稀少,这给评估暴露或水处理效果带来了挑战。我们利用并比较了两种先进的多重数据估算技术--AMELIA 算法和 MICE 算法,以填补稀疏地下水水质数据集的空白。AMELIA 在处理缺失值方面的表现优于 MICE,因为 MICE 往往会高估某些值,从而导致更多的异常值。实地数据集显示,75% 至 80% 的样本中没有同时出现超过 MCL 值的受管制污染物,而估算值显示只有 15% 至 55% 的样本没有健康风险。估算数据显示,预测可能超过健康限值的采样点数量大幅增加了 2 到 5 倍,并确定了可能出现 2 到 6 种共存化学品并超过健康限值的样本。将估算数据与采样地点联系起来,可以准确定位化学物质水平升高的潜在热点,并指导额外现场采样和化学分析的最佳资源分配。通过这种方法,对完整数据集的进一步分析可以让授权进行地下水监测的国家机构(通常财力有限)优先考虑采样地点和需要检测的化学品。鉴于现有的数据和时间限制,确定现有资源的最具战略性的使用方法以有效解决数据缺口至关重要。这项工作建立了一个框架,通过减少在确定未来采样地点和化学分析优先次序方面的不确定性,来提高资助地下水数据收集工作的有益影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
环境科学与技术
环境科学与技术 环境科学-工程:环境
CiteScore
17.50
自引率
9.60%
发文量
12359
审稿时长
2.8 months
期刊介绍: Environmental Science & Technology (ES&T) is a co-sponsored academic and technical magazine by the Hubei Provincial Environmental Protection Bureau and the Hubei Provincial Academy of Environmental Sciences. Environmental Science & Technology (ES&T) holds the status of Chinese core journals, scientific papers source journals of China, Chinese Science Citation Database source journals, and Chinese Academic Journal Comprehensive Evaluation Database source journals. This publication focuses on the academic field of environmental protection, featuring articles related to environmental protection and technical advancements.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信