Common issues of data science on the eco-environmental risks of emerging contaminants

IF 10.3 1区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Xiangang Hu, Xu Dong, Zhangjia Wang
{"title":"Common issues of data science on the eco-environmental risks of emerging contaminants","authors":"Xiangang Hu,&nbsp;Xu Dong,&nbsp;Zhangjia Wang","doi":"10.1016/j.envint.2025.109301","DOIUrl":null,"url":null,"abstract":"<div><div>Data-driven approaches (e.g., machine learning) are increasingly used to replace or assist laboratory studies in the study of emerging contaminants (ECs). In the past ten years, an increasing number of models or approaches have been applied to ECs, and the datasets used are continuously enriched. However, there are large knowledge gaps between what we have found and the natural eco-environmental meaning. For most published reviews, the contents are organized by the types of ECs, but the common issues of data science, regardless of the type of pollutant, are not sufficiently addressed. To close or narrow the knowledge gaps, we highlight the following issues ignored in the field of data-driven EC research. Complicated biological and ecological data and ensemble models revealing mechanisms and spatiotemporal trends with strong causal relationships and without data leakage deserve more attention in the future. In addition, the matrix influence, trace concentration, and complex scenario have often been ignored in previous works. Therefore, an integrated research framework related to natural fields, ecological systems, and large-scale environmental problems, rather than relying solely on laboratory data-related analysis, is urgently needed. Beyond the current prediction purposes, data science can inspire the discovery of scientific questions, and mutual inspiration among data science, process and mechanism models, and laboratory and field research is a critical direction. Focusing on the above urgent and common issues related to data, frameworks, and purposes, regardless of the type of pollutant, data science is expected to achieve great advancements in addressing the eco-environmental risks of ECs.</div></div>","PeriodicalId":308,"journal":{"name":"Environment International","volume":"196 ","pages":"Article 109301"},"PeriodicalIF":10.3000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment International","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0160412025000522","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Data-driven approaches (e.g., machine learning) are increasingly used to replace or assist laboratory studies in the study of emerging contaminants (ECs). In the past ten years, an increasing number of models or approaches have been applied to ECs, and the datasets used are continuously enriched. However, there are large knowledge gaps between what we have found and the natural eco-environmental meaning. For most published reviews, the contents are organized by the types of ECs, but the common issues of data science, regardless of the type of pollutant, are not sufficiently addressed. To close or narrow the knowledge gaps, we highlight the following issues ignored in the field of data-driven EC research. Complicated biological and ecological data and ensemble models revealing mechanisms and spatiotemporal trends with strong causal relationships and without data leakage deserve more attention in the future. In addition, the matrix influence, trace concentration, and complex scenario have often been ignored in previous works. Therefore, an integrated research framework related to natural fields, ecological systems, and large-scale environmental problems, rather than relying solely on laboratory data-related analysis, is urgently needed. Beyond the current prediction purposes, data science can inspire the discovery of scientific questions, and mutual inspiration among data science, process and mechanism models, and laboratory and field research is a critical direction. Focusing on the above urgent and common issues related to data, frameworks, and purposes, regardless of the type of pollutant, data science is expected to achieve great advancements in addressing the eco-environmental risks of ECs.
在研究新兴污染物(ECs)时,越来越多地使用数据驱动方法(如机器学习)来替代或辅助实验室研究。在过去十年中,越来越多的模型或方法被应用于 ECs 研究,所使用的数据集也在不断丰富。然而,我们的发现与自然生态环境的含义之间还存在很大的知识差距。大多数已发表的综述都是按照生态污染物的类型来组织内容的,但对于数据科学的共性问题,无论污染物的类型如何,都没有充分讨论。为了弥补或缩小知识差距,我们强调以下在数据驱动的EC研究领域被忽视的问题。复杂的生物和生态数据以及揭示机制和时空趋势的集合模型,具有很强的因果关系,且没有数据泄露,今后应得到更多关注。此外,矩阵影响、痕量浓度和复杂场景在以往的研究中往往被忽视。因此,迫切需要一个与自然领域、生态系统和大规模环境问题相关的综合研究框架,而不是仅仅依赖实验室数据相关分析。在当前的预测目的之外,数据科学可以启发科学问题的发现,而数据科学、过程和机制模型、实验室和野外研究之间的相互启发是一个重要方向。无论污染物的类型如何,围绕上述与数据、框架和目的相关的紧迫和共性问题,数据科学有望在应对 ECs 生态环境风险方面取得重大进展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environment International
Environment International 环境科学-环境科学
CiteScore
21.90
自引率
3.40%
发文量
734
审稿时长
2.8 months
期刊介绍: Environmental Health publishes manuscripts focusing on critical aspects of environmental and occupational medicine, including studies in toxicology and epidemiology, to illuminate the human health implications of exposure to environmental hazards. The journal adopts an open-access model and practices open peer review. It caters to scientists and practitioners across all environmental science domains, directly or indirectly impacting human health and well-being. With a commitment to enhancing the prevention of environmentally-related health risks, Environmental Health serves as a public health journal for the community and scientists engaged in matters of public health significance concerning the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信