Identifying the key factors of mercury exposure in residents of southwestern Iran using machine learning algorithms.

IF 3.2 3区 环境科学与生态学 Q3 ENGINEERING, ENVIRONMENTAL
Narjes Okati, Zohre Ebrahimi-Khusfi, Samira Zandifar, Ruhollah Taghizadeh-Mehrjardi
{"title":"Identifying the key factors of mercury exposure in residents of southwestern Iran using machine learning algorithms.","authors":"Narjes Okati, Zohre Ebrahimi-Khusfi, Samira Zandifar, Ruhollah Taghizadeh-Mehrjardi","doi":"10.1007/s10653-025-02533-6","DOIUrl":null,"url":null,"abstract":"<p><p>It is necessary to predict hair mercury (Hg) levels and specify the related effective factors to develop preventive strategies to reduce Hg exposure in different regions. This study is the first effort to investigate the effectiveness of eight machine learning (ML) models (including multiple linear regression, decision tree regression, least absolute shrinkage and selection operator, multivariate adaptive regression splines, random forest, extreme gradient boosting, K-nearest neighbor, and Gaussian process) for predicting hair Hg levels and identifying the most important factors affecting them in residents of southwestern Iran. All ML models were trained with 70% of the dataset and their performance was evaluated using the determination coefficient (R<sup>2</sup>), root mean square error (RMSE), and mean absolute error (MAE) based on the remaining dataset. Finally, the Permutation Feature Importance (PFI) method was used to determine the relative importance (RI) of influencing factors. Mean hair Hg (3.31 µg g⁻<sup>1</sup>) was higher than the United States Environmental Protection Agency (US EPA) and World Health Organization (WHO) limits. It was indicated a high exposure risk for some people in this region. The extreme gradient boosting (XGB) model outperformed other algorithms in modeling hair Hg levels, with R<sup>2</sup> = 0.61, RMSE = 2.2, and MAE = 1.25. According to the PFI analysis, weight (RI: 43.4%) and geographic place (RI: 41.8%) were found as the most important demographic factors influencing Hg variation in the study population. Additionally, occupation (RI: 46.1%) and the frequency of fish and canned fish consumption (RI: 22%) were identified as the most significant exposure factors controlling hair Hg variability in southwestern Iran. These findings can be useful for formulating appropriate strategies to reduce the health risk of Hg exposure and improve human health.</p>","PeriodicalId":11759,"journal":{"name":"Environmental Geochemistry and Health","volume":"47 7","pages":"239"},"PeriodicalIF":3.2000,"publicationDate":"2025-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Geochemistry and Health","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10653-025-02533-6","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

It is necessary to predict hair mercury (Hg) levels and specify the related effective factors to develop preventive strategies to reduce Hg exposure in different regions. This study is the first effort to investigate the effectiveness of eight machine learning (ML) models (including multiple linear regression, decision tree regression, least absolute shrinkage and selection operator, multivariate adaptive regression splines, random forest, extreme gradient boosting, K-nearest neighbor, and Gaussian process) for predicting hair Hg levels and identifying the most important factors affecting them in residents of southwestern Iran. All ML models were trained with 70% of the dataset and their performance was evaluated using the determination coefficient (R2), root mean square error (RMSE), and mean absolute error (MAE) based on the remaining dataset. Finally, the Permutation Feature Importance (PFI) method was used to determine the relative importance (RI) of influencing factors. Mean hair Hg (3.31 µg g⁻1) was higher than the United States Environmental Protection Agency (US EPA) and World Health Organization (WHO) limits. It was indicated a high exposure risk for some people in this region. The extreme gradient boosting (XGB) model outperformed other algorithms in modeling hair Hg levels, with R2 = 0.61, RMSE = 2.2, and MAE = 1.25. According to the PFI analysis, weight (RI: 43.4%) and geographic place (RI: 41.8%) were found as the most important demographic factors influencing Hg variation in the study population. Additionally, occupation (RI: 46.1%) and the frequency of fish and canned fish consumption (RI: 22%) were identified as the most significant exposure factors controlling hair Hg variability in southwestern Iran. These findings can be useful for formulating appropriate strategies to reduce the health risk of Hg exposure and improve human health.

利用机器学习算法确定伊朗西南部居民汞暴露的关键因素。
有必要预测头发汞(Hg)水平,明确相关的有效因素,以制定预防策略,减少不同地区的汞暴露。本研究首次研究了八种机器学习(ML)模型(包括多元线性回归、决策树回归、最小绝对收缩和选择算子、多变量自适应回归样条、随机森林、极端梯度增强、k近邻和高斯过程)在预测伊朗西南部居民头发汞水平和识别影响它们的最重要因素方面的有效性。所有ML模型都使用70%的数据集进行训练,并使用基于剩余数据集的决定系数(R2)、均方根误差(RMSE)和平均绝对误差(MAE)对其性能进行评估。最后,采用排列特征重要性(PFI)方法确定影响因素的相对重要性(RI)。平均头发汞(3.31µg - 1)高于美国环境保护署(US EPA)和世界卫生组织(WHO)的标准。这表明该地区一些人的接触风险很高。极端梯度增强(XGB)模型在模拟头发汞水平方面优于其他算法,R2 = 0.61, RMSE = 2.2, MAE = 1.25。根据PFI分析,体重(RI: 43.4%)和地理位置(RI: 41.8%)是影响研究人群中汞变化的最重要人口统计学因素。此外,职业(RI: 46.1%)和食用鱼和罐装鱼的频率(RI: 22%)被确定为控制伊朗西南部头发汞变异的最重要暴露因素。这些发现可用于制定适当的策略,以减少汞暴露的健康风险和改善人类健康。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Geochemistry and Health
Environmental Geochemistry and Health 环境科学-工程:环境
CiteScore
8.00
自引率
4.80%
发文量
279
审稿时长
4.2 months
期刊介绍: Environmental Geochemistry and Health publishes original research papers and review papers across the broad field of environmental geochemistry. Environmental geochemistry and health establishes and explains links between the natural or disturbed chemical composition of the earth’s surface and the health of plants, animals and people. Beneficial elements regulate or promote enzymatic and hormonal activity whereas other elements may be toxic. Bedrock geochemistry controls the composition of soil and hence that of water and vegetation. Environmental issues, such as pollution, arising from the extraction and use of mineral resources, are discussed. The effects of contaminants introduced into the earth’s geochemical systems are examined. Geochemical surveys of soil, water and plants show how major and trace elements are distributed geographically. Associated epidemiological studies reveal the possibility of causal links between the natural or disturbed geochemical environment and disease. Experimental research illuminates the nature or consequences of natural or disturbed geochemical processes. The journal particularly welcomes novel research linking environmental geochemistry and health issues on such topics as: heavy metals (including mercury), persistent organic pollutants (POPs), and mixed chemicals emitted through human activities, such as uncontrolled recycling of electronic-waste; waste recycling; surface-atmospheric interaction processes (natural and anthropogenic emissions, vertical transport, deposition, and physical-chemical interaction) of gases and aerosols; phytoremediation/restoration of contaminated sites; food contamination and safety; environmental effects of medicines; effects and toxicity of mixed pollutants; speciation of heavy metals/metalloids; effects of mining; disturbed geochemistry from human behavior, natural or man-made hazards; particle and nanoparticle toxicology; risk and the vulnerability of populations, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信