Machine learning-based identification of critical factors for cadmium accumulation in rice grains.

IF 3.2 3区 环境科学与生态学 Q3 ENGINEERING, ENVIRONMENTAL
Weichun Yang, Jiaxin Li, Kai Nie, Pengwei Zhao, Hui Xia, Qi Li, Qi Liao, Qingzhu Li, Chunhua Dong, Zhihui Yang, Mengying Si
{"title":"Machine learning-based identification of critical factors for cadmium accumulation in rice grains.","authors":"Weichun Yang, Jiaxin Li, Kai Nie, Pengwei Zhao, Hui Xia, Qi Li, Qi Liao, Qingzhu Li, Chunhua Dong, Zhihui Yang, Mengying Si","doi":"10.1007/s10653-024-02312-9","DOIUrl":null,"url":null,"abstract":"<p><p>The aggregation of Cadmium (Cd) in rice grains is a significant threat to human healthy. The complexity of the soil-rice system, with its numerous influencing parameters, highlights the need to identify the crucial factors responsible for Cd aggregation. This study uses machine learning (ML) modeling to predict Cd aggregation in rice grains and identify the influencing factors. Data from 474 data points from 77 published works were analyzed, and eight ML models were established using different algorithms. The input variables were total soil Cd concentration (TS Cd) and extractable Cd concentration (Ex-Cd), while rice Cd concentration (Cd<sub>rice</sub>) was the output variable. Among the models, the Extremely Randomized Trees (ERT) model performed the best (TS Cd: R<sup>2</sup> = 0.825; Ex-Cd: R<sup>2</sup> = 0.792), followed by Random Forest (TS Cd: R<sup>2</sup> = 0.721; Ex-Cd: R<sup>2</sup> = 0.719). The ERT feature importance ranking analysis revealed that the essential factors responsible for Cd aggregation are cation exchange capacity (CEC), TS Cd, Water Management Model (WMM), and pH for total soil Cd as input variables. For extractable Cd as an input variable, the vital factors are CEC, Ex-Cd, pH, and WMM. The study highlights the importance of the Water Management Model and its impact on Cd concentration in rice grains, which has been overlooked in previous research.Please check and confirm that the authors and their respective affiliations have been correctly identified and amend if necessary.The authors and their respective affiliations are correct.Author details: Kindly check and confirm whether the corresponding author is correctly identified.It is correct.</p>","PeriodicalId":11759,"journal":{"name":"Environmental Geochemistry and Health","volume":"47 1","pages":"2"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Geochemistry and Health","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10653-024-02312-9","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

The aggregation of Cadmium (Cd) in rice grains is a significant threat to human healthy. The complexity of the soil-rice system, with its numerous influencing parameters, highlights the need to identify the crucial factors responsible for Cd aggregation. This study uses machine learning (ML) modeling to predict Cd aggregation in rice grains and identify the influencing factors. Data from 474 data points from 77 published works were analyzed, and eight ML models were established using different algorithms. The input variables were total soil Cd concentration (TS Cd) and extractable Cd concentration (Ex-Cd), while rice Cd concentration (Cdrice) was the output variable. Among the models, the Extremely Randomized Trees (ERT) model performed the best (TS Cd: R2 = 0.825; Ex-Cd: R2 = 0.792), followed by Random Forest (TS Cd: R2 = 0.721; Ex-Cd: R2 = 0.719). The ERT feature importance ranking analysis revealed that the essential factors responsible for Cd aggregation are cation exchange capacity (CEC), TS Cd, Water Management Model (WMM), and pH for total soil Cd as input variables. For extractable Cd as an input variable, the vital factors are CEC, Ex-Cd, pH, and WMM. The study highlights the importance of the Water Management Model and its impact on Cd concentration in rice grains, which has been overlooked in previous research.Please check and confirm that the authors and their respective affiliations have been correctly identified and amend if necessary.The authors and their respective affiliations are correct.Author details: Kindly check and confirm whether the corresponding author is correctly identified.It is correct.

基于机器学习的水稻谷粒镉积累关键因素识别。
镉(Cd)在米粒中的聚集对人类健康构成重大威胁。土壤-水稻系统的复杂性及其众多的影响参数,凸显了识别导致镉聚集的关键因素的必要性。本研究利用机器学习(ML)建模来预测米粒中的镉聚集并确定影响因素。研究分析了 77 篇已发表论文中的 474 个数据点,并使用不同算法建立了 8 个 ML 模型。输入变量为土壤总镉浓度(TS Cd)和可提取镉浓度(Ex-Cd),输出变量为水稻镉浓度(Cdrice)。在这些模型中,极随机树(ERT)模型表现最好(TS Cd:R2 = 0.825;Ex-Cd:R2 = 0.792),其次是随机森林(TS Cd:R2 = 0.721;Ex-Cd:R2 = 0.719)。ERT 特征重要性排序分析表明,对于土壤总镉输入变量,阳离子交换容量(CEC)、TS Cd、水管理模式(WMM)和 pH 是造成镉聚集的基本因素。对于作为输入变量的可提取镉,关键因素是 CEC、Ex-Cd、pH 值和 WMM。该研究强调了水管理模式的重要性及其对稻谷中镉浓度的影响,而这一点在以往的研究中被忽视了。请检查并确认作者及其所属单位是否正确,必要时进行修改:请检查并确认通讯作者的身份是否正确。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Geochemistry and Health
Environmental Geochemistry and Health 环境科学-工程:环境
CiteScore
8.00
自引率
4.80%
发文量
279
审稿时长
4.2 months
期刊介绍: Environmental Geochemistry and Health publishes original research papers and review papers across the broad field of environmental geochemistry. Environmental geochemistry and health establishes and explains links between the natural or disturbed chemical composition of the earth’s surface and the health of plants, animals and people. Beneficial elements regulate or promote enzymatic and hormonal activity whereas other elements may be toxic. Bedrock geochemistry controls the composition of soil and hence that of water and vegetation. Environmental issues, such as pollution, arising from the extraction and use of mineral resources, are discussed. The effects of contaminants introduced into the earth’s geochemical systems are examined. Geochemical surveys of soil, water and plants show how major and trace elements are distributed geographically. Associated epidemiological studies reveal the possibility of causal links between the natural or disturbed geochemical environment and disease. Experimental research illuminates the nature or consequences of natural or disturbed geochemical processes. The journal particularly welcomes novel research linking environmental geochemistry and health issues on such topics as: heavy metals (including mercury), persistent organic pollutants (POPs), and mixed chemicals emitted through human activities, such as uncontrolled recycling of electronic-waste; waste recycling; surface-atmospheric interaction processes (natural and anthropogenic emissions, vertical transport, deposition, and physical-chemical interaction) of gases and aerosols; phytoremediation/restoration of contaminated sites; food contamination and safety; environmental effects of medicines; effects and toxicity of mixed pollutants; speciation of heavy metals/metalloids; effects of mining; disturbed geochemistry from human behavior, natural or man-made hazards; particle and nanoparticle toxicology; risk and the vulnerability of populations, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信