使用自动机器学习模型预测作物谷物中的重金属浓度。

Q3 Environmental Science
Ye-Xiang Zhang, Feng-Xian Chen, Yu-Hong Zhang, Xi-Juan Chen
{"title":"使用自动机器学习模型预测作物谷物中的重金属浓度。","authors":"Ye-Xiang Zhang, Feng-Xian Chen, Yu-Hong Zhang, Xi-Juan Chen","doi":"10.13287/j.1001-9332.202506.018","DOIUrl":null,"url":null,"abstract":"<p><p>With the acceleration of industrialization and the intensification of agricultural activities, heavy metals (HMs) pollution in crops has become an issue that can not be ignored in current agricultural production. Based on 791 data sets from 54 publications, we predicted HMs concentrations in crop grains by using automated machine learning (AutoML) models. Ten factors were used as input variables: organic fertilizer application, HMs concentration in organic fertilizer, soil HMs concentration, soil organic matter, pH, cation exchange capacity, clay content, silt content, sand content and plant types. The concentrations of chromium (Cr), cadmium (Cd), lead (Pb), arsenic (As) and mercury (Hg) in crop grains were set as output variables. We evaluated the simulation and prediction performance of six models: deep learning (DL), distributed random forest (DRF), extremely randomized trees (XRT), stacked ensemble (SE), gradient boosting machine (GBM) and generalized linear model (GLM), with which we analyzed the key factors driving heavy metal accumulation in crop grains. The results showed that the optimal prediction model differed for different HMs. The DL model provided the best prediction for Cr, Pb, As and Hg, while the GBM model achieved the highest prediction accuracy for Cd. Feature importance and SHAP analysis revealed that the application of organic fertilizer and plant type were the key factors influencing HMs accumulation in crop grains. Organic fertilizer application, soil HMs concentration, organic fertilizer HMs concentration, and sand content were positively correlated with HMs concentration in crop grains, while cation exchange capacity, pH, organic matter, and clay content were negatively correlated with heavy metal concentration in crop grains. In summary, the DL and GBM models performed better in predicting heavy metal concentrations in crop grains. The input risk of heavy metals during organic fertilizer application must be strictly controlled.</p>","PeriodicalId":35942,"journal":{"name":"应用生态学报","volume":"36 6","pages":"1889-1897"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting heavy metal concentration in crop grain using automated machine learning models.\",\"authors\":\"Ye-Xiang Zhang, Feng-Xian Chen, Yu-Hong Zhang, Xi-Juan Chen\",\"doi\":\"10.13287/j.1001-9332.202506.018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>With the acceleration of industrialization and the intensification of agricultural activities, heavy metals (HMs) pollution in crops has become an issue that can not be ignored in current agricultural production. Based on 791 data sets from 54 publications, we predicted HMs concentrations in crop grains by using automated machine learning (AutoML) models. Ten factors were used as input variables: organic fertilizer application, HMs concentration in organic fertilizer, soil HMs concentration, soil organic matter, pH, cation exchange capacity, clay content, silt content, sand content and plant types. The concentrations of chromium (Cr), cadmium (Cd), lead (Pb), arsenic (As) and mercury (Hg) in crop grains were set as output variables. We evaluated the simulation and prediction performance of six models: deep learning (DL), distributed random forest (DRF), extremely randomized trees (XRT), stacked ensemble (SE), gradient boosting machine (GBM) and generalized linear model (GLM), with which we analyzed the key factors driving heavy metal accumulation in crop grains. The results showed that the optimal prediction model differed for different HMs. The DL model provided the best prediction for Cr, Pb, As and Hg, while the GBM model achieved the highest prediction accuracy for Cd. Feature importance and SHAP analysis revealed that the application of organic fertilizer and plant type were the key factors influencing HMs accumulation in crop grains. Organic fertilizer application, soil HMs concentration, organic fertilizer HMs concentration, and sand content were positively correlated with HMs concentration in crop grains, while cation exchange capacity, pH, organic matter, and clay content were negatively correlated with heavy metal concentration in crop grains. In summary, the DL and GBM models performed better in predicting heavy metal concentrations in crop grains. The input risk of heavy metals during organic fertilizer application must be strictly controlled.</p>\",\"PeriodicalId\":35942,\"journal\":{\"name\":\"应用生态学报\",\"volume\":\"36 6\",\"pages\":\"1889-1897\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"应用生态学报\",\"FirstCategoryId\":\"1087\",\"ListUrlMain\":\"https://doi.org/10.13287/j.1001-9332.202506.018\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Environmental Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"应用生态学报","FirstCategoryId":"1087","ListUrlMain":"https://doi.org/10.13287/j.1001-9332.202506.018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Environmental Science","Score":null,"Total":0}
引用次数: 0

摘要

随着工业化进程的加快和农业生产活动的集约化,农作物重金属污染已成为当前农业生产中不可忽视的问题。基于来自54份出版物的791个数据集,我们使用自动机器学习(AutoML)模型预测了作物谷物中HMs的浓度。以有机肥施用量、有机肥中hm浓度、土壤hm浓度、土壤有机质、pH、阳离子交换容量、粘粒含量、粉粒含量、砂粒含量和植物类型10个因素作为输入变量。以作物籽粒中铬(Cr)、镉(Cd)、铅(Pb)、砷(As)和汞(Hg)的浓度为输出变量。利用深度学习(DL)、分布随机森林(DRF)、极度随机树(XRT)、堆叠集成(SE)、梯度增强机(GBM)和广义线性模型(GLM) 6种模型的模拟和预测性能,分析了作物籽粒重金属积累的关键驱动因素。结果表明,不同药材的最佳预测模型存在差异。DL模型对Cr、Pb、As和Hg的预测效果最好,而GBM模型对Cd的预测精度最高。特征重要性和SHAP分析表明,有机肥施用和植株类型是影响作物籽粒有机质积累的关键因素。有机肥施用量、土壤hm浓度、有机肥hm浓度、砂粒含量与作物籽粒hm浓度呈显著正相关,阳离子交换容量、pH、有机质、粘土含量与作物籽粒重金属浓度呈显著负相关。综上所述,DL和GBM模型对作物籽粒重金属浓度的预测效果较好。必须严格控制有机肥施用过程中重金属的投入风险。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Predicting heavy metal concentration in crop grain using automated machine learning models.

With the acceleration of industrialization and the intensification of agricultural activities, heavy metals (HMs) pollution in crops has become an issue that can not be ignored in current agricultural production. Based on 791 data sets from 54 publications, we predicted HMs concentrations in crop grains by using automated machine learning (AutoML) models. Ten factors were used as input variables: organic fertilizer application, HMs concentration in organic fertilizer, soil HMs concentration, soil organic matter, pH, cation exchange capacity, clay content, silt content, sand content and plant types. The concentrations of chromium (Cr), cadmium (Cd), lead (Pb), arsenic (As) and mercury (Hg) in crop grains were set as output variables. We evaluated the simulation and prediction performance of six models: deep learning (DL), distributed random forest (DRF), extremely randomized trees (XRT), stacked ensemble (SE), gradient boosting machine (GBM) and generalized linear model (GLM), with which we analyzed the key factors driving heavy metal accumulation in crop grains. The results showed that the optimal prediction model differed for different HMs. The DL model provided the best prediction for Cr, Pb, As and Hg, while the GBM model achieved the highest prediction accuracy for Cd. Feature importance and SHAP analysis revealed that the application of organic fertilizer and plant type were the key factors influencing HMs accumulation in crop grains. Organic fertilizer application, soil HMs concentration, organic fertilizer HMs concentration, and sand content were positively correlated with HMs concentration in crop grains, while cation exchange capacity, pH, organic matter, and clay content were negatively correlated with heavy metal concentration in crop grains. In summary, the DL and GBM models performed better in predicting heavy metal concentrations in crop grains. The input risk of heavy metals during organic fertilizer application must be strictly controlled.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
应用生态学报
应用生态学报 Environmental Science-Ecology
CiteScore
2.50
自引率
0.00%
发文量
11393
期刊介绍:
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信