{"title":"使用自动机器学习模型预测作物谷物中的重金属浓度。","authors":"Ye-Xiang Zhang, Feng-Xian Chen, Yu-Hong Zhang, Xi-Juan Chen","doi":"10.13287/j.1001-9332.202506.018","DOIUrl":null,"url":null,"abstract":"<p><p>With the acceleration of industrialization and the intensification of agricultural activities, heavy metals (HMs) pollution in crops has become an issue that can not be ignored in current agricultural production. Based on 791 data sets from 54 publications, we predicted HMs concentrations in crop grains by using automated machine learning (AutoML) models. Ten factors were used as input variables: organic fertilizer application, HMs concentration in organic fertilizer, soil HMs concentration, soil organic matter, pH, cation exchange capacity, clay content, silt content, sand content and plant types. The concentrations of chromium (Cr), cadmium (Cd), lead (Pb), arsenic (As) and mercury (Hg) in crop grains were set as output variables. We evaluated the simulation and prediction performance of six models: deep learning (DL), distributed random forest (DRF), extremely randomized trees (XRT), stacked ensemble (SE), gradient boosting machine (GBM) and generalized linear model (GLM), with which we analyzed the key factors driving heavy metal accumulation in crop grains. The results showed that the optimal prediction model differed for different HMs. The DL model provided the best prediction for Cr, Pb, As and Hg, while the GBM model achieved the highest prediction accuracy for Cd. Feature importance and SHAP analysis revealed that the application of organic fertilizer and plant type were the key factors influencing HMs accumulation in crop grains. Organic fertilizer application, soil HMs concentration, organic fertilizer HMs concentration, and sand content were positively correlated with HMs concentration in crop grains, while cation exchange capacity, pH, organic matter, and clay content were negatively correlated with heavy metal concentration in crop grains. In summary, the DL and GBM models performed better in predicting heavy metal concentrations in crop grains. The input risk of heavy metals during organic fertilizer application must be strictly controlled.</p>","PeriodicalId":35942,"journal":{"name":"应用生态学报","volume":"36 6","pages":"1889-1897"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting heavy metal concentration in crop grain using automated machine learning models.\",\"authors\":\"Ye-Xiang Zhang, Feng-Xian Chen, Yu-Hong Zhang, Xi-Juan Chen\",\"doi\":\"10.13287/j.1001-9332.202506.018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>With the acceleration of industrialization and the intensification of agricultural activities, heavy metals (HMs) pollution in crops has become an issue that can not be ignored in current agricultural production. Based on 791 data sets from 54 publications, we predicted HMs concentrations in crop grains by using automated machine learning (AutoML) models. Ten factors were used as input variables: organic fertilizer application, HMs concentration in organic fertilizer, soil HMs concentration, soil organic matter, pH, cation exchange capacity, clay content, silt content, sand content and plant types. The concentrations of chromium (Cr), cadmium (Cd), lead (Pb), arsenic (As) and mercury (Hg) in crop grains were set as output variables. We evaluated the simulation and prediction performance of six models: deep learning (DL), distributed random forest (DRF), extremely randomized trees (XRT), stacked ensemble (SE), gradient boosting machine (GBM) and generalized linear model (GLM), with which we analyzed the key factors driving heavy metal accumulation in crop grains. The results showed that the optimal prediction model differed for different HMs. The DL model provided the best prediction for Cr, Pb, As and Hg, while the GBM model achieved the highest prediction accuracy for Cd. Feature importance and SHAP analysis revealed that the application of organic fertilizer and plant type were the key factors influencing HMs accumulation in crop grains. Organic fertilizer application, soil HMs concentration, organic fertilizer HMs concentration, and sand content were positively correlated with HMs concentration in crop grains, while cation exchange capacity, pH, organic matter, and clay content were negatively correlated with heavy metal concentration in crop grains. In summary, the DL and GBM models performed better in predicting heavy metal concentrations in crop grains. The input risk of heavy metals during organic fertilizer application must be strictly controlled.</p>\",\"PeriodicalId\":35942,\"journal\":{\"name\":\"应用生态学报\",\"volume\":\"36 6\",\"pages\":\"1889-1897\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"应用生态学报\",\"FirstCategoryId\":\"1087\",\"ListUrlMain\":\"https://doi.org/10.13287/j.1001-9332.202506.018\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Environmental Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"应用生态学报","FirstCategoryId":"1087","ListUrlMain":"https://doi.org/10.13287/j.1001-9332.202506.018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Environmental Science","Score":null,"Total":0}
Predicting heavy metal concentration in crop grain using automated machine learning models.
With the acceleration of industrialization and the intensification of agricultural activities, heavy metals (HMs) pollution in crops has become an issue that can not be ignored in current agricultural production. Based on 791 data sets from 54 publications, we predicted HMs concentrations in crop grains by using automated machine learning (AutoML) models. Ten factors were used as input variables: organic fertilizer application, HMs concentration in organic fertilizer, soil HMs concentration, soil organic matter, pH, cation exchange capacity, clay content, silt content, sand content and plant types. The concentrations of chromium (Cr), cadmium (Cd), lead (Pb), arsenic (As) and mercury (Hg) in crop grains were set as output variables. We evaluated the simulation and prediction performance of six models: deep learning (DL), distributed random forest (DRF), extremely randomized trees (XRT), stacked ensemble (SE), gradient boosting machine (GBM) and generalized linear model (GLM), with which we analyzed the key factors driving heavy metal accumulation in crop grains. The results showed that the optimal prediction model differed for different HMs. The DL model provided the best prediction for Cr, Pb, As and Hg, while the GBM model achieved the highest prediction accuracy for Cd. Feature importance and SHAP analysis revealed that the application of organic fertilizer and plant type were the key factors influencing HMs accumulation in crop grains. Organic fertilizer application, soil HMs concentration, organic fertilizer HMs concentration, and sand content were positively correlated with HMs concentration in crop grains, while cation exchange capacity, pH, organic matter, and clay content were negatively correlated with heavy metal concentration in crop grains. In summary, the DL and GBM models performed better in predicting heavy metal concentrations in crop grains. The input risk of heavy metals during organic fertilizer application must be strictly controlled.