{"title":"Prediction of Zinc, Cadmium, and Arsenic in European Soils Using Multi-End Machine Learning Models","authors":"Mohammad Sadegh Barkhordari, Chongchong Qi","doi":"10.1016/j.jhazmat.2025.137800","DOIUrl":null,"url":null,"abstract":"Heavy metal contamination in soil is a major environmental and public health concern, especially in regions with substantial industrial and agricultural activities. Conventional predictive models often focus on single contaminants, limiting their utility for comprehensive environmental monitoring. This study addressed these limitations by developing an advanced multi-end ensemble convolutional neural network model capable of simultaneously predicting the concentrations of cadmium, arsenic, and zinc in European soils. A comprehensive dataset with 18 diverse factors was prepared, including soil properties, climatic factors, and anthropogenic activities. Moreover, the model compared four ensemble learning techniques in contamination prediction, including simple averaging, snapshot ensembles, integrated stacking, and separate stacking. Among these, the separate stacking model with random forest regressor meta-model achieved the highest accuracy, with a mean spared error of 0.0378, a mean absolute error of 0.0785, and a coefficient of determination of 0.79 in the testing phases. Sensitivity analysis highlighted farming area, road length, nitrogen content, and mean annual temperature as key factors influencing metal concentrations. To enhance accessibility, a GUI-based web application was developed, allowing users to enter relevant factors and receive real-time predictions of contamination levels. This application empowers stakeholders, such as environmental regulators and policymakers, to make informed, data-driven decisions for targeted remediation. These findings underscore the critical role of integrated machine learning approaches in environmental science, offering a powerful tool for identifying contamination hotspots, supporting soil health management, and promoting sustainable land use.<h3>Environmental Implication</h3>The study reveals significant environmental implications, showing that soil contamination by arsenic, cadmium, and zinc across Europe is heavily influenced by human activities, particularly intensive agricultural practices that enhance the mobility of these heavy metals. Findings indicate that factors such as farming area, road length, nitrogen content, and mean annual temperature play a crucial role in metal accumulation. The advanced, multi-ended predictive models developed in this research, paired with an accessible web-based application, provide a practical tool for policymakers, researchers, and land managers. This tool enables real-time contamination assessments and supports the creation of policies aimed at controlling soil pollution, safeguarding public health, and promoting sustainable agricultural practices. These findings underscore the urgent need for integrated soil management strategies and contamination mitigation efforts, emphasizing how targeted actions can reduce the long-term impacts of heavy metal pollution on ecosystems and human health.","PeriodicalId":361,"journal":{"name":"Journal of Hazardous Materials","volume":"17 1","pages":""},"PeriodicalIF":12.2000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hazardous Materials","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jhazmat.2025.137800","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Heavy metal contamination in soil is a major environmental and public health concern, especially in regions with substantial industrial and agricultural activities. Conventional predictive models often focus on single contaminants, limiting their utility for comprehensive environmental monitoring. This study addressed these limitations by developing an advanced multi-end ensemble convolutional neural network model capable of simultaneously predicting the concentrations of cadmium, arsenic, and zinc in European soils. A comprehensive dataset with 18 diverse factors was prepared, including soil properties, climatic factors, and anthropogenic activities. Moreover, the model compared four ensemble learning techniques in contamination prediction, including simple averaging, snapshot ensembles, integrated stacking, and separate stacking. Among these, the separate stacking model with random forest regressor meta-model achieved the highest accuracy, with a mean spared error of 0.0378, a mean absolute error of 0.0785, and a coefficient of determination of 0.79 in the testing phases. Sensitivity analysis highlighted farming area, road length, nitrogen content, and mean annual temperature as key factors influencing metal concentrations. To enhance accessibility, a GUI-based web application was developed, allowing users to enter relevant factors and receive real-time predictions of contamination levels. This application empowers stakeholders, such as environmental regulators and policymakers, to make informed, data-driven decisions for targeted remediation. These findings underscore the critical role of integrated machine learning approaches in environmental science, offering a powerful tool for identifying contamination hotspots, supporting soil health management, and promoting sustainable land use.
Environmental Implication
The study reveals significant environmental implications, showing that soil contamination by arsenic, cadmium, and zinc across Europe is heavily influenced by human activities, particularly intensive agricultural practices that enhance the mobility of these heavy metals. Findings indicate that factors such as farming area, road length, nitrogen content, and mean annual temperature play a crucial role in metal accumulation. The advanced, multi-ended predictive models developed in this research, paired with an accessible web-based application, provide a practical tool for policymakers, researchers, and land managers. This tool enables real-time contamination assessments and supports the creation of policies aimed at controlling soil pollution, safeguarding public health, and promoting sustainable agricultural practices. These findings underscore the urgent need for integrated soil management strategies and contamination mitigation efforts, emphasizing how targeted actions can reduce the long-term impacts of heavy metal pollution on ecosystems and human health.
期刊介绍:
The Journal of Hazardous Materials serves as a global platform for promoting cutting-edge research in the field of Environmental Science and Engineering. Our publication features a wide range of articles, including full-length research papers, review articles, and perspectives, with the aim of enhancing our understanding of the dangers and risks associated with various materials concerning public health and the environment. It is important to note that the term "environmental contaminants" refers specifically to substances that pose hazardous effects through contamination, while excluding those that do not have such impacts on the environment or human health. Moreover, we emphasize the distinction between wastes and hazardous materials in order to provide further clarity on the scope of the journal. We have a keen interest in exploring specific compounds and microbial agents that have adverse effects on the environment.