Alexis Munyengabe, Maria Fezile Banda, Wilma Augustyn
{"title":"Predicting plant uptake of potential contaminants of emerging concerns using machine learning models (2018–2025): A global review","authors":"Alexis Munyengabe, Maria Fezile Banda, Wilma Augustyn","doi":"10.1016/j.rineng.2025.106050","DOIUrl":null,"url":null,"abstract":"<div><div>This comprehensive review (2018–2025) synthesizes studies conducted on machine learning (ML) models to predict plant uptake of contaminants of emerging concerns (CECs). CECs are a global concern driven by anthropogenic activities, and their plant uptake is critical for environmental remediation and food safety. ML offers powerful predictive opportunities, fostering ecosystem preservation, human health and sustainable development. The review analyzes diverse ML algorithms, including ensemble models (random forest, gradient boosted regression trees, eXtreme gradient boosting) and deep learning (deep neural networks, recurrent neural networks, long short-term memory) alongside key predictors like soil pH, organic matter, and plant traits. It identifies dominant predictors and modelling approaches while highlighting significant research gaps: limited data, inconsistent feature reporting, and underexplored uncertainty-sensitivity coupling. A notable geographic imbalance exists, with China dominating research (82.1 % of 28 studies), followed by the USA (14.3 %) and Italy (3.6 %). Africa, particularly South Africa, is significantly underrepresented despite prevalent CEC pollution in African water bodies, including those used for irrigation, and agricultural vulnerabilities. This reveals a critical gap in global efforts to address crop contaminant risk, necessitating model retraining or transfer learning for African contexts given existing models' China-centric training. The findings underscore the urgent need for standardized databases, global sensitivity and uncertainty analysis and expanded geographic representation to enhance model generalizability and interpretability. This synthesis bridges environmental chemistry, plant physiology and artificial intelligence in predictive contaminant modelling, providing a foundation for future, globally relevant research.</div></div>","PeriodicalId":36919,"journal":{"name":"Results in Engineering","volume":"27 ","pages":"Article 106050"},"PeriodicalIF":7.9000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Results in Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S259012302502122X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
This comprehensive review (2018–2025) synthesizes studies conducted on machine learning (ML) models to predict plant uptake of contaminants of emerging concerns (CECs). CECs are a global concern driven by anthropogenic activities, and their plant uptake is critical for environmental remediation and food safety. ML offers powerful predictive opportunities, fostering ecosystem preservation, human health and sustainable development. The review analyzes diverse ML algorithms, including ensemble models (random forest, gradient boosted regression trees, eXtreme gradient boosting) and deep learning (deep neural networks, recurrent neural networks, long short-term memory) alongside key predictors like soil pH, organic matter, and plant traits. It identifies dominant predictors and modelling approaches while highlighting significant research gaps: limited data, inconsistent feature reporting, and underexplored uncertainty-sensitivity coupling. A notable geographic imbalance exists, with China dominating research (82.1 % of 28 studies), followed by the USA (14.3 %) and Italy (3.6 %). Africa, particularly South Africa, is significantly underrepresented despite prevalent CEC pollution in African water bodies, including those used for irrigation, and agricultural vulnerabilities. This reveals a critical gap in global efforts to address crop contaminant risk, necessitating model retraining or transfer learning for African contexts given existing models' China-centric training. The findings underscore the urgent need for standardized databases, global sensitivity and uncertainty analysis and expanded geographic representation to enhance model generalizability and interpretability. This synthesis bridges environmental chemistry, plant physiology and artificial intelligence in predictive contaminant modelling, providing a foundation for future, globally relevant research.