{"title":"Predicting Sorption of Diverse Organic Compounds in Soil-Water Systems: Meta-analysis, Machine Learning Modeling, and Global Soil Mapping","authors":"Jiachun Sun, Kai Zhang, Huichun Zhang","doi":"10.1016/j.jhazmat.2025.137480","DOIUrl":null,"url":null,"abstract":"In recent decades, the environmental detection of various organic compounds (OCs) has highlighted the limitations of conventional soil-water sorption models, which simplify complex experimental conditions and often overlook OCs with polyfunctional and ionizable structures. To address these shortcomings, we compiled a comprehensive soil-water sorption dataset encompassing 20,945 data points for 419 OCs with various functional groups and 1037 different soils. Meta-analysis of the dataset revealed the trends of soil sorption associated with OC substructures, soil properties, and solution conditions. Machine learning models employing the XGBoost algorithm, in conjunction with MACCS fingerprints and experimental conditions, were developed to cover the entire spectrum of speciation for cationic, neutral, and anionic species. Among these, the individual models tailored to each speciation achieved an overall root-mean-square-error value of 0.32 for log <em>K</em><sub>d</sub>. Model interpretation revealed that the models correctly understood the contributions of various substructures, such as multiple aromatic rings and nitrogen or oxygen atoms, to sorption. The models were also found to accurately capture isotherm nonlinearity and the pH effect on the sorption of ionizable OCs. Finally, utilizing soil properties from the Harmonized World Soil Database, the models predicted the sorption of diverse OCs based on global soil properties under simulated environmental scenarios.","PeriodicalId":361,"journal":{"name":"Journal of Hazardous Materials","volume":"61 1","pages":""},"PeriodicalIF":12.2000,"publicationDate":"2025-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hazardous Materials","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jhazmat.2025.137480","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
In recent decades, the environmental detection of various organic compounds (OCs) has highlighted the limitations of conventional soil-water sorption models, which simplify complex experimental conditions and often overlook OCs with polyfunctional and ionizable structures. To address these shortcomings, we compiled a comprehensive soil-water sorption dataset encompassing 20,945 data points for 419 OCs with various functional groups and 1037 different soils. Meta-analysis of the dataset revealed the trends of soil sorption associated with OC substructures, soil properties, and solution conditions. Machine learning models employing the XGBoost algorithm, in conjunction with MACCS fingerprints and experimental conditions, were developed to cover the entire spectrum of speciation for cationic, neutral, and anionic species. Among these, the individual models tailored to each speciation achieved an overall root-mean-square-error value of 0.32 for log Kd. Model interpretation revealed that the models correctly understood the contributions of various substructures, such as multiple aromatic rings and nitrogen or oxygen atoms, to sorption. The models were also found to accurately capture isotherm nonlinearity and the pH effect on the sorption of ionizable OCs. Finally, utilizing soil properties from the Harmonized World Soil Database, the models predicted the sorption of diverse OCs based on global soil properties under simulated environmental scenarios.
期刊介绍:
The Journal of Hazardous Materials serves as a global platform for promoting cutting-edge research in the field of Environmental Science and Engineering. Our publication features a wide range of articles, including full-length research papers, review articles, and perspectives, with the aim of enhancing our understanding of the dangers and risks associated with various materials concerning public health and the environment. It is important to note that the term "environmental contaminants" refers specifically to substances that pose hazardous effects through contamination, while excluding those that do not have such impacts on the environment or human health. Moreover, we emphasize the distinction between wastes and hazardous materials in order to provide further clarity on the scope of the journal. We have a keen interest in exploring specific compounds and microbial agents that have adverse effects on the environment.