{"title":"基于机器学习方法的聚合物改性土-膨润土防渗墙回填体导电性预测及可解释性分析","authors":"Heng Zhuang , Xian-Lei Fu , Hao Ni , Yan-Jun Du","doi":"10.1016/j.envres.2025.122960","DOIUrl":null,"url":null,"abstract":"<div><div>Soil-bentonite backfills with low hydraulic conductivity are extensively used to control flow of groundwater at contaminated sites. Predicting hydraulic conductivity of backfills is challenging due to nonlinear couplings among chemistry reactions, backfill compositions, and in-situ stresses. This study developed a machine-learning framework that unifieed a curated dataset of 108 laboratory observations, systematic benchmarking of twelve algorithms, SHAP-based interpretability, and mechanism-guided feature subset optimization to balance accuracy with measurement cost. Validation used an 80/20 training/testing split with five-fold cross-validation for hyperparameter tuning, multi-metric evaluation, and a Taylor diagram for consistency appraisal. Peak test performance showed clear stratification: Gradient Boosting Regressor (<em>R</em><sup><em>2</em></sup> = 0.794), Support Vector Regression (<em>R</em><sup><em>2</em></sup> = 0.788), XGBoost (<em>R</em><sup><em>2</em></sup> = 0.783), Random Forest (<em>R</em><sup><em>2</em></sup> = 0.736), LightGBM (<em>R</em><sup><em>2</em></sup> = 0.698). Taylor diagram corroborated this ordering by proximity to the reference point. SHAP analysis identifieed leachate pH as the top contributor in Gradient Boosting Regressor, and explained why low ionic strength lowered the predicted hydraulic conductivity whereas higher ionic strength increased it. Feature-count analysis indicateed that a stable baseline requireed at least 5 features and that gains generally tapered after 9 to 10 features. The results motivated a three-tier workflow: rapid screening with a compact interaction set, routine design with curated features, and final validation with the all features set.</div></div>","PeriodicalId":312,"journal":{"name":"Environmental Research","volume":"286 ","pages":"Article 122960"},"PeriodicalIF":7.7000,"publicationDate":"2025-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction and interpretability analysis of hydraulic conductivity for polymer-amended soil-bentonite cutoff wall backfills using machine learning methods\",\"authors\":\"Heng Zhuang , Xian-Lei Fu , Hao Ni , Yan-Jun Du\",\"doi\":\"10.1016/j.envres.2025.122960\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Soil-bentonite backfills with low hydraulic conductivity are extensively used to control flow of groundwater at contaminated sites. Predicting hydraulic conductivity of backfills is challenging due to nonlinear couplings among chemistry reactions, backfill compositions, and in-situ stresses. This study developed a machine-learning framework that unifieed a curated dataset of 108 laboratory observations, systematic benchmarking of twelve algorithms, SHAP-based interpretability, and mechanism-guided feature subset optimization to balance accuracy with measurement cost. Validation used an 80/20 training/testing split with five-fold cross-validation for hyperparameter tuning, multi-metric evaluation, and a Taylor diagram for consistency appraisal. Peak test performance showed clear stratification: Gradient Boosting Regressor (<em>R</em><sup><em>2</em></sup> = 0.794), Support Vector Regression (<em>R</em><sup><em>2</em></sup> = 0.788), XGBoost (<em>R</em><sup><em>2</em></sup> = 0.783), Random Forest (<em>R</em><sup><em>2</em></sup> = 0.736), LightGBM (<em>R</em><sup><em>2</em></sup> = 0.698). Taylor diagram corroborated this ordering by proximity to the reference point. SHAP analysis identifieed leachate pH as the top contributor in Gradient Boosting Regressor, and explained why low ionic strength lowered the predicted hydraulic conductivity whereas higher ionic strength increased it. Feature-count analysis indicateed that a stable baseline requireed at least 5 features and that gains generally tapered after 9 to 10 features. The results motivated a three-tier workflow: rapid screening with a compact interaction set, routine design with curated features, and final validation with the all features set.</div></div>\",\"PeriodicalId\":312,\"journal\":{\"name\":\"Environmental Research\",\"volume\":\"286 \",\"pages\":\"Article 122960\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2025-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Research\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0013935125022133\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0013935125022133","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Prediction and interpretability analysis of hydraulic conductivity for polymer-amended soil-bentonite cutoff wall backfills using machine learning methods
Soil-bentonite backfills with low hydraulic conductivity are extensively used to control flow of groundwater at contaminated sites. Predicting hydraulic conductivity of backfills is challenging due to nonlinear couplings among chemistry reactions, backfill compositions, and in-situ stresses. This study developed a machine-learning framework that unifieed a curated dataset of 108 laboratory observations, systematic benchmarking of twelve algorithms, SHAP-based interpretability, and mechanism-guided feature subset optimization to balance accuracy with measurement cost. Validation used an 80/20 training/testing split with five-fold cross-validation for hyperparameter tuning, multi-metric evaluation, and a Taylor diagram for consistency appraisal. Peak test performance showed clear stratification: Gradient Boosting Regressor (R2 = 0.794), Support Vector Regression (R2 = 0.788), XGBoost (R2 = 0.783), Random Forest (R2 = 0.736), LightGBM (R2 = 0.698). Taylor diagram corroborated this ordering by proximity to the reference point. SHAP analysis identifieed leachate pH as the top contributor in Gradient Boosting Regressor, and explained why low ionic strength lowered the predicted hydraulic conductivity whereas higher ionic strength increased it. Feature-count analysis indicateed that a stable baseline requireed at least 5 features and that gains generally tapered after 9 to 10 features. The results motivated a three-tier workflow: rapid screening with a compact interaction set, routine design with curated features, and final validation with the all features set.
期刊介绍:
The Environmental Research journal presents a broad range of interdisciplinary research, focused on addressing worldwide environmental concerns and featuring innovative findings. Our publication strives to explore relevant anthropogenic issues across various environmental sectors, showcasing practical applications in real-life settings.