Jinglun Huang, Zhixin Cao, Leiping Ye, Jie Ren, Jiaxue Wu
{"title":"泥沙沉降速度的智能预测:采用可解释的机器学习和量化环境因素的影响","authors":"Jinglun Huang, Zhixin Cao, Leiping Ye, Jie Ren, Jiaxue Wu","doi":"10.1016/j.margeo.2025.107631","DOIUrl":null,"url":null,"abstract":"<div><div>Settling velocity of flocs resulting from cohesive sediment flocculation, significantly influenced by hydrological factors (e.g., turbulence shear, salinity, suspended sediment concentration and sediment size), is a key parameter to evaluate sediment transport in estuarine and coastal ocean. Measurement and prediction of in-situ settling velocity for sediment flocs are challenging at current stage due to the complex estuarine dynamics and technique constraints. A powerful and interpretable prediction model is urgently required. To meet this requirement, we propose a new paradigm in predicting settling velocity with interpretable machine learning (ML) model by combination of ML with Shapley additive explanation (SHAP) using 4 hydrological parameters as independent input features. Eight ML algorithms (including recently proposed XGBoost, LightGBM and so on) and ensemble learning (Stacking) were employed, with Bayesian Optimization Algorithm to determine the hyper-parameters. XGBoost model was found to have the highest prediction accuracy in terms of 6 performance metrics, among the 8 base learners. Stacking of these models outperform each component model. With the powerful SHAP methods, the proposed paradigm is able to interpret/explain ML model predictions and highlight the highly influential features for the system of sediment dynamics in both local and global points of view. Additionally, the developed ML model provides transparent insights into the contribution of each feature without compromising predictive accuracy. These ML-based models outperform conventional modified Stokes law or empirical equations. Specially, physics-informed neural network (PINN) is employed for prediction of floc settling velocity, which is also a useful approach to overcome the limitations of data driven model characterized by black box. Combinations of ML with physical law and SHAP, the key of the proposed paradigm, could be very powerful tool for researchers on sediment transport.</div></div>","PeriodicalId":18229,"journal":{"name":"Marine Geology","volume":"489 ","pages":"Article 107631"},"PeriodicalIF":2.2000,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Intelligent prediction of sediment floc settling velocity: Employing explainable machine learning and quantifying the impacts of environmental factors\",\"authors\":\"Jinglun Huang, Zhixin Cao, Leiping Ye, Jie Ren, Jiaxue Wu\",\"doi\":\"10.1016/j.margeo.2025.107631\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Settling velocity of flocs resulting from cohesive sediment flocculation, significantly influenced by hydrological factors (e.g., turbulence shear, salinity, suspended sediment concentration and sediment size), is a key parameter to evaluate sediment transport in estuarine and coastal ocean. Measurement and prediction of in-situ settling velocity for sediment flocs are challenging at current stage due to the complex estuarine dynamics and technique constraints. A powerful and interpretable prediction model is urgently required. To meet this requirement, we propose a new paradigm in predicting settling velocity with interpretable machine learning (ML) model by combination of ML with Shapley additive explanation (SHAP) using 4 hydrological parameters as independent input features. Eight ML algorithms (including recently proposed XGBoost, LightGBM and so on) and ensemble learning (Stacking) were employed, with Bayesian Optimization Algorithm to determine the hyper-parameters. XGBoost model was found to have the highest prediction accuracy in terms of 6 performance metrics, among the 8 base learners. Stacking of these models outperform each component model. With the powerful SHAP methods, the proposed paradigm is able to interpret/explain ML model predictions and highlight the highly influential features for the system of sediment dynamics in both local and global points of view. Additionally, the developed ML model provides transparent insights into the contribution of each feature without compromising predictive accuracy. These ML-based models outperform conventional modified Stokes law or empirical equations. Specially, physics-informed neural network (PINN) is employed for prediction of floc settling velocity, which is also a useful approach to overcome the limitations of data driven model characterized by black box. Combinations of ML with physical law and SHAP, the key of the proposed paradigm, could be very powerful tool for researchers on sediment transport.</div></div>\",\"PeriodicalId\":18229,\"journal\":{\"name\":\"Marine Geology\",\"volume\":\"489 \",\"pages\":\"Article 107631\"},\"PeriodicalIF\":2.2000,\"publicationDate\":\"2025-08-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Marine Geology\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0025322725001562\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Marine Geology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0025322725001562","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
Intelligent prediction of sediment floc settling velocity: Employing explainable machine learning and quantifying the impacts of environmental factors
Settling velocity of flocs resulting from cohesive sediment flocculation, significantly influenced by hydrological factors (e.g., turbulence shear, salinity, suspended sediment concentration and sediment size), is a key parameter to evaluate sediment transport in estuarine and coastal ocean. Measurement and prediction of in-situ settling velocity for sediment flocs are challenging at current stage due to the complex estuarine dynamics and technique constraints. A powerful and interpretable prediction model is urgently required. To meet this requirement, we propose a new paradigm in predicting settling velocity with interpretable machine learning (ML) model by combination of ML with Shapley additive explanation (SHAP) using 4 hydrological parameters as independent input features. Eight ML algorithms (including recently proposed XGBoost, LightGBM and so on) and ensemble learning (Stacking) were employed, with Bayesian Optimization Algorithm to determine the hyper-parameters. XGBoost model was found to have the highest prediction accuracy in terms of 6 performance metrics, among the 8 base learners. Stacking of these models outperform each component model. With the powerful SHAP methods, the proposed paradigm is able to interpret/explain ML model predictions and highlight the highly influential features for the system of sediment dynamics in both local and global points of view. Additionally, the developed ML model provides transparent insights into the contribution of each feature without compromising predictive accuracy. These ML-based models outperform conventional modified Stokes law or empirical equations. Specially, physics-informed neural network (PINN) is employed for prediction of floc settling velocity, which is also a useful approach to overcome the limitations of data driven model characterized by black box. Combinations of ML with physical law and SHAP, the key of the proposed paradigm, could be very powerful tool for researchers on sediment transport.
期刊介绍:
Marine Geology is the premier international journal on marine geological processes in the broadest sense. We seek papers that are comprehensive, interdisciplinary and synthetic that will be lasting contributions to the field. Although most papers are based on regional studies, they must demonstrate new findings of international significance. We accept papers on subjects as diverse as seafloor hydrothermal systems, beach dynamics, early diagenesis, microbiological studies in sediments, palaeoclimate studies and geophysical studies of the seabed. We encourage papers that address emerging new fields, for example the influence of anthropogenic processes on coastal/marine geology and coastal/marine geoarchaeology. We insist that the papers are concerned with the marine realm and that they deal with geology: with rocks, sediments, and physical and chemical processes affecting them. Papers should address scientific hypotheses: highly descriptive data compilations or papers that deal only with marine management and risk assessment should be submitted to other journals. Papers on laboratory or modelling studies must demonstrate direct relevance to marine processes or deposits. The primary criteria for acceptance of papers is that the science is of high quality, novel, significant, and of broad international interest.