基于序列数据同化和机器学习的非测站土壤湿度剖面估计

IF 6.3 1区 地球科学 Q1 ENGINEERING, CIVIL
Yakun Wang , Qiuru Zhang , Shikun Sun , Yifei Yao , Xiaotao Hu , Shibiao Cai , Hanbo Wang
{"title":"基于序列数据同化和机器学习的非测站土壤湿度剖面估计","authors":"Yakun Wang ,&nbsp;Qiuru Zhang ,&nbsp;Shikun Sun ,&nbsp;Yifei Yao ,&nbsp;Xiaotao Hu ,&nbsp;Shibiao Cai ,&nbsp;Hanbo Wang","doi":"10.1016/j.jhydrol.2025.134292","DOIUrl":null,"url":null,"abstract":"<div><div>Although soil moisture content (SMC) is crucial for understanding land–atmosphere interface interactions, there is still a paucity of regional-scale long-term high resolution SMC datasets. How to deduce regional continuous SMC with discrete point-scale observations remains a prevalent and challenging issue. Existing physical models often struggle with parameter acquisition and high computational costs, whereas purely data-driven machine learning (ML) models may perform poorly outside its calibration range due to the lack of physical constraints. This study proposed a sequential hybrid method (mRestart-EnKF-ML) that combined the modified restart ensemble Kalman filter (mRestart-EnKF) with ML to estimate SMC in ungauged stations using historical information from adjacent available stations. With the aid of a series of real-world cases, we demonstrated the ability, and the challenge as well, of retrieving SMC in ungauged stations sequentially. The results showed that Kalman update improved the reliability of ML modeling by updating soil hydraulic parameters in real-time, thereby enhancing the overall estimation accuracy of SMC, especially for the surface-layer SMC. In contrast to purely data-driven models, the proposed mRestart-EnKF-ML significantly reduced the RMSE of SMC retrievals by means of both expanding the training dataset and introducing physical constraints into ML models. The impacts of input features on mRestart-EnKF-ML for SMC estimation exhibited significant spatial heterogeneity and align with physical processes based on the SHAP analysis. Training data with denser vertical spatial resolution can implicitly offer more accurate physical knowledge like mass conservation, thus ensuring the proposed method’s robustness in various application scenarios. The performance of the mRestart-EnKF-ML was significantly influenced by observation error settings, station-specific characteristics, and depth-dependent responses, with optimal error configurations varying by station type and depth playing key roles in shaping error-accuracy relationships.</div></div>","PeriodicalId":362,"journal":{"name":"Journal of Hydrology","volume":"663 ","pages":"Article 134292"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Estimation of soil moisture profiles in ungauged stations by hybridizing sequential data assimilation and machine learning\",\"authors\":\"Yakun Wang ,&nbsp;Qiuru Zhang ,&nbsp;Shikun Sun ,&nbsp;Yifei Yao ,&nbsp;Xiaotao Hu ,&nbsp;Shibiao Cai ,&nbsp;Hanbo Wang\",\"doi\":\"10.1016/j.jhydrol.2025.134292\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Although soil moisture content (SMC) is crucial for understanding land–atmosphere interface interactions, there is still a paucity of regional-scale long-term high resolution SMC datasets. How to deduce regional continuous SMC with discrete point-scale observations remains a prevalent and challenging issue. Existing physical models often struggle with parameter acquisition and high computational costs, whereas purely data-driven machine learning (ML) models may perform poorly outside its calibration range due to the lack of physical constraints. This study proposed a sequential hybrid method (mRestart-EnKF-ML) that combined the modified restart ensemble Kalman filter (mRestart-EnKF) with ML to estimate SMC in ungauged stations using historical information from adjacent available stations. With the aid of a series of real-world cases, we demonstrated the ability, and the challenge as well, of retrieving SMC in ungauged stations sequentially. The results showed that Kalman update improved the reliability of ML modeling by updating soil hydraulic parameters in real-time, thereby enhancing the overall estimation accuracy of SMC, especially for the surface-layer SMC. In contrast to purely data-driven models, the proposed mRestart-EnKF-ML significantly reduced the RMSE of SMC retrievals by means of both expanding the training dataset and introducing physical constraints into ML models. The impacts of input features on mRestart-EnKF-ML for SMC estimation exhibited significant spatial heterogeneity and align with physical processes based on the SHAP analysis. Training data with denser vertical spatial resolution can implicitly offer more accurate physical knowledge like mass conservation, thus ensuring the proposed method’s robustness in various application scenarios. The performance of the mRestart-EnKF-ML was significantly influenced by observation error settings, station-specific characteristics, and depth-dependent responses, with optimal error configurations varying by station type and depth playing key roles in shaping error-accuracy relationships.</div></div>\",\"PeriodicalId\":362,\"journal\":{\"name\":\"Journal of Hydrology\",\"volume\":\"663 \",\"pages\":\"Article 134292\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hydrology\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0022169425016324\",\"RegionNum\":1,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CIVIL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022169425016324","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0

摘要

虽然土壤含水量(SMC)是了解陆地-大气界面相互作用的关键,但区域尺度的长期高分辨率SMC数据集仍然缺乏。如何利用离散的点尺度观测推断区域连续SMC是一个普遍而具有挑战性的问题。现有的物理模型经常与参数获取和高计算成本作斗争,而纯数据驱动的机器学习(ML)模型由于缺乏物理约束,在其校准范围之外可能表现不佳。本研究提出了一种序列混合方法(mRestart-EnKF-ML),该方法将改进的重启集合卡尔曼滤波(mRestart-EnKF)与ML相结合,利用邻近可用台站的历史信息估计未测量台站的SMC。在一系列实际案例的帮助下,我们展示了在未测量的站点中顺序检索SMC的能力和挑战。结果表明,卡尔曼更新通过实时更新土壤水力参数提高了ML建模的可靠性,从而提高了SMC的整体估计精度,特别是对表层SMC的估计精度。与纯粹的数据驱动模型相比,本文提出的mRestart-EnKF-ML通过扩展训练数据集和在ML模型中引入物理约束,显著降低了SMC检索的RMSE。基于SHAP分析,输入特征对SMC估计的mRestart-EnKF-ML的影响表现出显著的空间异质性,并与物理过程一致。具有更密集的垂直空间分辨率的训练数据可以隐含地提供更精确的物理知识,如质量守恒,从而保证了所提方法在各种应用场景下的鲁棒性。mreboot - enkf - ml的性能受到观测误差设置、台站特征和深度相关响应的显著影响,其中最优误差配置随台站类型和深度的变化而变化,在误差-精度关系的形成中起着关键作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Estimation of soil moisture profiles in ungauged stations by hybridizing sequential data assimilation and machine learning
Although soil moisture content (SMC) is crucial for understanding land–atmosphere interface interactions, there is still a paucity of regional-scale long-term high resolution SMC datasets. How to deduce regional continuous SMC with discrete point-scale observations remains a prevalent and challenging issue. Existing physical models often struggle with parameter acquisition and high computational costs, whereas purely data-driven machine learning (ML) models may perform poorly outside its calibration range due to the lack of physical constraints. This study proposed a sequential hybrid method (mRestart-EnKF-ML) that combined the modified restart ensemble Kalman filter (mRestart-EnKF) with ML to estimate SMC in ungauged stations using historical information from adjacent available stations. With the aid of a series of real-world cases, we demonstrated the ability, and the challenge as well, of retrieving SMC in ungauged stations sequentially. The results showed that Kalman update improved the reliability of ML modeling by updating soil hydraulic parameters in real-time, thereby enhancing the overall estimation accuracy of SMC, especially for the surface-layer SMC. In contrast to purely data-driven models, the proposed mRestart-EnKF-ML significantly reduced the RMSE of SMC retrievals by means of both expanding the training dataset and introducing physical constraints into ML models. The impacts of input features on mRestart-EnKF-ML for SMC estimation exhibited significant spatial heterogeneity and align with physical processes based on the SHAP analysis. Training data with denser vertical spatial resolution can implicitly offer more accurate physical knowledge like mass conservation, thus ensuring the proposed method’s robustness in various application scenarios. The performance of the mRestart-EnKF-ML was significantly influenced by observation error settings, station-specific characteristics, and depth-dependent responses, with optimal error configurations varying by station type and depth playing key roles in shaping error-accuracy relationships.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Hydrology
Journal of Hydrology 地学-地球科学综合
CiteScore
11.00
自引率
12.50%
发文量
1309
审稿时长
7.5 months
期刊介绍: The Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology and hydrogeology. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, hydraulics, agrohydrology, geomorphology, soil science, instrumentation and remote sensing, civil and environmental engineering are included. Social science perspectives on hydrological problems such as resource and ecological economics, environmental sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信