Chunlin Zhang , Jiangyuan Zeng , Pengfei Shi , Hongliang Ma , Husi Letu , Xiang Zhang , Panshan Wang , Haiyun Bi , Jiaming Rong
{"title":"Global-scale gap filling of satellite soil moisture products: Methods and validation","authors":"Chunlin Zhang , Jiangyuan Zeng , Pengfei Shi , Hongliang Ma , Husi Letu , Xiang Zhang , Panshan Wang , Haiyun Bi , Jiaming Rong","doi":"10.1016/j.jhydrol.2025.132762","DOIUrl":null,"url":null,"abstract":"<div><div>The utility of satellite soil moisture products is often limited by their missing values, and thus it is crucial to develop gap-filling methods to obtain soil moisture datasets with high-precision and spatiotemporal coverage. Previous studies often used a single gap-filling method in specific regions without analysis of the factors affecting the gap-filling accuracy. To narrow this research gap, this study first compared the correlation of SMAP soil moisture products with five spatially seamless model-based soil moisture datasets globally. Then based on the optimal ERA5 data from 2016 to 2019, the performance of four machine learning methods in filling the SMAP missing values was compared. The best-performing random forest (RF) method was compared with other five traditional bias-corrected methods. Subsequently, twelve auxiliary data were incorporated into the RF to improve the accuracy of gap-filled SMAP data, which were validated by ground measurements from 1071 sites worldwide. Finally, the environmental factors affecting the filling accuracy of SMAP data were analyzed on a global scale. The results indicate: 1) RF generally performs the best among the four machine learning approaches. When only using the ERA5 dataset for the model input, RF achieves higher accuracy compared to the other five bias-corrected methods during the training phase, but its skill degrades noticeably in the validation phase. The performance of RF improves significantly after adding auxiliary data; 2) against globally distributed <em>in situ</em> data, the gap-filled products show improved skill over the original SMAP data, with smaller ubRMSE of 0.049 m<sup>3</sup>m<sup>−3</sup> (<em>vs.</em> 0.060 m<sup>3</sup>m<sup>−3</sup>), demonstrating the RF method with auxiliary data can effectively fill the missing values of SMAP data; 3) the gap-filling accuracy is mainly affected by vegetation cover, soil moisture conditions, and land cover heterogeneity. Specifically, the filling accuracy is lower in denser vegetation coverage, wetter soil, and larger land cover heterogeneity.</div></div>","PeriodicalId":362,"journal":{"name":"Journal of Hydrology","volume":"653 ","pages":"Article 132762"},"PeriodicalIF":5.9000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hydrology","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022169425001003","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
The utility of satellite soil moisture products is often limited by their missing values, and thus it is crucial to develop gap-filling methods to obtain soil moisture datasets with high-precision and spatiotemporal coverage. Previous studies often used a single gap-filling method in specific regions without analysis of the factors affecting the gap-filling accuracy. To narrow this research gap, this study first compared the correlation of SMAP soil moisture products with five spatially seamless model-based soil moisture datasets globally. Then based on the optimal ERA5 data from 2016 to 2019, the performance of four machine learning methods in filling the SMAP missing values was compared. The best-performing random forest (RF) method was compared with other five traditional bias-corrected methods. Subsequently, twelve auxiliary data were incorporated into the RF to improve the accuracy of gap-filled SMAP data, which were validated by ground measurements from 1071 sites worldwide. Finally, the environmental factors affecting the filling accuracy of SMAP data were analyzed on a global scale. The results indicate: 1) RF generally performs the best among the four machine learning approaches. When only using the ERA5 dataset for the model input, RF achieves higher accuracy compared to the other five bias-corrected methods during the training phase, but its skill degrades noticeably in the validation phase. The performance of RF improves significantly after adding auxiliary data; 2) against globally distributed in situ data, the gap-filled products show improved skill over the original SMAP data, with smaller ubRMSE of 0.049 m3m−3 (vs. 0.060 m3m−3), demonstrating the RF method with auxiliary data can effectively fill the missing values of SMAP data; 3) the gap-filling accuracy is mainly affected by vegetation cover, soil moisture conditions, and land cover heterogeneity. Specifically, the filling accuracy is lower in denser vegetation coverage, wetter soil, and larger land cover heterogeneity.
期刊介绍:
The Journal of Hydrology publishes original research papers and comprehensive reviews in all the subfields of the hydrological sciences including water based management and policy issues that impact on economics and society. These comprise, but are not limited to the physical, chemical, biogeochemical, stochastic and systems aspects of surface and groundwater hydrology, hydrometeorology and hydrogeology. Relevant topics incorporating the insights and methodologies of disciplines such as climatology, water resource systems, hydraulics, agrohydrology, geomorphology, soil science, instrumentation and remote sensing, civil and environmental engineering are included. Social science perspectives on hydrological problems such as resource and ecological economics, environmental sociology, psychology and behavioural science, management and policy analysis are also invited. Multi-and interdisciplinary analyses of hydrological problems are within scope. The science published in the Journal of Hydrology is relevant to catchment scales rather than exclusively to a local scale or site.