Using remote sensing and machine learning to generate 100-cm soil moisture at 30-m resolution for the black soil region of China: Implication for agricultural water management
Liwen Chen , Boting Hu , Jingxuan Sun , Y. Jun Xu , Guangxin Zhang , Hongbo Ma , Jingquan Ren
{"title":"Using remote sensing and machine learning to generate 100-cm soil moisture at 30-m resolution for the black soil region of China: Implication for agricultural water management","authors":"Liwen Chen , Boting Hu , Jingxuan Sun , Y. Jun Xu , Guangxin Zhang , Hongbo Ma , Jingquan Ren","doi":"10.1016/j.agwat.2025.109353","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-layer soil moisture is an important factor in predicting agricultural droughts and waterlogging, with significant implications for the growth, development, and yield prediction of rain fed crops. However, soil moisture datasets or algorithms fail to simultaneously meet the requirements of multi-layer, high spatiotemporal resolution soil moisture information for large-scale agricultural production areas. To fill this gap, we propose a novel framework for estimation high spatial resolution multi-layer soil moisture data. Firstly, utilizing the Google Earth Engine (GEE) platform and Enhanced Spatial and Temporal Adaptive Reflectance Fusion Model (ESTARFM), we achieve the fusion of multi-source remote sensing data at large scales to obtain high spatiotemporal resolution Normalized Difference Vegetation Index (NDVI) and Land Surface Temperature (LST) data. Secondly, leveraging the Extreme Gradient Boosting (XGBoost) model along with reanalysis and in-situ measurements, we estimate soil moisture information across depths of 0–100 cm depth by 10 cm interval over large geographical extents. Finally, the accuracy of the soil moisture model is assessed using metrics such as Pearson correlation coefficient, root mean square error (RMSE), unbiased RMSE (ubRMSE), and bias. To assess the applicability of our research methodology, we selected the typical black soil zone in Northeast of China, which is one of the four major black soil regions globally and characterized by intensive agricultural activities. We estimated the long-term time series of soil moisture information during the growing seasons from 2000 to 2020 in this study area. We found that the soil moisture simulation based on the XGBoost model the worst values of R, RMSE, ubRMSE, and Bias values for the training set are 0.86,1.49,1.49 and −0.039 respectively. For the validation set, the worst value of R is 0.83. The proposed methodology in this study enables the acquisition of soil moisture information with both large-scale coverage and high spatiotemporal resolution. This advancement holds significant promise for fine-scale research and applications in agricultural, hydrological, and environmental fields.</div></div>","PeriodicalId":7634,"journal":{"name":"Agricultural Water Management","volume":"309 ","pages":"Article 109353"},"PeriodicalIF":5.9000,"publicationDate":"2025-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Agricultural Water Management","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0378377425000678","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
Multi-layer soil moisture is an important factor in predicting agricultural droughts and waterlogging, with significant implications for the growth, development, and yield prediction of rain fed crops. However, soil moisture datasets or algorithms fail to simultaneously meet the requirements of multi-layer, high spatiotemporal resolution soil moisture information for large-scale agricultural production areas. To fill this gap, we propose a novel framework for estimation high spatial resolution multi-layer soil moisture data. Firstly, utilizing the Google Earth Engine (GEE) platform and Enhanced Spatial and Temporal Adaptive Reflectance Fusion Model (ESTARFM), we achieve the fusion of multi-source remote sensing data at large scales to obtain high spatiotemporal resolution Normalized Difference Vegetation Index (NDVI) and Land Surface Temperature (LST) data. Secondly, leveraging the Extreme Gradient Boosting (XGBoost) model along with reanalysis and in-situ measurements, we estimate soil moisture information across depths of 0–100 cm depth by 10 cm interval over large geographical extents. Finally, the accuracy of the soil moisture model is assessed using metrics such as Pearson correlation coefficient, root mean square error (RMSE), unbiased RMSE (ubRMSE), and bias. To assess the applicability of our research methodology, we selected the typical black soil zone in Northeast of China, which is one of the four major black soil regions globally and characterized by intensive agricultural activities. We estimated the long-term time series of soil moisture information during the growing seasons from 2000 to 2020 in this study area. We found that the soil moisture simulation based on the XGBoost model the worst values of R, RMSE, ubRMSE, and Bias values for the training set are 0.86,1.49,1.49 and −0.039 respectively. For the validation set, the worst value of R is 0.83. The proposed methodology in this study enables the acquisition of soil moisture information with both large-scale coverage and high spatiotemporal resolution. This advancement holds significant promise for fine-scale research and applications in agricultural, hydrological, and environmental fields.
期刊介绍:
Agricultural Water Management publishes papers of international significance relating to the science, economics, and policy of agricultural water management. In all cases, manuscripts must address implications and provide insight regarding agricultural water management.