{"title":"Temporal and geographic extrapolation of soil moisture using machine learning algorithms","authors":"Efthymios Chrysanthopoulos, Andreas Kallioras","doi":"10.1016/j.catena.2025.109156","DOIUrl":null,"url":null,"abstract":"<div><div>The inherent characteristic of machine learning algorithms to extrapolate when the convex hull is expanded with new unseen instances, can be exploited in soil moisture prediction, concerning temporal and geographic extrapolation. This study describes the implementation of a machine learning framework, evaluating the performance of both individuals (Support Vector Regressor) and ensemble algorithms (Random Forests and Voting Regressor) in temporal and geographic extrapolation of soil moisture beyond the feature space of the calibration data. While most studies focus on temporal extrapolation and spatial interpolation of soil moisture in the framework of calibration stations, this study provides important insights on soil moisture prediction in distinct locations of a catchment where target variables are available, using pre-calibrated models at an individual station. The approach is originally based on the calibration of each machine learning algorithm with the soil moisture data from every agro-meteorological station of the monitoring networks and the evaluation both in temporal extrapolation context with future data of the same station and in geographic extrapolation with data concerning the location of rest of the stations.Overall the results indicate that in the context of temporal extrapolation the algorithms achieve adequate accuracy with the performance metrics to achieve values R<sup>2</sup> > 0.75, RMSE < 0.042 cm<sup>3</sup>cm<sup>−3</sup> and MAE < 0.001 cm<sup>3</sup>cm<sup>−3</sup>, while in the context of geographic extrapolation algorithms trained using soil moisture data from a distinct agro-meteorological station are capable of predicting soil moisture with enhanced efficiency when applied to previously unseen datasets. The results of this research indicate the applicability of the framework in unmonitored sites.</div></div>","PeriodicalId":9801,"journal":{"name":"Catena","volume":"257 ","pages":"Article 109156"},"PeriodicalIF":5.7000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Catena","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0341816225004588","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The inherent characteristic of machine learning algorithms to extrapolate when the convex hull is expanded with new unseen instances, can be exploited in soil moisture prediction, concerning temporal and geographic extrapolation. This study describes the implementation of a machine learning framework, evaluating the performance of both individuals (Support Vector Regressor) and ensemble algorithms (Random Forests and Voting Regressor) in temporal and geographic extrapolation of soil moisture beyond the feature space of the calibration data. While most studies focus on temporal extrapolation and spatial interpolation of soil moisture in the framework of calibration stations, this study provides important insights on soil moisture prediction in distinct locations of a catchment where target variables are available, using pre-calibrated models at an individual station. The approach is originally based on the calibration of each machine learning algorithm with the soil moisture data from every agro-meteorological station of the monitoring networks and the evaluation both in temporal extrapolation context with future data of the same station and in geographic extrapolation with data concerning the location of rest of the stations.Overall the results indicate that in the context of temporal extrapolation the algorithms achieve adequate accuracy with the performance metrics to achieve values R2 > 0.75, RMSE < 0.042 cm3cm−3 and MAE < 0.001 cm3cm−3, while in the context of geographic extrapolation algorithms trained using soil moisture data from a distinct agro-meteorological station are capable of predicting soil moisture with enhanced efficiency when applied to previously unseen datasets. The results of this research indicate the applicability of the framework in unmonitored sites.
期刊介绍:
Catena publishes papers describing original field and laboratory investigations and reviews on geoecology and landscape evolution with emphasis on interdisciplinary aspects of soil science, hydrology and geomorphology. It aims to disseminate new knowledge and foster better understanding of the physical environment, of evolutionary sequences that have resulted in past and current landscapes, and of the natural processes that are likely to determine the fate of our terrestrial environment.
Papers within any one of the above topics are welcome provided they are of sufficiently wide interest and relevance.