{"title":"Estimating soil organic carbon using time series Band 11 (SWIR) of multispectral Sentinel-2 satellite images and machine learning algorithms","authors":"Mehdi Golkar Amoli , Mahdi Hasanlou , Farhad Samadzadegan , Ruhollah Taghizadeh-Mehrjardi , Farzaneh Dadrass Javan","doi":"10.1016/j.rsase.2025.101736","DOIUrl":null,"url":null,"abstract":"<div><div>Soil Organic Carbon (SOC) is a critical soil property impacting food security and climate change. Traditional methods for SOC estimation are time-consuming, expensive, and unsuitable for large-scale application. Consequently, researchers have increasingly focused on utilizing Remote Sensing (RS) images for SOC estimation over the past two decades. However, achieving high SOC estimation accuracy (more than 80 %) remains challenging. This limitation often stems from a mismatch between the complexity of SOC and the information captured by traditional RS observations (e.g., reflectance bands or spectral indices), as conventional feature extraction methods from RS images may not be detailed enough to monitor the many factors influencing SOC concentration. One promising solution to enhance feature extraction is the use of time series observations, analyzing multiple images over time instead of relying on single-time images. This study proposes a novel approach leveraging time series of the Sentinel-2 satellite's B11 band (centered around 1610 nm, a region sensitive to SOC absorption features) along with Principal Component Analysis (PCA) and Independent Component Analysis (ICA) transformations to extract more meaningful temporal features. Specifically, ten new features based on temporal variations were derived by applying PCA and ICA to the B11 band time series images. These temporal features were then combined with features derived from the median of all Sentinel-2 images acquired during the summer of 2019, corresponding to the soil data collection period. Four machine learning algorithms (RF, GBRT, XGBoost, and LightGBM) were employed across four distinct scenarios to evaluate the novel feature extraction method and a feature selection algorithm. The scenarios were designed as follows: Scenario one (S#1) and Scenario two (S#2) did not utilize the time series features, while Scenario three (S#3) and Scenario four (S#4) did. A binary Genetic Algorithm (GA) for feature selection was implemented in S#2 and S#4, distinguishing them from S#1 and S#3 respectively. XGBoost performed best, achieving an R<sup>2</sup> of 0.891 in S#4 (time series features and GA). Incorporating time series features significantly improved accuracy by 0.11, while GA-based feature selection added another 0.05. The findings highlight the effectiveness of the developed feature extraction algorithm, using Sentinel-2's B11 time series and advanced transformations, for substantially improving SOC level estimation.</div></div>","PeriodicalId":53227,"journal":{"name":"Remote Sensing Applications-Society and Environment","volume":"40 ","pages":"Article 101736"},"PeriodicalIF":4.5000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Remote Sensing Applications-Society and Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352938525002897","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Soil Organic Carbon (SOC) is a critical soil property impacting food security and climate change. Traditional methods for SOC estimation are time-consuming, expensive, and unsuitable for large-scale application. Consequently, researchers have increasingly focused on utilizing Remote Sensing (RS) images for SOC estimation over the past two decades. However, achieving high SOC estimation accuracy (more than 80 %) remains challenging. This limitation often stems from a mismatch between the complexity of SOC and the information captured by traditional RS observations (e.g., reflectance bands or spectral indices), as conventional feature extraction methods from RS images may not be detailed enough to monitor the many factors influencing SOC concentration. One promising solution to enhance feature extraction is the use of time series observations, analyzing multiple images over time instead of relying on single-time images. This study proposes a novel approach leveraging time series of the Sentinel-2 satellite's B11 band (centered around 1610 nm, a region sensitive to SOC absorption features) along with Principal Component Analysis (PCA) and Independent Component Analysis (ICA) transformations to extract more meaningful temporal features. Specifically, ten new features based on temporal variations were derived by applying PCA and ICA to the B11 band time series images. These temporal features were then combined with features derived from the median of all Sentinel-2 images acquired during the summer of 2019, corresponding to the soil data collection period. Four machine learning algorithms (RF, GBRT, XGBoost, and LightGBM) were employed across four distinct scenarios to evaluate the novel feature extraction method and a feature selection algorithm. The scenarios were designed as follows: Scenario one (S#1) and Scenario two (S#2) did not utilize the time series features, while Scenario three (S#3) and Scenario four (S#4) did. A binary Genetic Algorithm (GA) for feature selection was implemented in S#2 and S#4, distinguishing them from S#1 and S#3 respectively. XGBoost performed best, achieving an R2 of 0.891 in S#4 (time series features and GA). Incorporating time series features significantly improved accuracy by 0.11, while GA-based feature selection added another 0.05. The findings highlight the effectiveness of the developed feature extraction algorithm, using Sentinel-2's B11 time series and advanced transformations, for substantially improving SOC level estimation.
期刊介绍:
The journal ''Remote Sensing Applications: Society and Environment'' (RSASE) focuses on remote sensing studies that address specific topics with an emphasis on environmental and societal issues - regional / local studies with global significance. Subjects are encouraged to have an interdisciplinary approach and include, but are not limited by: " -Global and climate change studies addressing the impact of increasing concentrations of greenhouse gases, CO2 emission, carbon balance and carbon mitigation, energy system on social and environmental systems -Ecological and environmental issues including biodiversity, ecosystem dynamics, land degradation, atmospheric and water pollution, urban footprint, ecosystem management and natural hazards (e.g. earthquakes, typhoons, floods, landslides) -Natural resource studies including land-use in general, biomass estimation, forests, agricultural land, plantation, soils, coral reefs, wetland and water resources -Agriculture, food production systems and food security outcomes -Socio-economic issues including urban systems, urban growth, public health, epidemics, land-use transition and land use conflicts -Oceanography and coastal zone studies, including sea level rise projections, coastlines changes and the ocean-land interface -Regional challenges for remote sensing application techniques, monitoring and analysis, such as cloud screening and atmospheric correction for tropical regions -Interdisciplinary studies combining remote sensing, household survey data, field measurements and models to address environmental, societal and sustainability issues -Quantitative and qualitative analysis that documents the impact of using remote sensing studies in social, political, environmental or economic systems