Predictive Modeling of soil salinity integrating remote sensing and soil variables: An ensembled deep learning approach

IF 8 Q1 ENERGY & FUELS

Energy nexus Pub Date : 2025-02-08 DOI:10.1016/j.nexus.2025.100374

Sana Arshad , Jamil Hasan Kazmi , Endre Harsányi , Farheen Nazli , Waseem Hassan , Saima Shaikh , Main Al-Dalahmeh , Safwan Mohammed

{"title":"Predictive Modeling of soil salinity integrating remote sensing and soil variables: An ensembled deep learning approach","authors":"Sana Arshad , Jamil Hasan Kazmi , Endre Harsányi , Farheen Nazli , Waseem Hassan , Saima Shaikh , Main Al-Dalahmeh , Safwan Mohammed","doi":"10.1016/j.nexus.2025.100374","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate predictions of soil salinity can significantly contribute to achieving the UN- Sustainable Development Goal (SDG-2) of ensuring ‘zero hunger.’ From this perspective, the current research aimed to predict soil electrical conductivity (EC) from remote sensing and soil data using advanced deep learning (DL) architectures. A total of 109 soil samples were analyzed for agricultural land use in the Middle Indus Basin of Pakistan. Seven salinity indices (SI-1 to SI-7) were derived from the 10m to 20m wavelength bands of Sentinel-2, along with vegetation and topographic covariates. Initially, Recursive Feature Elimination was implemented as a feature-selection method to select the most effective predictors. Subsequently, deep learning architectures, including a Feedforward Neural Network (FFNN), Recurrent Neural Network (RNN), and Long Short-Term Memory (LSTM), were employed to predict soil salinity. Research findings showed that EC ranged between 0.57dS/m to 11.5 dS/m in the study area. The evaluation metrics of the DL models revealed that a simple FFNN with three fully connected dense layers achieved the highest R<sup>2</sup> = 0.88 for model training. However, the ensemble of improved FFNN and LSTM outperformed with the highest R<sup>2</sup> and NSE = 0.84, and the lowest RMSE and MAE = 1.38 and 1.01, respectively, on the testing dataset. Optimized deep learning architectures with adjustments to the learning rate, dropout rate, and activation functions achieved the highest prediction accuracy with the lowest validation loss. Finally, SHapely Additive exPlanations (SHAP) revealed that elevation, pH, NDVI, SI-1, and SI-7 had highly significant impacts on EC predictions. This research provides insight into implementing advanced and interpretable DL architectures, supporting informed decision-making by agricultural stakeholders.</div></div>","PeriodicalId":93548,"journal":{"name":"Energy nexus","volume":"17 ","pages":"Article 100374"},"PeriodicalIF":8.0000,"publicationDate":"2025-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy nexus","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772427125000154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate predictions of soil salinity can significantly contribute to achieving the UN- Sustainable Development Goal (SDG-2) of ensuring ‘zero hunger.’ From this perspective, the current research aimed to predict soil electrical conductivity (EC) from remote sensing and soil data using advanced deep learning (DL) architectures. A total of 109 soil samples were analyzed for agricultural land use in the Middle Indus Basin of Pakistan. Seven salinity indices (SI-1 to SI-7) were derived from the 10m to 20m wavelength bands of Sentinel-2, along with vegetation and topographic covariates. Initially, Recursive Feature Elimination was implemented as a feature-selection method to select the most effective predictors. Subsequently, deep learning architectures, including a Feedforward Neural Network (FFNN), Recurrent Neural Network (RNN), and Long Short-Term Memory (LSTM), were employed to predict soil salinity. Research findings showed that EC ranged between 0.57dS/m to 11.5 dS/m in the study area. The evaluation metrics of the DL models revealed that a simple FFNN with three fully connected dense layers achieved the highest R² = 0.88 for model training. However, the ensemble of improved FFNN and LSTM outperformed with the highest R² and NSE = 0.84, and the lowest RMSE and MAE = 1.38 and 1.01, respectively, on the testing dataset. Optimized deep learning architectures with adjustments to the learning rate, dropout rate, and activation functions achieved the highest prediction accuracy with the lowest validation loss. Finally, SHapely Additive exPlanations (SHAP) revealed that elevation, pH, NDVI, SI-1, and SI-7 had highly significant impacts on EC predictions. This research provides insight into implementing advanced and interpretable DL architectures, supporting informed decision-making by agricultural stakeholders.

Abstract Image

查看原文本刊更多论文

结合遥感和土壤变量的土壤盐度预测建模：一种集成深度学习方法

准确预测土壤盐度可为实现确保“零饥饿”的联合国可持续发展目标（SDG-2）做出重大贡献。从这个角度来看，目前的研究旨在利用先进的深度学习（DL）架构，从遥感和土壤数据中预测土壤电导率（EC）。对巴基斯坦中印度河流域109个土壤样本进行了农业用地分析。利用Sentinel-2卫星10m ~ 20m波段的7个盐度指数SI-1 ~ SI-7，以及植被和地形的协变量。最初，递归特征消除作为一种特征选择方法来选择最有效的预测器。随后，采用深度学习架构，包括前馈神经网络（FFNN）、循环神经网络（RNN）和长短期记忆（LSTM）来预测土壤盐度。研究结果表明，研究区EC值在0.57 ~ 11.5 dS/m之间。DL模型的评价指标显示，具有三个完全连接的致密层的简单FFNN在模型训练中获得了最高的R2 = 0.88。然而，改进的FFNN和LSTM的集成在测试数据集上表现优异，最高的R2和NSE = 0.84，最低的RMSE和MAE分别= 1.38和1.01。通过调整学习率、辍学率和激活函数，优化了深度学习架构，以最低的验证损失实现了最高的预测精度。最后，SHapely加性解释（SHAP）表明海拔、pH、NDVI、SI-1和SI-7对EC预测有极显著的影响。这项研究为实现先进和可解释的深度学习架构提供了见解，支持农业利益相关者做出明智的决策。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Energy nexus Energy (General), Ecological Modelling, Renewable Energy, Sustainability and the Environment, Water Science and Technology, Agricultural and Biological Sciences (General)

CiteScore

7.70

自引率

0.00%

发文量

审稿时长

109 days