Sana Arshad , Jamil Hasan Kazmi , Endre Harsányi , Farheen Nazli , Waseem Hassan , Saima Shaikh , Main Al-Dalahmeh , Safwan Mohammed
{"title":"Predictive Modeling of soil salinity integrating remote sensing and soil variables: An ensembled deep learning approach","authors":"Sana Arshad , Jamil Hasan Kazmi , Endre Harsányi , Farheen Nazli , Waseem Hassan , Saima Shaikh , Main Al-Dalahmeh , Safwan Mohammed","doi":"10.1016/j.nexus.2025.100374","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate predictions of soil salinity can significantly contribute to achieving the UN- Sustainable Development Goal (SDG-2) of ensuring ‘zero hunger.’ From this perspective, the current research aimed to predict soil electrical conductivity (EC) from remote sensing and soil data using advanced deep learning (DL) architectures. A total of 109 soil samples were analyzed for agricultural land use in the Middle Indus Basin of Pakistan. Seven salinity indices (SI-1 to SI-7) were derived from the 10m to 20m wavelength bands of Sentinel-2, along with vegetation and topographic covariates. Initially, Recursive Feature Elimination was implemented as a feature-selection method to select the most effective predictors. Subsequently, deep learning architectures, including a Feedforward Neural Network (FFNN), Recurrent Neural Network (RNN), and Long Short-Term Memory (LSTM), were employed to predict soil salinity. Research findings showed that EC ranged between 0.57dS/m to 11.5 dS/m in the study area. The evaluation metrics of the DL models revealed that a simple FFNN with three fully connected dense layers achieved the highest R<sup>2</sup> = 0.88 for model training. However, the ensemble of improved FFNN and LSTM outperformed with the highest R<sup>2</sup> and NSE = 0.84, and the lowest RMSE and MAE = 1.38 and 1.01, respectively, on the testing dataset. Optimized deep learning architectures with adjustments to the learning rate, dropout rate, and activation functions achieved the highest prediction accuracy with the lowest validation loss. Finally, SHapely Additive exPlanations (SHAP) revealed that elevation, pH, NDVI, SI-1, and SI-7 had highly significant impacts on EC predictions. This research provides insight into implementing advanced and interpretable DL architectures, supporting informed decision-making by agricultural stakeholders.</div></div>","PeriodicalId":93548,"journal":{"name":"Energy nexus","volume":"17 ","pages":"Article 100374"},"PeriodicalIF":8.0000,"publicationDate":"2025-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy nexus","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772427125000154","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate predictions of soil salinity can significantly contribute to achieving the UN- Sustainable Development Goal (SDG-2) of ensuring ‘zero hunger.’ From this perspective, the current research aimed to predict soil electrical conductivity (EC) from remote sensing and soil data using advanced deep learning (DL) architectures. A total of 109 soil samples were analyzed for agricultural land use in the Middle Indus Basin of Pakistan. Seven salinity indices (SI-1 to SI-7) were derived from the 10m to 20m wavelength bands of Sentinel-2, along with vegetation and topographic covariates. Initially, Recursive Feature Elimination was implemented as a feature-selection method to select the most effective predictors. Subsequently, deep learning architectures, including a Feedforward Neural Network (FFNN), Recurrent Neural Network (RNN), and Long Short-Term Memory (LSTM), were employed to predict soil salinity. Research findings showed that EC ranged between 0.57dS/m to 11.5 dS/m in the study area. The evaluation metrics of the DL models revealed that a simple FFNN with three fully connected dense layers achieved the highest R2 = 0.88 for model training. However, the ensemble of improved FFNN and LSTM outperformed with the highest R2 and NSE = 0.84, and the lowest RMSE and MAE = 1.38 and 1.01, respectively, on the testing dataset. Optimized deep learning architectures with adjustments to the learning rate, dropout rate, and activation functions achieved the highest prediction accuracy with the lowest validation loss. Finally, SHapely Additive exPlanations (SHAP) revealed that elevation, pH, NDVI, SI-1, and SI-7 had highly significant impacts on EC predictions. This research provides insight into implementing advanced and interpretable DL architectures, supporting informed decision-making by agricultural stakeholders.
Energy nexusEnergy (General), Ecological Modelling, Renewable Energy, Sustainability and the Environment, Water Science and Technology, Agricultural and Biological Sciences (General)