Title: Advancing soil property prediction with encoder-decoder structures integrating traditional deep learning methods in Vis-NIR spectroscopy
Authors: Ziyi Ke, Shilin Ren, Liang Yin
Journal: Geoderma (impact factor 5.6; JCR Q1, Soil Science)
DOI: 10.1016/j.geoderma.2024.117006
Published: 2024-08-19
Publication type: Journal Article
URL: https://www.sciencedirect.com/science/article/pii/S0016706124002350
Citation count: 0
Abstract
The technology for estimating soil properties using visible and near-infrared spectroscopy has been maturing, with corresponding advances and breakthroughs in deep learning models. In this study, based on the large soil spectral library LUCAS, we explore the potential of encoder-decoder structures to improve convolutional neural network regression predictions. By introducing an encoder-decoder structure into the feature channels of a six-layer CNN model (TRNN model), we significantly enhanced the performance of shallow CNN models and successfully carried out regression predictions for seven soil properties. We employed IntegratedGradients, DeepLift, GradientShap, and DeepLiftShap methods to interpret the output of the TRNN model. Our TRNN model, built on raw spectra, demonstrated high accuracy in predicting multiple soil properties, outperforming residual architectures, LSTMs, various CNN architectures, and other traditional machine learning methods proposed in previous studies. We also investigated the impact of multi-task output structures (TRNN 1-M and TRNN M−M) and single-task output structures (TRNN 1-1) on model performance. For the TRNN model with an encoder-decoder structure, multi-task output structures resulted in a reduction in performance. The TRNN showed outstanding results in regression analysis of the seven soil properties selected in this study (cation exchange capacity, organic carbon content, calcium carbonate content, pH, clay content, silt content, and sand content), with R² values exceeding 0.93 for all seven properties. Different soil characteristics correspond to different wavelengths, with multiple characteristic peaks commonly observed. This research convincingly demonstrates the enormous potential of combining large model architectures with traditional deep learning approaches for predicting soil properties, which could significantly advance precision agriculture.
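The abstract reports performance as R² per soil property. For readers less familiar with the metric, the following is a minimal sketch of the standard coefficient of determination, R² = 1 − SS_res / SS_tot. It is not the authors' evaluation code: the function name `r_squared` and the sample pH values are hypothetical, and the abstract does not state the data split or any preprocessing behind the reported scores.

```python
def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    mean_true = sum(y_true) / len(y_true)
    # Residual sum of squares: squared error of the model's predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Total sum of squares: squared error of always predicting the mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    if ss_tot == 0:
        raise ValueError("R^2 is undefined when all observed values are identical")
    return 1.0 - ss_res / ss_tot


# Hypothetical pH values, made up purely for illustration (not from LUCAS).
observed = [5.0, 6.0, 7.0, 8.0]
predicted = [5.5, 5.5, 7.5, 7.5]
score = r_squared(observed, predicted)
```

With these made-up numbers, SS_res = 1.0 and SS_tot = 5.0, so R² = 0.8. A score of 1.0 means perfect prediction, and 0.0 means the model does no better than predicting the mean of the observed values.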
About the journal:
Geoderma - the global journal of soil science - welcomes authors, readers and soil research from all parts of the world, encourages worldwide soil studies, and embraces all aspects of soil science and its associated pedagogy. The journal particularly welcomes interdisciplinary work focusing on dynamic soil processes and functions across space and time.