Simultaneous estimation of multiple soil properties from vis-NIR spectra using a multi-gate mixture-of-experts with data augmentation

IF 5.6 1区 农林科学 Q1 SOIL SCIENCE
Xiaoqing Wang , Mei-Wei Zhang , Ya-Nan Zhou , Lingli Wang , Ling-Tao Zeng , Yu-Pei Cui , Xiao-Lin Sun
{"title":"Simultaneous estimation of multiple soil properties from vis-NIR spectra using a multi-gate mixture-of-experts with data augmentation","authors":"Xiaoqing Wang ,&nbsp;Mei-Wei Zhang ,&nbsp;Ya-Nan Zhou ,&nbsp;Lingli Wang ,&nbsp;Ling-Tao Zeng ,&nbsp;Yu-Pei Cui ,&nbsp;Xiao-Lin Sun","doi":"10.1016/j.geoderma.2024.117127","DOIUrl":null,"url":null,"abstract":"<div><div>Simultaneous estimation of multiple soil properties from vis-NIR hyperspectra presents a cost-effective and time-efficient approach. Previous studies have utilized multi-task convolutional neural network (multi-CNN) with share-bottom structures based on the hard parameter sharing. However, multi-CNN often ignores the differential characteristics of correlations between soil properties, limiting the accuracy of soil property estimation. The multi-gate mixture-of-experts network (MMoE) offers a solution by extracting both common features across all soil properties and unique features specific to each soil property, which probably could provide better estimation outcomes than the conventional shared-bottom multi-CNN. In the present study, a MMoE was built based on a total of 17,272 mineral soil samples from the Land Use/Cover Area Frame Survey (LUCAS) topsoil database that includes vis-NIR spectra with ten physicochemical properties, i.e., clay, silt, sand, pH (in water), organic content (OC), calcium carbonate (CaCO<sub>3</sub>), nitrogen (N), phosphorous (P), potassium (K), and cation exchange capacity (CEC). To evaluate the performance of MMoE, a series of other models were also built, i.e., partial least square regression (PLSR), single-task convolutional neural network (single-CNN), multi-task convolutional neural network (multi-CNN) and multi-task long short-term memory (multi-LSTM). Furthermore, performance of feature-spectrum selected by competitive adaptive reweighted sampling (CARS) on the accuracy of the MMoE was also explored, as well as a data augmentation method of stacking raw spectra with five preprocessed spectra data. The results demonstrated that MMoE had higher accuracy than PLSR, single-CNN, and multi-LSTM models, with RMSE reduction of 5 %–48 %, R<sup>2</sup> improvement of 1 %–119 %, and CCC improvement of 0 %–74 %. Compared with multi-CNN, MMoE showed better accuracy for all properties except pH, with RMSE reduction of 3 %–8 %, R<sup>2</sup> improvement of 1 %–12 %, and CCC improvement of 0 %–5 %. However, the feature-spectrum selected by CARS did not improve the accuracy of MMoE compared to full-band spectrum, whereas the data augmentation method was effective in improving the estimation accuracy of MMoE compared to raw spectra, with RMSE reduction of 14 %–28 %, R<sup>2</sup> improvement of 3 %–88 %, and CCC improvement of 1 %–63 %. Consequently, this study proves that MMoE based on data augmentation is an efficient and accurate method for the simultaneous estimation of multiple soil properties from vis-NIR spectra.</div></div>","PeriodicalId":12511,"journal":{"name":"Geoderma","volume":"453 ","pages":"Article 117127"},"PeriodicalIF":5.6000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoderma","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016706124003562","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Simultaneous estimation of multiple soil properties from vis-NIR hyperspectra presents a cost-effective and time-efficient approach. Previous studies have utilized multi-task convolutional neural network (multi-CNN) with share-bottom structures based on the hard parameter sharing. However, multi-CNN often ignores the differential characteristics of correlations between soil properties, limiting the accuracy of soil property estimation. The multi-gate mixture-of-experts network (MMoE) offers a solution by extracting both common features across all soil properties and unique features specific to each soil property, which probably could provide better estimation outcomes than the conventional shared-bottom multi-CNN. In the present study, a MMoE was built based on a total of 17,272 mineral soil samples from the Land Use/Cover Area Frame Survey (LUCAS) topsoil database that includes vis-NIR spectra with ten physicochemical properties, i.e., clay, silt, sand, pH (in water), organic content (OC), calcium carbonate (CaCO3), nitrogen (N), phosphorous (P), potassium (K), and cation exchange capacity (CEC). To evaluate the performance of MMoE, a series of other models were also built, i.e., partial least square regression (PLSR), single-task convolutional neural network (single-CNN), multi-task convolutional neural network (multi-CNN) and multi-task long short-term memory (multi-LSTM). Furthermore, performance of feature-spectrum selected by competitive adaptive reweighted sampling (CARS) on the accuracy of the MMoE was also explored, as well as a data augmentation method of stacking raw spectra with five preprocessed spectra data. The results demonstrated that MMoE had higher accuracy than PLSR, single-CNN, and multi-LSTM models, with RMSE reduction of 5 %–48 %, R2 improvement of 1 %–119 %, and CCC improvement of 0 %–74 %. Compared with multi-CNN, MMoE showed better accuracy for all properties except pH, with RMSE reduction of 3 %–8 %, R2 improvement of 1 %–12 %, and CCC improvement of 0 %–5 %. However, the feature-spectrum selected by CARS did not improve the accuracy of MMoE compared to full-band spectrum, whereas the data augmentation method was effective in improving the estimation accuracy of MMoE compared to raw spectra, with RMSE reduction of 14 %–28 %, R2 improvement of 3 %–88 %, and CCC improvement of 1 %–63 %. Consequently, this study proves that MMoE based on data augmentation is an efficient and accurate method for the simultaneous estimation of multiple soil properties from vis-NIR spectra.
使用数据增强的多门混合专家从可见-近红外光谱中同时估计多种土壤特性
利用可见光-近红外高光谱同时估计多种土壤性质是一种经济高效的方法。以往的研究采用基于硬参数共享的多任务卷积神经网络(multi-CNN)。然而,multi-CNN往往忽略了土壤性质之间相关性的差异性特征,限制了土壤性质估计的准确性。多门混合专家网络(MMoE)提供了一种解决方案,通过提取所有土壤属性的共同特征和特定于每种土壤属性的独特特征,这可能比传统的共享底部多cnn提供更好的估计结果。基于LUCAS表层土壤数据库中17272份矿质土壤样品的可见光-近红外光谱,包括粘土、粉土、砂、pH(水中)、有机含量(OC)、碳酸钙(CaCO3)、氮(N)、磷(P)、钾(K)和阳离子交换容量(CEC)等10种理化性质,构建了MMoE模型。为了评估MMoE的性能,我们还建立了一系列其他模型,即偏最小二乘回归(PLSR)、单任务卷积神经网络(single-CNN)、多任务卷积神经网络(multi-CNN)和多任务长短期记忆(multi-LSTM)。此外,还探讨了竞争自适应重加权采样(CARS)选择的特征谱对MMoE精度的影响,以及将原始光谱与5个预处理光谱数据叠加的数据增强方法。结果表明,MMoE比PLSR、单cnn和多lstm模型具有更高的准确率,RMSE降低5% - 48%,R2提高1% - 119%,CCC提高0% - 74%。与multi-CNN相比,MMoE对除pH值外的所有属性都具有更好的准确性,RMSE降低3% - 8%,R2提高1% - 12%,CCC提高0% - 5%。然而,与全波段光谱相比,CARS选择的特征谱并没有提高MMoE的估计精度,而数据增强方法与原始光谱相比,有效提高了MMoE的估计精度,RMSE降低了14% - 28%,R2提高了3% - 88%,CCC提高了1% - 63%。因此,本研究证明了基于数据增强的MMoE是一种有效而准确的方法,可以同时估计可见光-近红外光谱中的多种土壤性质。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Geoderma
Geoderma 农林科学-土壤科学
CiteScore
11.80
自引率
6.60%
发文量
597
审稿时长
58 days
期刊介绍: Geoderma - the global journal of soil science - welcomes authors, readers and soil research from all parts of the world, encourages worldwide soil studies, and embraces all aspects of soil science and its associated pedagogy. The journal particularly welcomes interdisciplinary work focusing on dynamic soil processes and functions across space and time.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信