Subsetting reduces the error of MIR spectroscopy models for soil organic carbon prediction in the U.S. Great Plains

Minerva J. Dorantes, Bryan A. Fuentes, David M. Miller
{"title":"Subsetting reduces the error of MIR spectroscopy models for soil organic carbon prediction in the U.S. Great Plains","authors":"Minerva J. Dorantes,&nbsp;Bryan A. Fuentes,&nbsp;David M. Miller","doi":"10.1016/j.soisec.2024.100145","DOIUrl":null,"url":null,"abstract":"<div><p>High demand for soil organic carbon data to support soil health and climate change mitigation efforts must be met with rapid, accurate, and inexpensive measurement methods. Mid-infrared spectroscopy shows promise as an alternative to conventional soil carbon analysis; however, its practicality depends on the construction and efficient use of soil spectral libraries. Subsetting, a calibration optimization technique, has potential to reduce model prediction errors. Nevertheless, the effectiveness of different subsetting criteria has yet to be well explored. This study assessed whether several subsetting criteria would result in calibration models with reduced error in the prediction of soil organic carbon content compared to models constructed from a full spectral dataset. A mid-infrared spectral library composed of Nebraska and Kansas soil samples was subset by (i) a nested wetland criterion, (ii) soilscapes, (iii) presence or absence of carbonates, and (iv) a combination of soilscape and carbonates. Partial least squares regression was used to construct all calibration models. Predictive performance of the subset models was compared to that of their corresponding full set model using several statistical metrics. Subsetting by wetlands reduced model error by 22 and 56 %. Subsetting by soilscape yielded a 13 to 55 % reduction in model error, while presence or absence of carbonates reduced model error by 21 and 46 %. Five of the eight combination soilscape and carbonate subset models reduced model error by 14 to 51 %. Overall, subsetting by soilscape or carbonate presence proved effective in improving model performance, with combination subsets proving beneficial under specific calibration set conditions.</p></div>","PeriodicalId":74839,"journal":{"name":"Soil security","volume":"16 ","pages":"Article 100145"},"PeriodicalIF":0.0000,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2667006224000194/pdfft?md5=f175cfcf3dd8f64e71410e819cb7a755&pid=1-s2.0-S2667006224000194-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soil security","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667006224000194","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

High demand for soil organic carbon data to support soil health and climate change mitigation efforts must be met with rapid, accurate, and inexpensive measurement methods. Mid-infrared spectroscopy shows promise as an alternative to conventional soil carbon analysis; however, its practicality depends on the construction and efficient use of soil spectral libraries. Subsetting, a calibration optimization technique, has potential to reduce model prediction errors. Nevertheless, the effectiveness of different subsetting criteria has yet to be well explored. This study assessed whether several subsetting criteria would result in calibration models with reduced error in the prediction of soil organic carbon content compared to models constructed from a full spectral dataset. A mid-infrared spectral library composed of Nebraska and Kansas soil samples was subset by (i) a nested wetland criterion, (ii) soilscapes, (iii) presence or absence of carbonates, and (iv) a combination of soilscape and carbonates. Partial least squares regression was used to construct all calibration models. Predictive performance of the subset models was compared to that of their corresponding full set model using several statistical metrics. Subsetting by wetlands reduced model error by 22 and 56 %. Subsetting by soilscape yielded a 13 to 55 % reduction in model error, while presence or absence of carbonates reduced model error by 21 and 46 %. Five of the eight combination soilscape and carbonate subset models reduced model error by 14 to 51 %. Overall, subsetting by soilscape or carbonate presence proved effective in improving model performance, with combination subsets proving beneficial under specific calibration set conditions.

子集减少了用于美国大平原土壤有机碳预测的近红外光谱模型的误差
为支持土壤健康和气候变化减缓工作,对土壤有机碳数据的需求量很大,必须采用快速、准确和廉价的测量方法。中红外光谱法有望成为传统土壤碳分析的替代方法,但其实用性取决于土壤光谱库的构建和有效利用。子集是一种校准优化技术,具有减少模型预测误差的潜力。然而,不同子集标准的有效性还有待深入探讨。本研究评估了几种子集标准是否会导致校准模型与全光谱数据集构建的模型相比,在预测土壤有机碳含量时误差更小。由内布拉斯加州和堪萨斯州土壤样本组成的中红外光谱库按照以下标准进行了子集:(i) 嵌套湿地标准;(ii) 土壤地貌;(iii) 是否存在碳酸盐;(iv) 土壤地貌和碳酸盐的组合。所有校准模型均采用偏最小二乘回归法。使用多个统计指标比较了子集模型与相应全集模型的预测性能。按湿地分类的子集模型误差分别减少了 22% 和 56%。按土壤地貌分类可使模型误差减少 13% 到 55%,而碳酸盐的存在与否可使模型误差分别减少 21% 和 46%。在八个土壤地貌和碳酸盐子集组合模型中,有五个模型的误差减少了 14% 到 51%。总体而言,根据土壤地貌或碳酸盐的存在进行子集设置可有效提高模型性能,在特定的校准集条件下,组合子集也证明是有益的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Soil security
Soil security Soil Science
CiteScore
4.00
自引率
0.00%
发文量
0
审稿时长
90 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信