Hyperspectral inversion of organic matter content in agricultural soils based on fractional-order derivative and ensemble learning

IF 3.4 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION
Anhong Tian , Zhiyuan Li , Chengbiao Fu
{"title":"Hyperspectral inversion of organic matter content in agricultural soils based on fractional-order derivative and ensemble learning","authors":"Anhong Tian ,&nbsp;Zhiyuan Li ,&nbsp;Chengbiao Fu","doi":"10.1016/j.infrared.2025.106122","DOIUrl":null,"url":null,"abstract":"<div><div>Hyperspectral technology shows great potential for rapidly monitoring soil organic matter (SOM) in farmland. To improve prediction accuracy and model stability, this study proposes a new SOM inversion strategy based on the Stacking ensemble learning framework. By applying fractional-order derivative (FOD) processing to hyperspectral data, the spectral data quality is effectively enhanced. Feature selection was performed using the Boruta algorithm, and the modeling performance of six common machine learning algorithms, including Partial Least Squares Regression (PLSR), Support Vector Regression (SVR), Extreme Gradient Boosting (XGBoost), Random Forest (RF), Adaptive Boosting (AdaBoost), and Gradient Boosting Decision Tree (GBDT), was evaluated and compared. The results show that FOD outperforms traditional integer-order differential methods in modeling. Under the 1.8-order FOD processing, the prediction performance of all models improved significantly. Among them, the Stacking ensemble model combining SVR, RF, AdaBoost, and GBDT achieved the best performance on the test set, with an <em>R</em><sup>2</sup> of 0.83, surpassing the <em>R</em><sup>2</sup> of the best single model, GBDT (0.81). Additionally, SHAP value analysis further confirmed the effectiveness of combining FOD, Boruta, and Stacking strategies, greatly improving SOM prediction accuracy. This study offers a novel and effective technical approach for the rapid and accurate inversion of SOM content in farmland soil.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"151 ","pages":"Article 106122"},"PeriodicalIF":3.4000,"publicationDate":"2025-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449525004153","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0

Abstract

Hyperspectral technology shows great potential for rapidly monitoring soil organic matter (SOM) in farmland. To improve prediction accuracy and model stability, this study proposes a new SOM inversion strategy based on the Stacking ensemble learning framework. By applying fractional-order derivative (FOD) processing to hyperspectral data, the spectral data quality is effectively enhanced. Feature selection was performed using the Boruta algorithm, and the modeling performance of six common machine learning algorithms, including Partial Least Squares Regression (PLSR), Support Vector Regression (SVR), Extreme Gradient Boosting (XGBoost), Random Forest (RF), Adaptive Boosting (AdaBoost), and Gradient Boosting Decision Tree (GBDT), was evaluated and compared. The results show that FOD outperforms traditional integer-order differential methods in modeling. Under the 1.8-order FOD processing, the prediction performance of all models improved significantly. Among them, the Stacking ensemble model combining SVR, RF, AdaBoost, and GBDT achieved the best performance on the test set, with an R2 of 0.83, surpassing the R2 of the best single model, GBDT (0.81). Additionally, SHAP value analysis further confirmed the effectiveness of combining FOD, Boruta, and Stacking strategies, greatly improving SOM prediction accuracy. This study offers a novel and effective technical approach for the rapid and accurate inversion of SOM content in farmland soil.

Abstract Image

基于分数阶导数和集合学习的农业土壤有机质含量高光谱反演
高光谱技术在农田土壤有机质快速监测中显示出巨大的潜力。为了提高预测精度和模型稳定性,本文提出了一种基于叠加集成学习框架的SOM反演策略。通过对高光谱数据进行分数阶导数(FOD)处理,有效地提高了光谱数据的质量。使用Boruta算法进行特征选择,并评估和比较了6种常用机器学习算法的建模性能,包括偏最小二乘回归(PLSR)、支持向量回归(SVR)、极端梯度增强(XGBoost)、随机森林(RF)、自适应增强(AdaBoost)和梯度增强决策树(GBDT)。结果表明,该方法在建模方面优于传统的整阶微分方法。在1.8阶FOD处理下,所有模型的预测性能都有显著提高。其中,结合SVR、RF、AdaBoost和GBDT的Stacking ensemble模型在测试集上的表现最佳,R2为0.83,超过了最佳单一模型GBDT的R2(0.81)。此外,SHAP值分析进一步证实了FOD、Boruta和Stacking策略相结合的有效性,大大提高了SOM的预测精度。本研究为快速准确地反演农田土壤中SOM含量提供了一种新颖有效的技术途径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.70
自引率
12.10%
发文量
400
审稿时长
67 days
期刊介绍: The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region. Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine. Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信