Comparative analysis of the effects of different dimensionality reduction algorithms on hyperspectral estimation of total nitrogen content in wheat soils
Juan Bai , Shiyou Zhu , Yingchao Hao , Xinzhe Li , Chenbo Yang , Chao Wang , Xingxing Qiao , Meichen Feng , Lujie Xiao , Xiaoyan Song , Meijun Zhang , Sha Yang , Guangxin Li , Wude Yang
{"title":"Comparative analysis of the effects of different dimensionality reduction algorithms on hyperspectral estimation of total nitrogen content in wheat soils","authors":"Juan Bai , Shiyou Zhu , Yingchao Hao , Xinzhe Li , Chenbo Yang , Chao Wang , Xingxing Qiao , Meichen Feng , Lujie Xiao , Xiaoyan Song , Meijun Zhang , Sha Yang , Guangxin Li , Wude Yang","doi":"10.1016/j.eja.2025.127660","DOIUrl":null,"url":null,"abstract":"<div><h3>Context</h3><div>The level of soil nitrogen supply profoundly impacts the growth, development, and yield formation capacity of winter wheat. Excessive use of nitrogen-based fertilizers in current agricultural practices has negative consequences on both the environment and crop growth. Therefore, real-time, non-destructive estimation of soil total nitrogen content using hyperspectral remote sensing technology is crucial for advancing crop fertilization strategies and precision agriculture.</div></div><div><h3>Objectives</h3><div>(1) Explores the effects of different dimensionality reduction algorithms on hyperspectral estimation of soil total nitrogen content in wheat fields. (2) Investigates the optimal model for hyperspectral detection of total nitrogen content in wheat field soils in the Jinzhong region of Shanxi Province.</div></div><div><h3>Methods</h3><div>This study integrates various preprocessing methods and applies four dimensionality reduction algorithms—principal component analysis (PCA), singular value decomposition (SVD), unrelated variable elimination (UVE) and random forest (RF)—to reduce the data dimensions. Support vector regression (SVR) and back propagation neural network (BPNN) models for estimating soil total nitrogen content were then constructed and compared with gradient boosted decision tree (GBDT).</div></div><div><h3>Results</h3><div>The feature extraction algorithms PCA and SVD produced the same principal components and cumulative contributions when reducing the dimensionality of hyperspectral data. The number of characteristic bands selected by UVE was much smaller than that selected by RF. The characteristic bands selected by RF spanned the visible, near-infrared, and mid-infrared wavelength ranges, while those selected by UVE were mostly located within the visible light wavelength range. The modelling results following PCA and SVD dimensionality reduction were relatively similar, while the models based on RF-selected bands showed little change compared to full-spectrum band modeling. The SVR model constructed using multiplicative scatter correction (MSC) preprocessing and SVD dimensionality reduction had the highest accuracy in estimating soil total nitrogen content. (R<sub>c</sub><sup>2</sup>=0.87, R<sub>v</sub><sup>2</sup>=0.85; RMSE<sub>c</sub>=0.13, RMSE<sub>v</sub>=0.14; RPD<sub>c</sub>=2.82, RPD<sub>v</sub>=2.55; MAE<sub>c</sub>=0.10, MAE<sub>v</sub>=0.10)</div></div><div><h3>Conclusions</h3><div>Dimensionality reduction algorithms significantly contribute to the development of hyperspectral estimation models for soil total nitrogen content. The feature extraction algorithm (PCA and SVD) shows more obvious effect in improving the spectral modeling accuracy compared to the feature selection algorithm (UVE and RF). The optimal estimation model combination for hyperspectral detection of total nitrogen content in wheat field soils is MSC+SVD+SVR.</div></div>","PeriodicalId":51045,"journal":{"name":"European Journal of Agronomy","volume":"168 ","pages":"Article 127660"},"PeriodicalIF":4.5000,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Agronomy","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S116103012500156X","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
Abstract
Context
The level of soil nitrogen supply profoundly impacts the growth, development, and yield formation capacity of winter wheat. Excessive use of nitrogen-based fertilizers in current agricultural practices has negative consequences on both the environment and crop growth. Therefore, real-time, non-destructive estimation of soil total nitrogen content using hyperspectral remote sensing technology is crucial for advancing crop fertilization strategies and precision agriculture.
Objectives
(1) Explores the effects of different dimensionality reduction algorithms on hyperspectral estimation of soil total nitrogen content in wheat fields. (2) Investigates the optimal model for hyperspectral detection of total nitrogen content in wheat field soils in the Jinzhong region of Shanxi Province.
Methods
This study integrates various preprocessing methods and applies four dimensionality reduction algorithms—principal component analysis (PCA), singular value decomposition (SVD), unrelated variable elimination (UVE) and random forest (RF)—to reduce the data dimensions. Support vector regression (SVR) and back propagation neural network (BPNN) models for estimating soil total nitrogen content were then constructed and compared with gradient boosted decision tree (GBDT).
Results
The feature extraction algorithms PCA and SVD produced the same principal components and cumulative contributions when reducing the dimensionality of hyperspectral data. The number of characteristic bands selected by UVE was much smaller than that selected by RF. The characteristic bands selected by RF spanned the visible, near-infrared, and mid-infrared wavelength ranges, while those selected by UVE were mostly located within the visible light wavelength range. The modelling results following PCA and SVD dimensionality reduction were relatively similar, while the models based on RF-selected bands showed little change compared to full-spectrum band modeling. The SVR model constructed using multiplicative scatter correction (MSC) preprocessing and SVD dimensionality reduction had the highest accuracy in estimating soil total nitrogen content. (Rc2=0.87, Rv2=0.85; RMSEc=0.13, RMSEv=0.14; RPDc=2.82, RPDv=2.55; MAEc=0.10, MAEv=0.10)
Conclusions
Dimensionality reduction algorithms significantly contribute to the development of hyperspectral estimation models for soil total nitrogen content. The feature extraction algorithm (PCA and SVD) shows more obvious effect in improving the spectral modeling accuracy compared to the feature selection algorithm (UVE and RF). The optimal estimation model combination for hyperspectral detection of total nitrogen content in wheat field soils is MSC+SVD+SVR.
期刊介绍:
The European Journal of Agronomy, the official journal of the European Society for Agronomy, publishes original research papers reporting experimental and theoretical contributions to field-based agronomy and crop science. The journal will consider research at the field level for agricultural, horticultural and tree crops, that uses comprehensive and explanatory approaches. The EJA covers the following topics:
crop physiology
crop production and management including irrigation, fertilization and soil management
agroclimatology and modelling
plant-soil relationships
crop quality and post-harvest physiology
farming and cropping systems
agroecosystems and the environment
crop-weed interactions and management
organic farming
horticultural crops
papers from the European Society for Agronomy bi-annual meetings
In determining the suitability of submitted articles for publication, particular scrutiny is placed on the degree of novelty and significance of the research and the extent to which it adds to existing knowledge in agronomy.