Liu Mochen , Yang Kuankuan , Yan Yinfa , Song Zhanhua , Tian Fuyang , Li Fade , Yu Zhenwei , Rongyao Zhang , Yang Qinglu , Lu Yao
{"title":"基于改进支持向量机鲸鱼优化算法的Vis-NIR光谱土壤全氮、全磷、全钾含量多光谱评价","authors":"Liu Mochen , Yang Kuankuan , Yan Yinfa , Song Zhanhua , Tian Fuyang , Li Fade , Yu Zhenwei , Rongyao Zhang , Yang Qinglu , Lu Yao","doi":"10.1016/j.still.2025.106567","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and non-destructive detection of total nitrogen (TN), total phosphorus (TP), and total potassium (TK) levels in soil is crucial for precise soil testing and fertilization in modernized precision agriculture. Traditional methods for soil composition analysis are expensive, time-consuming, and destructive. This research aims to establish a low-cost, high-precision, and non-destructive method for soil nutrient detection based on visible-near-infrared (Vis-NIR) spectroscopy (350–2500 nm) combined with improved machine learning algorithms. The Vis-NIR spectra of soil samples were acquired using the RS-5400 high-resolution ground feature spectrometer. Subsequently, the Monte Carlo sampling cross-validation (MCCV) algorithm was used to eliminate abnormal samples, and then different preprocessing methods were performed on the spectral data including first-derivative (FD), Savitzky-Golay smoothing (SG) and others. The optimal preprocessing method was selected from these options. In order to remove redundant information and increase the speed of calculation, five algorithms such as competitive adaptive reweighted sampling (CARS), iteratively retains informative variables (IRIV) and the variable iterative space shrinkage approach (VISSA)-IRIV algorithm were used to select feature variables. The characteristic wavelengths closely related to TN, TP, and TK in the soil have been extracted. Then, the RBF kernel (radial basis function) and poly kernel were mixed to obtain the RBF-poly hybrid kernel function, and then the hybrid kernel function support vector machine (RBF-poly-SVM) and the radial basis kernel function support vector machine (RBF-SVM) were applied respectively. Establish prediction models and introduce the whale optimization algorithm (WOA) to optimize the <em>g</em> (kernel function parameter), <em>c</em> (penalty factor) and k<sub><em>-rbf</em></sub> (weight coefficient) parameters in the two models. The performance of the developed models was tested using the coefficient of determination (<em>R</em><sup><em>2</em></sup>), the root mean squared error (<em>RMSE</em>) and the ratio of performance to deviation (<em>RPD</em>). The results demonstrated that among all models, the RBF-poly -SVM modeling methods were superior to the RBF-SVM model. The best results for estimation of TN, TP, and TK elements were achieved by the models of SG-square-FD + IRIV + RBF-poly-SVM (<em>R</em><sup><em>2</em></sup><sub><em>C</em></sub>=0.960, <em>R</em><sup><em>2</em></sup><sub><em>V</em></sub>=0.902, <em>RPD</em>=3.206), square-FD + IRIV + RBF-poly-SVM (<em>R</em><sup><em>2</em></sup><sub><em>C</em></sub>=0.999, <em>R</em><sup><em>2</em></sup><sub><em>V</em></sub>=0.937, <em>RPD</em>=3.939), square root + VISSA-IRIV + RBF-poly-SVM (<em>R</em><sup><em>2</em></sup><sub><em>C</em></sub>=0.955, <em>R</em><sup><em>2</em></sup><sub><em>V</em></sub>=0.904, <em>RPD</em>=2.608), respectively. The findings of the current approach own practical implications for agriculture and environmental management, as they enable more efficient and accurate soil nutrient monitoring and management.</div></div>","PeriodicalId":49503,"journal":{"name":"Soil & Tillage Research","volume":"252 ","pages":"Article 106567"},"PeriodicalIF":6.1000,"publicationDate":"2025-04-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-spectral evaluation of total nitrogen, phosphorus and potassium content in soil using Vis-NIR spectroscopy based on a modified support vector machine with whale optimization algorithm\",\"authors\":\"Liu Mochen , Yang Kuankuan , Yan Yinfa , Song Zhanhua , Tian Fuyang , Li Fade , Yu Zhenwei , Rongyao Zhang , Yang Qinglu , Lu Yao\",\"doi\":\"10.1016/j.still.2025.106567\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate and non-destructive detection of total nitrogen (TN), total phosphorus (TP), and total potassium (TK) levels in soil is crucial for precise soil testing and fertilization in modernized precision agriculture. Traditional methods for soil composition analysis are expensive, time-consuming, and destructive. This research aims to establish a low-cost, high-precision, and non-destructive method for soil nutrient detection based on visible-near-infrared (Vis-NIR) spectroscopy (350–2500 nm) combined with improved machine learning algorithms. The Vis-NIR spectra of soil samples were acquired using the RS-5400 high-resolution ground feature spectrometer. Subsequently, the Monte Carlo sampling cross-validation (MCCV) algorithm was used to eliminate abnormal samples, and then different preprocessing methods were performed on the spectral data including first-derivative (FD), Savitzky-Golay smoothing (SG) and others. The optimal preprocessing method was selected from these options. In order to remove redundant information and increase the speed of calculation, five algorithms such as competitive adaptive reweighted sampling (CARS), iteratively retains informative variables (IRIV) and the variable iterative space shrinkage approach (VISSA)-IRIV algorithm were used to select feature variables. The characteristic wavelengths closely related to TN, TP, and TK in the soil have been extracted. Then, the RBF kernel (radial basis function) and poly kernel were mixed to obtain the RBF-poly hybrid kernel function, and then the hybrid kernel function support vector machine (RBF-poly-SVM) and the radial basis kernel function support vector machine (RBF-SVM) were applied respectively. Establish prediction models and introduce the whale optimization algorithm (WOA) to optimize the <em>g</em> (kernel function parameter), <em>c</em> (penalty factor) and k<sub><em>-rbf</em></sub> (weight coefficient) parameters in the two models. The performance of the developed models was tested using the coefficient of determination (<em>R</em><sup><em>2</em></sup>), the root mean squared error (<em>RMSE</em>) and the ratio of performance to deviation (<em>RPD</em>). The results demonstrated that among all models, the RBF-poly -SVM modeling methods were superior to the RBF-SVM model. The best results for estimation of TN, TP, and TK elements were achieved by the models of SG-square-FD + IRIV + RBF-poly-SVM (<em>R</em><sup><em>2</em></sup><sub><em>C</em></sub>=0.960, <em>R</em><sup><em>2</em></sup><sub><em>V</em></sub>=0.902, <em>RPD</em>=3.206), square-FD + IRIV + RBF-poly-SVM (<em>R</em><sup><em>2</em></sup><sub><em>C</em></sub>=0.999, <em>R</em><sup><em>2</em></sup><sub><em>V</em></sub>=0.937, <em>RPD</em>=3.939), square root + VISSA-IRIV + RBF-poly-SVM (<em>R</em><sup><em>2</em></sup><sub><em>C</em></sub>=0.955, <em>R</em><sup><em>2</em></sup><sub><em>V</em></sub>=0.904, <em>RPD</em>=2.608), respectively. The findings of the current approach own practical implications for agriculture and environmental management, as they enable more efficient and accurate soil nutrient monitoring and management.</div></div>\",\"PeriodicalId\":49503,\"journal\":{\"name\":\"Soil & Tillage Research\",\"volume\":\"252 \",\"pages\":\"Article 106567\"},\"PeriodicalIF\":6.1000,\"publicationDate\":\"2025-04-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Soil & Tillage Research\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167198725001217\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SOIL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soil & Tillage Research","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167198725001217","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
Multi-spectral evaluation of total nitrogen, phosphorus and potassium content in soil using Vis-NIR spectroscopy based on a modified support vector machine with whale optimization algorithm
Accurate and non-destructive detection of total nitrogen (TN), total phosphorus (TP), and total potassium (TK) levels in soil is crucial for precise soil testing and fertilization in modernized precision agriculture. Traditional methods for soil composition analysis are expensive, time-consuming, and destructive. This research aims to establish a low-cost, high-precision, and non-destructive method for soil nutrient detection based on visible-near-infrared (Vis-NIR) spectroscopy (350–2500 nm) combined with improved machine learning algorithms. The Vis-NIR spectra of soil samples were acquired using the RS-5400 high-resolution ground feature spectrometer. Subsequently, the Monte Carlo sampling cross-validation (MCCV) algorithm was used to eliminate abnormal samples, and then different preprocessing methods were performed on the spectral data including first-derivative (FD), Savitzky-Golay smoothing (SG) and others. The optimal preprocessing method was selected from these options. In order to remove redundant information and increase the speed of calculation, five algorithms such as competitive adaptive reweighted sampling (CARS), iteratively retains informative variables (IRIV) and the variable iterative space shrinkage approach (VISSA)-IRIV algorithm were used to select feature variables. The characteristic wavelengths closely related to TN, TP, and TK in the soil have been extracted. Then, the RBF kernel (radial basis function) and poly kernel were mixed to obtain the RBF-poly hybrid kernel function, and then the hybrid kernel function support vector machine (RBF-poly-SVM) and the radial basis kernel function support vector machine (RBF-SVM) were applied respectively. Establish prediction models and introduce the whale optimization algorithm (WOA) to optimize the g (kernel function parameter), c (penalty factor) and k-rbf (weight coefficient) parameters in the two models. The performance of the developed models was tested using the coefficient of determination (R2), the root mean squared error (RMSE) and the ratio of performance to deviation (RPD). The results demonstrated that among all models, the RBF-poly -SVM modeling methods were superior to the RBF-SVM model. The best results for estimation of TN, TP, and TK elements were achieved by the models of SG-square-FD + IRIV + RBF-poly-SVM (R2C=0.960, R2V=0.902, RPD=3.206), square-FD + IRIV + RBF-poly-SVM (R2C=0.999, R2V=0.937, RPD=3.939), square root + VISSA-IRIV + RBF-poly-SVM (R2C=0.955, R2V=0.904, RPD=2.608), respectively. The findings of the current approach own practical implications for agriculture and environmental management, as they enable more efficient and accurate soil nutrient monitoring and management.
期刊介绍:
Soil & Tillage Research examines the physical, chemical and biological changes in the soil caused by tillage and field traffic. Manuscripts will be considered on aspects of soil science, physics, technology, mechanization and applied engineering for a sustainable balance among productivity, environmental quality and profitability. The following are examples of suitable topics within the scope of the journal of Soil and Tillage Research:
The agricultural and biosystems engineering associated with tillage (including no-tillage, reduced-tillage and direct drilling), irrigation and drainage, crops and crop rotations, fertilization, rehabilitation of mine spoils and processes used to modify soils. Soil change effects on establishment and yield of crops, growth of plants and roots, structure and erosion of soil, cycling of carbon and nutrients, greenhouse gas emissions, leaching, runoff and other processes that affect environmental quality. Characterization or modeling of tillage and field traffic responses, soil, climate, or topographic effects, soil deformation processes, tillage tools, traction devices, energy requirements, economics, surface and subsurface water quality effects, tillage effects on weed, pest and disease control, and their interactions.