Hyperspectral imaging combined with optimized SPA-GA and CPO-SVR machine learning models for the rapid determination of the total nitrogen content in cellar soil
{"title":"Hyperspectral imaging combined with optimized SPA-GA and CPO-SVR machine learning models for the rapid determination of the total nitrogen content in cellar soil","authors":"Yifei Zhou, Jianping Tian, Xinjun Hu, Haili Yang, Liangliang Xie, Yuexiang Huang, Yuanyuan Xia, Jianheng Peng, Dan Huang","doi":"10.1002/saj2.70095","DOIUrl":null,"url":null,"abstract":"<p>The fermentation of Baijiu grains in the cellar is significantly influenced by the quality of the cellar soil, which contains a diverse range of microorganisms and physicochemical components. Among these, the total nitrogen content (TNC) is a critical indicator of soil quality and thus requires real-time monitoring to ensure quality control of the Baijiu. In this study, we developed two optimized machine learning algorithms—successive projection algorithm-genetic algorithm (SPA-GA) and crown porcupine optimization (CPO) achieve the rapid and accurate detection of the TNC in cellar soil using hyperspectral imaging (HSI). The feature wavelengths were selected by combining the SPA with the GA. Subsequently, the support vector machine regression (SVR) algorithm was further optimized using the CPO algorithm to establish a prediction model for determining the TNC. Comparative analysis of the various models demonstrated that the CPO-SVR model based on the feature wavelength spectral data extracted by the SPA-GA exhibited the best performance (<span></span><math>\n <semantics>\n <mrow>\n <msup>\n <msub>\n <mi>R</mi>\n <mi>p</mi>\n </msub>\n <mn>2</mn>\n </msup>\n <mspace></mspace>\n </mrow>\n <annotation>${R_p}^{\\mathrm{2}}\\;$</annotation>\n </semantics></math>= 0.9958, root-mean square error of prediction [RMSEP] = 0.0073 g/100 g). This model reduced the number of wavelengths by 86.16%, increased the <span></span><math>\n <semantics>\n <mrow>\n <mspace></mspace>\n <msup>\n <msub>\n <mi>R</mi>\n <mi>p</mi>\n </msub>\n <mn>2</mn>\n </msup>\n </mrow>\n <annotation>$\\;{R_p}^{\\mathrm{2}}$</annotation>\n </semantics></math> by 0.3014, and decreased the RMSEP by 0.0566 compared to the same model built using the full-wavelength spectral data. These results indicated that the GA significantly enhanced the feature extraction capability of the SPA, thereby improving the model accuracy while reducing the number of wavelengths to reduce computational load. Furthermore, CPO was introduced to optimize the SVR, yielding the optimal parameter combination, which further improved the prediction model performance and accuracy while mitigating artificial parameter-seeking instability. HSI, in conjunction with the optimization algorithms, offers a novel method for the rapid, non-destructive detection of total nitrogen and other components in cellar mud.</p>","PeriodicalId":101043,"journal":{"name":"Proceedings - Soil Science Society of America","volume":"89 4","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings - Soil Science Society of America","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/saj2.70095","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
The fermentation of Baijiu grains in the cellar is significantly influenced by the quality of the cellar soil, which contains a diverse range of microorganisms and physicochemical components. Among these, the total nitrogen content (TNC) is a critical indicator of soil quality and thus requires real-time monitoring to ensure quality control of the Baijiu. In this study, we developed two optimized machine learning algorithms—successive projection algorithm-genetic algorithm (SPA-GA) and crown porcupine optimization (CPO) achieve the rapid and accurate detection of the TNC in cellar soil using hyperspectral imaging (HSI). The feature wavelengths were selected by combining the SPA with the GA. Subsequently, the support vector machine regression (SVR) algorithm was further optimized using the CPO algorithm to establish a prediction model for determining the TNC. Comparative analysis of the various models demonstrated that the CPO-SVR model based on the feature wavelength spectral data extracted by the SPA-GA exhibited the best performance (= 0.9958, root-mean square error of prediction [RMSEP] = 0.0073 g/100 g). This model reduced the number of wavelengths by 86.16%, increased the by 0.3014, and decreased the RMSEP by 0.0566 compared to the same model built using the full-wavelength spectral data. These results indicated that the GA significantly enhanced the feature extraction capability of the SPA, thereby improving the model accuracy while reducing the number of wavelengths to reduce computational load. Furthermore, CPO was introduced to optimize the SVR, yielding the optimal parameter combination, which further improved the prediction model performance and accuracy while mitigating artificial parameter-seeking instability. HSI, in conjunction with the optimization algorithms, offers a novel method for the rapid, non-destructive detection of total nitrogen and other components in cellar mud.