Optimized feature selection and machine learning for non-destructive estimation of soil volumetric water content in Chinese cabbage using hyperspectral imaging
Seung-Hyun Im , Mohammad Akbar Faqeerzada , Byoung-Kwan Cho , Geonwoo Kim , Hoonsoo Lee
{"title":"Optimized feature selection and machine learning for non-destructive estimation of soil volumetric water content in Chinese cabbage using hyperspectral imaging","authors":"Seung-Hyun Im , Mohammad Akbar Faqeerzada , Byoung-Kwan Cho , Geonwoo Kim , Hoonsoo Lee","doi":"10.1016/j.vibspec.2025.103816","DOIUrl":null,"url":null,"abstract":"<div><div>Soil volumetric water content (SVWC) is a critical factor in plant health, influencing water uptake, nutrient transport, and overall physiological performance. Adverse environmental conditions like drought and high temperatures challenge crop growth and reduce yields. Accurate monitoring of SVWC is essential for optimizing growing conditions, preventing water stress, and promoting sustainable agriculture. This study explores a non-destructive method for predicting SVWC in Chinese cabbage seedlings using short-wave infrared (SWIR, 894–2504 nm) hyperspectral imaging coupled with machine learning. Daily hyperspectral images and corresponding SVWC measurements were collected over three days following irrigation cessation, resulting in a dataset of 2700 spectra. Gaussian process regression (GPR) and support vector regression (SVR) models were applied, with Lasso and Ridge regression used for feature selection. The models were evaluated using all spectral bands (E164) and 30 selected bands (L30 and R30). The GPR model with Lasso-selected bands and smoothing preprocessing achieved the highest accuracy (R² = 0.87, RMSE = 1.33). The SVR model with smoothing preprocessing and the entire spectral range demonstrated R² = 0.82 and RMSE = 1.52. Multivariate regression models using 14 shared bands selected by Lasso and Ridge regression yielded moderate performance (R² = 0.67, RMSE = 2.07). These findings highlight the potential of hyperspectral imaging combined with machine learning for non-destructive SVWC prediction, enabling early crop detection of water stress.</div></div>","PeriodicalId":23656,"journal":{"name":"Vibrational Spectroscopy","volume":"139 ","pages":"Article 103816"},"PeriodicalIF":2.7000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vibrational Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0924203125000505","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Soil volumetric water content (SVWC) is a critical factor in plant health, influencing water uptake, nutrient transport, and overall physiological performance. Adverse environmental conditions like drought and high temperatures challenge crop growth and reduce yields. Accurate monitoring of SVWC is essential for optimizing growing conditions, preventing water stress, and promoting sustainable agriculture. This study explores a non-destructive method for predicting SVWC in Chinese cabbage seedlings using short-wave infrared (SWIR, 894–2504 nm) hyperspectral imaging coupled with machine learning. Daily hyperspectral images and corresponding SVWC measurements were collected over three days following irrigation cessation, resulting in a dataset of 2700 spectra. Gaussian process regression (GPR) and support vector regression (SVR) models were applied, with Lasso and Ridge regression used for feature selection. The models were evaluated using all spectral bands (E164) and 30 selected bands (L30 and R30). The GPR model with Lasso-selected bands and smoothing preprocessing achieved the highest accuracy (R² = 0.87, RMSE = 1.33). The SVR model with smoothing preprocessing and the entire spectral range demonstrated R² = 0.82 and RMSE = 1.52. Multivariate regression models using 14 shared bands selected by Lasso and Ridge regression yielded moderate performance (R² = 0.67, RMSE = 2.07). These findings highlight the potential of hyperspectral imaging combined with machine learning for non-destructive SVWC prediction, enabling early crop detection of water stress.
期刊介绍:
Vibrational Spectroscopy provides a vehicle for the publication of original research that focuses on vibrational spectroscopy. This covers infrared, near-infrared and Raman spectroscopies and publishes papers dealing with developments in applications, theory, techniques and instrumentation.
The topics covered by the journal include:
Sampling techniques,
Vibrational spectroscopy coupled with separation techniques,
Instrumentation (Fourier transform, conventional and laser based),
Data manipulation,
Spectra-structure correlation and group frequencies.
The application areas covered include:
Analytical chemistry,
Bio-organic and bio-inorganic chemistry,
Organic chemistry,
Inorganic chemistry,
Catalysis,
Environmental science,
Industrial chemistry,
Materials science,
Physical chemistry,
Polymer science,
Process control,
Specialized problem solving.