Gbenga Adejumo , Mervin St. Luce , David Bulmer , Preston Sorenson , Derek Peak
{"title":"Performance of the Global-Local modelling approach for FT-NIR predictions of SOC and TN in diverse Saskatchewan agricultural soils","authors":"Gbenga Adejumo , Mervin St. Luce , David Bulmer , Preston Sorenson , Derek Peak","doi":"10.1016/j.geoderma.2025.117477","DOIUrl":null,"url":null,"abstract":"<div><div>Precision agriculture requires a reliable, cost-effective method to measure soil organic carbon (SOC) and total nitrogen (TN), and Fourier Transform Near Infrared (FT-NIR) spectroscopy offers a promising solution. Here, we applied the <em>Global-Local</em> model to improve FT-NIR SOC and TN predictions in Saskatchewan agricultural soils. Soil samples (SOC: <em>n =</em> 1876; TN: <em>n =</em> 1442) were collected in 2020 and 2021 from six Saskatchewan agricultural regions. Spectral data were acquired, preprocessed using continuous wavelet transform (CWT), and modelled using Cubist regression. The <em>Global-Local</em> model was applied by combining a small subset of site-specific samples (<em>Lab</em>) with their <em>k</em>-nearest neighbours (<em>Neighbour</em>) from Saskatchewan spectral datasets. Its performance was compared with Leave-One-Site-Out (<em>LOSOV</em>), site-specific, <em>Lab</em>, <em>Neighbour</em>, and traditional spiking. Compared to <em>LOSOV</em> (SOC: R<sup>2</sup> = 0.55 – 0.76, CCC = 0.67 – 0.79, RPD = 1.20 – 1.44), site-specific models gave higher performance (SOC: R<sup>2</sup> = 0.71 – 0.88, CCC = 0.82 – 0.92; RPD = 1.59 – 2.70). The <em>Global-Local</em> model performed better than <em>LOSOV</em> and performed similarly to the best <em>Lab</em> or <em>Neighbour</em> models. Compared to the <em>Global-Local</em>, traditional spiking either improved or gave similar results due to higher variability in target variable and spectra datasets. The more accurate models using either spiking or <em>Global-Local</em> than <em>LOSOV</em> confirms the importance of incorporating site-specific samples into training datasets. Our results indicate that the application of the <em>Global-Local</em> model should be restricted to an individual field level, which was its original purpose. Future studies on optimization of the <em>Global-Local</em> model is needed to scale-up its application.</div></div>","PeriodicalId":12511,"journal":{"name":"Geoderma","volume":"461 ","pages":"Article 117477"},"PeriodicalIF":6.6000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoderma","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016706125003180","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Precision agriculture requires a reliable, cost-effective method to measure soil organic carbon (SOC) and total nitrogen (TN), and Fourier Transform Near Infrared (FT-NIR) spectroscopy offers a promising solution. Here, we applied the Global-Local model to improve FT-NIR SOC and TN predictions in Saskatchewan agricultural soils. Soil samples (SOC: n = 1876; TN: n = 1442) were collected in 2020 and 2021 from six Saskatchewan agricultural regions. Spectral data were acquired, preprocessed using continuous wavelet transform (CWT), and modelled using Cubist regression. The Global-Local model was applied by combining a small subset of site-specific samples (Lab) with their k-nearest neighbours (Neighbour) from Saskatchewan spectral datasets. Its performance was compared with Leave-One-Site-Out (LOSOV), site-specific, Lab, Neighbour, and traditional spiking. Compared to LOSOV (SOC: R2 = 0.55 – 0.76, CCC = 0.67 – 0.79, RPD = 1.20 – 1.44), site-specific models gave higher performance (SOC: R2 = 0.71 – 0.88, CCC = 0.82 – 0.92; RPD = 1.59 – 2.70). The Global-Local model performed better than LOSOV and performed similarly to the best Lab or Neighbour models. Compared to the Global-Local, traditional spiking either improved or gave similar results due to higher variability in target variable and spectra datasets. The more accurate models using either spiking or Global-Local than LOSOV confirms the importance of incorporating site-specific samples into training datasets. Our results indicate that the application of the Global-Local model should be restricted to an individual field level, which was its original purpose. Future studies on optimization of the Global-Local model is needed to scale-up its application.
期刊介绍:
Geoderma - the global journal of soil science - welcomes authors, readers and soil research from all parts of the world, encourages worldwide soil studies, and embraces all aspects of soil science and its associated pedagogy. The journal particularly welcomes interdisciplinary work focusing on dynamic soil processes and functions across space and time.