Gbenga Daniel Adejumo, David Bulmer, Preston Sorenson, Derek Peak
{"title":"根据不同的傅立叶变换近红外光谱数据集建立土壤有机碳和总氮多元模型","authors":"Gbenga Daniel Adejumo, David Bulmer, Preston Sorenson, Derek Peak","doi":"10.1016/j.geodrs.2024.e00834","DOIUrl":null,"url":null,"abstract":"<div><p>This study linked soil FT-NIR spectroscopy with soil organic carbon (SOC) and total nitrogen (TN) content in Saskatchewan (SK) agricultural soils, using a multivariate approach. Soil spectra were acquired along with laboratory measurements of SOC and TN from 1965 Saskatchewan soil samples. Spectral data were transformed using a variety of common pre-treatment approaches: Savitszky-Golay, first and second derivative, standard normal variate, multiplicative scatter correction and continuous wavelet transform. Models were next built using cubist regression tree (Cubist), support vector machine (SVM), and partial least square regression (PLSR) to evaluate the performance of the different pre-treatment/modelling approaches. The continuous wavelets transform was the best performing spectral treatment method for SK agricultural SOC and TN. For predictive model using an extensive dataset, the cubist model performed best for SOC and TN (R<sup>2</sup> = 0.80 and 0.85) followed by SVM (R<sup>2</sup> = 0.77 and 0.85) and PLSR (R<sup>2</sup> = 0.63 and 0.73). However, all models demonstrated the same correlation between predicted and observed values for SOC and TN (CCC = 0.87 and 0.93). The consistent model accuracy with extensive soil dataset suggests model's ability to generalize well beyond the data it was trained on. However, model accuracy varies if trained using different soil zones and Sk agricultural sites, and this suggest the need for careful selection of specific site or soil-zone on which model should be trained. Additionally, this study also underscores the influence of factors beyond sample size and spectra variability, such as coefficient of variation, on the accuracy of SOC and TN predictions.</p></div>","PeriodicalId":56001,"journal":{"name":"Geoderma Regional","volume":"38 ","pages":"Article e00834"},"PeriodicalIF":3.1000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2352009424000816/pdfft?md5=10bca8d0596e2d110b93e161339243cb&pid=1-s2.0-S2352009424000816-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Soil organic carbon and total nitrogen multivariate modelling from diverse FT-NIR spectral dataset\",\"authors\":\"Gbenga Daniel Adejumo, David Bulmer, Preston Sorenson, Derek Peak\",\"doi\":\"10.1016/j.geodrs.2024.e00834\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>This study linked soil FT-NIR spectroscopy with soil organic carbon (SOC) and total nitrogen (TN) content in Saskatchewan (SK) agricultural soils, using a multivariate approach. Soil spectra were acquired along with laboratory measurements of SOC and TN from 1965 Saskatchewan soil samples. Spectral data were transformed using a variety of common pre-treatment approaches: Savitszky-Golay, first and second derivative, standard normal variate, multiplicative scatter correction and continuous wavelet transform. Models were next built using cubist regression tree (Cubist), support vector machine (SVM), and partial least square regression (PLSR) to evaluate the performance of the different pre-treatment/modelling approaches. The continuous wavelets transform was the best performing spectral treatment method for SK agricultural SOC and TN. For predictive model using an extensive dataset, the cubist model performed best for SOC and TN (R<sup>2</sup> = 0.80 and 0.85) followed by SVM (R<sup>2</sup> = 0.77 and 0.85) and PLSR (R<sup>2</sup> = 0.63 and 0.73). However, all models demonstrated the same correlation between predicted and observed values for SOC and TN (CCC = 0.87 and 0.93). The consistent model accuracy with extensive soil dataset suggests model's ability to generalize well beyond the data it was trained on. However, model accuracy varies if trained using different soil zones and Sk agricultural sites, and this suggest the need for careful selection of specific site or soil-zone on which model should be trained. Additionally, this study also underscores the influence of factors beyond sample size and spectra variability, such as coefficient of variation, on the accuracy of SOC and TN predictions.</p></div>\",\"PeriodicalId\":56001,\"journal\":{\"name\":\"Geoderma Regional\",\"volume\":\"38 \",\"pages\":\"Article e00834\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-07-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2352009424000816/pdfft?md5=10bca8d0596e2d110b93e161339243cb&pid=1-s2.0-S2352009424000816-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Geoderma Regional\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2352009424000816\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"SOIL SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoderma Regional","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352009424000816","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
Soil organic carbon and total nitrogen multivariate modelling from diverse FT-NIR spectral dataset
This study linked soil FT-NIR spectroscopy with soil organic carbon (SOC) and total nitrogen (TN) content in Saskatchewan (SK) agricultural soils, using a multivariate approach. Soil spectra were acquired along with laboratory measurements of SOC and TN from 1965 Saskatchewan soil samples. Spectral data were transformed using a variety of common pre-treatment approaches: Savitszky-Golay, first and second derivative, standard normal variate, multiplicative scatter correction and continuous wavelet transform. Models were next built using cubist regression tree (Cubist), support vector machine (SVM), and partial least square regression (PLSR) to evaluate the performance of the different pre-treatment/modelling approaches. The continuous wavelets transform was the best performing spectral treatment method for SK agricultural SOC and TN. For predictive model using an extensive dataset, the cubist model performed best for SOC and TN (R2 = 0.80 and 0.85) followed by SVM (R2 = 0.77 and 0.85) and PLSR (R2 = 0.63 and 0.73). However, all models demonstrated the same correlation between predicted and observed values for SOC and TN (CCC = 0.87 and 0.93). The consistent model accuracy with extensive soil dataset suggests model's ability to generalize well beyond the data it was trained on. However, model accuracy varies if trained using different soil zones and Sk agricultural sites, and this suggest the need for careful selection of specific site or soil-zone on which model should be trained. Additionally, this study also underscores the influence of factors beyond sample size and spectra variability, such as coefficient of variation, on the accuracy of SOC and TN predictions.
期刊介绍:
Global issues require studies and solutions on national and regional levels. Geoderma Regional focuses on studies that increase understanding and advance our scientific knowledge of soils in all regions of the world. The journal embraces every aspect of soil science and welcomes reviews of regional progress.