{"title":"Transferring near infrared spectral calibration models without standards via multistep wavelength selection","authors":"L. Ni, Zhange Zhang, Liguo Zhang, S. Luan","doi":"10.1177/09670335231168437","DOIUrl":null,"url":null,"abstract":"Two case studies were conducted to verify calibration model transfer methods without standards by multi-step wavelength selection, using 3–7 near infrared spectrometers to predict ingredients in corn and total plant alkaloids (TPA) in tobacco leaves. Based on the characteristic wavelengths of Uc, which are selected using the scale-invariant feature transform (SIFT), this study advances two multistep wavelength selection methods by selecting wavelengths with high independence and a high standard deviation of the sample spectra (SDSS). The first method, SIFT-SDSS-CORX, selects important characteristic wavelengths Uc-i from Uc whose SDSS is greater than a threshold SDSSacrit. Subsequently, rx, the correlation coefficient matrix between spectral signals of Uc-i, is calculated, and only one wavelength is retained from those whose correlation coefficients exceed a threshold, rxacrit. The wavelength set Uc-i-rx, which is finally screened, is important and independent. In the second method, SIFT-CORX-SDSS, Uc-rx is first selected from Uc by retaining only one wavelength from those whose correlation coefficients between spectral signals of Uc exceed a threshold, rxbcrit. Subsequently, the wavelengths Uc-rx-i with SDSS exceeding a threshold SDSSbcrit are selected from Uc-rx. Near infrared spectroscopy calibration models for predicting protein and oil in corn and TPA in tobacco leaves were built using partial least squares regression (PLS) based on different wavelength sets of Uc, Uc-i, Uc-i-rx, Uc-rx, and Uc-rx-i, respectively. The latent variables used in the PLS models were determined by an accumulative contribution ratio over 99.9%. The results indicate that the PLS models built on Uc-i-rx and Uc-rx-i are effective on both primary and secondary units for corn and tobacco samples. This study utilises a three-step wavelength selection method to select highly independent, important, and characteristic spectral variables, thereby enhancing the robustness, simplicity, and interpretability of NIR) calibration models and facilitating their transfer to secondary units without standards.","PeriodicalId":16551,"journal":{"name":"Journal of Near Infrared Spectroscopy","volume":"31 1","pages":"171 - 185"},"PeriodicalIF":1.6000,"publicationDate":"2023-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Near Infrared Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1177/09670335231168437","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 1
Abstract
Two case studies were conducted to verify calibration model transfer methods without standards by multi-step wavelength selection, using 3–7 near infrared spectrometers to predict ingredients in corn and total plant alkaloids (TPA) in tobacco leaves. Based on the characteristic wavelengths of Uc, which are selected using the scale-invariant feature transform (SIFT), this study advances two multistep wavelength selection methods by selecting wavelengths with high independence and a high standard deviation of the sample spectra (SDSS). The first method, SIFT-SDSS-CORX, selects important characteristic wavelengths Uc-i from Uc whose SDSS is greater than a threshold SDSSacrit. Subsequently, rx, the correlation coefficient matrix between spectral signals of Uc-i, is calculated, and only one wavelength is retained from those whose correlation coefficients exceed a threshold, rxacrit. The wavelength set Uc-i-rx, which is finally screened, is important and independent. In the second method, SIFT-CORX-SDSS, Uc-rx is first selected from Uc by retaining only one wavelength from those whose correlation coefficients between spectral signals of Uc exceed a threshold, rxbcrit. Subsequently, the wavelengths Uc-rx-i with SDSS exceeding a threshold SDSSbcrit are selected from Uc-rx. Near infrared spectroscopy calibration models for predicting protein and oil in corn and TPA in tobacco leaves were built using partial least squares regression (PLS) based on different wavelength sets of Uc, Uc-i, Uc-i-rx, Uc-rx, and Uc-rx-i, respectively. The latent variables used in the PLS models were determined by an accumulative contribution ratio over 99.9%. The results indicate that the PLS models built on Uc-i-rx and Uc-rx-i are effective on both primary and secondary units for corn and tobacco samples. This study utilises a three-step wavelength selection method to select highly independent, important, and characteristic spectral variables, thereby enhancing the robustness, simplicity, and interpretability of NIR) calibration models and facilitating their transfer to secondary units without standards.
期刊介绍:
JNIRS — Journal of Near Infrared Spectroscopy is a peer reviewed journal, publishing original research papers, short communications, review articles and letters concerned with near infrared spectroscopy and technology, its application, new instrumentation and the use of chemometric and data handling techniques within NIR.