{"title":"根据化学成分有效和无效吸收波段选择近红外反射光谱预处理方法","authors":"Tiancheng Huang, Guimin Cai, Hubin Liu, Zhiyue Feng, Longlian Zhao, Junhui Li","doi":"10.1080/00387010.2022.2136200","DOIUrl":null,"url":null,"abstract":"Abstract An appropriate pretreatment method is crucial to establish a reliable model for near-infrared spectroscopy. It is usually selected by comparing model performance with different pretreatment methods, such as the root mean square error of internal cross-validation or the root mean square error of prediction. When the statistical indexes of the models are similar and indistinguishable, or the results of model calibration and cross-validation are too different, indicating an overfitting situation, it may be due to the selection of an inappropriate pretreatment. In this paper, an approach is proposed to select appropriate pretreatment methods by chemical components valid and invalid absorption wavebands. Due to there being no chemical groups absorption in wavebands of 1250–1350 nm and 1780–1850 nm, they are considered chemical invalid absorption bands. The root mean square distance in the invalid wavebands characterizes the difference in samples’ physical state, such as particle size and instrument error. Except for the invalid absorption band, other wavebands in the near-infrared region are considered chemical valid absorption wavebands. The root mean square distance in the valid absorption wavebands mainly characterize the chemical composition differences between different samples. So, we define effective information rate of spectra. The larger effective information rate of spectra, the more favorable it is to establish a quantitative model. In this study, spectra of 60 wheat powder samples were used to compare seven pretreatment methods. Moreover, the average effective information rates of spectra were calculated for the seven pretreatment methods respectively. The results show that the combination of standard normal variate transformation and first derivative with Savitzky–Golay smoothing is the best. Finally, a comparison of model prediction with different pretreatment methods was made. The results are the same as the former. Therefore, using the effective information rate of spectra for selecting an appropriate near-infrared spectroscopy pretreatment method is practical and can be used not only for near-infrared modeling but also for designing near-infrared instruments as an index for optimization.","PeriodicalId":21953,"journal":{"name":"Spectroscopy Letters","volume":"55 1","pages":"607 - 617"},"PeriodicalIF":1.1000,"publicationDate":"2022-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Selecting near-infrared reflection spectroscopy pretreatment methods by chemical components valid and invalid absorption wavebands\",\"authors\":\"Tiancheng Huang, Guimin Cai, Hubin Liu, Zhiyue Feng, Longlian Zhao, Junhui Li\",\"doi\":\"10.1080/00387010.2022.2136200\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract An appropriate pretreatment method is crucial to establish a reliable model for near-infrared spectroscopy. It is usually selected by comparing model performance with different pretreatment methods, such as the root mean square error of internal cross-validation or the root mean square error of prediction. When the statistical indexes of the models are similar and indistinguishable, or the results of model calibration and cross-validation are too different, indicating an overfitting situation, it may be due to the selection of an inappropriate pretreatment. In this paper, an approach is proposed to select appropriate pretreatment methods by chemical components valid and invalid absorption wavebands. Due to there being no chemical groups absorption in wavebands of 1250–1350 nm and 1780–1850 nm, they are considered chemical invalid absorption bands. The root mean square distance in the invalid wavebands characterizes the difference in samples’ physical state, such as particle size and instrument error. Except for the invalid absorption band, other wavebands in the near-infrared region are considered chemical valid absorption wavebands. The root mean square distance in the valid absorption wavebands mainly characterize the chemical composition differences between different samples. So, we define effective information rate of spectra. The larger effective information rate of spectra, the more favorable it is to establish a quantitative model. In this study, spectra of 60 wheat powder samples were used to compare seven pretreatment methods. Moreover, the average effective information rates of spectra were calculated for the seven pretreatment methods respectively. The results show that the combination of standard normal variate transformation and first derivative with Savitzky–Golay smoothing is the best. Finally, a comparison of model prediction with different pretreatment methods was made. The results are the same as the former. Therefore, using the effective information rate of spectra for selecting an appropriate near-infrared spectroscopy pretreatment method is practical and can be used not only for near-infrared modeling but also for designing near-infrared instruments as an index for optimization.\",\"PeriodicalId\":21953,\"journal\":{\"name\":\"Spectroscopy Letters\",\"volume\":\"55 1\",\"pages\":\"607 - 617\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2022-10-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Spectroscopy Letters\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1080/00387010.2022.2136200\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"SPECTROSCOPY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectroscopy Letters","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1080/00387010.2022.2136200","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
Selecting near-infrared reflection spectroscopy pretreatment methods by chemical components valid and invalid absorption wavebands
Abstract An appropriate pretreatment method is crucial to establish a reliable model for near-infrared spectroscopy. It is usually selected by comparing model performance with different pretreatment methods, such as the root mean square error of internal cross-validation or the root mean square error of prediction. When the statistical indexes of the models are similar and indistinguishable, or the results of model calibration and cross-validation are too different, indicating an overfitting situation, it may be due to the selection of an inappropriate pretreatment. In this paper, an approach is proposed to select appropriate pretreatment methods by chemical components valid and invalid absorption wavebands. Due to there being no chemical groups absorption in wavebands of 1250–1350 nm and 1780–1850 nm, they are considered chemical invalid absorption bands. The root mean square distance in the invalid wavebands characterizes the difference in samples’ physical state, such as particle size and instrument error. Except for the invalid absorption band, other wavebands in the near-infrared region are considered chemical valid absorption wavebands. The root mean square distance in the valid absorption wavebands mainly characterize the chemical composition differences between different samples. So, we define effective information rate of spectra. The larger effective information rate of spectra, the more favorable it is to establish a quantitative model. In this study, spectra of 60 wheat powder samples were used to compare seven pretreatment methods. Moreover, the average effective information rates of spectra were calculated for the seven pretreatment methods respectively. The results show that the combination of standard normal variate transformation and first derivative with Savitzky–Golay smoothing is the best. Finally, a comparison of model prediction with different pretreatment methods was made. The results are the same as the former. Therefore, using the effective information rate of spectra for selecting an appropriate near-infrared spectroscopy pretreatment method is practical and can be used not only for near-infrared modeling but also for designing near-infrared instruments as an index for optimization.
期刊介绍:
Spectroscopy Letters provides vital coverage of all types of spectroscopy across all the disciplines where they are used—including novel work in fundamental spectroscopy, applications, diagnostics and instrumentation. The audience is intended to be all practicing spectroscopists across all scientific (and some engineering) disciplines, including: physics, chemistry, biology, instrumentation science, and pharmaceutical science.