{"title":"将统计机器学习技术应用于近红外光谱数据,通过优化预处理和波长选择快速检测姜黄粉中的苏丹染料-I","authors":"Saumita Kar, Bipan Tudu, Rajib Bandyopadhyay","doi":"10.1007/s13197-024-05971-9","DOIUrl":null,"url":null,"abstract":"<div><p>Machine learning techniques were applied systematically to the spectral data of near-infrared (NIR) spectroscopy to find out the sudan dye I adulterants in turmeric powders. Turmeric powder is one of the most commonly used spice and a simple target for adulteration. Pure turmeric powder was prepared at the laboratory and spiked with sudan dye I adulterants. The spectral data of these adulterated mixtures were obtained by NIR spectrometer and investigated accordingly. The concentrations of the adulterants were 1%, 5%, 10%, 15%, 20%, 25%, 30% (w/w) respectively. Exploratory data analysis was done for the visualization of the adulterant classes by principal component analysis (PCA). Optimization of the pre-processing and wavelength selection was done by cross-validation techniques using a partial least squares regression (PLSR) model. For quantitative analysis four different regression techniques were applied namely ensemble tree regression (ENTR), support vector regression (SVR), principal component regression (PCR), and PLSR, and a comparative analysis was done. The best method was found to be PLSR. The accuracy of the PLSR analysis was determined with the coefficients of determination (R<sup>2</sup>) of greater than 0.97 and with root mean square error (RMSE) of less than 0.93 respectively. For the verification of the robustness of the model, the <i>Figure of merit</i> (FOM) of the model was derived with the help of the Net analyte signal (NAS) theory. The current study established that the NIR spectroscopy can be applied to detect and quantify the amount of sudan dye I adulterants added to the turmeric powders with satisfactory accuracy.</p></div>","PeriodicalId":632,"journal":{"name":"Journal of Food Science and Technology","volume":null,"pages":null},"PeriodicalIF":2.7010,"publicationDate":"2024-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Statistical machine learning techniques applied to NIR spectral data for rapid detection of sudan dye-I in turmeric powders with optimized pre-processing and wavelength selection\",\"authors\":\"Saumita Kar, Bipan Tudu, Rajib Bandyopadhyay\",\"doi\":\"10.1007/s13197-024-05971-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Machine learning techniques were applied systematically to the spectral data of near-infrared (NIR) spectroscopy to find out the sudan dye I adulterants in turmeric powders. Turmeric powder is one of the most commonly used spice and a simple target for adulteration. Pure turmeric powder was prepared at the laboratory and spiked with sudan dye I adulterants. The spectral data of these adulterated mixtures were obtained by NIR spectrometer and investigated accordingly. The concentrations of the adulterants were 1%, 5%, 10%, 15%, 20%, 25%, 30% (w/w) respectively. Exploratory data analysis was done for the visualization of the adulterant classes by principal component analysis (PCA). Optimization of the pre-processing and wavelength selection was done by cross-validation techniques using a partial least squares regression (PLSR) model. For quantitative analysis four different regression techniques were applied namely ensemble tree regression (ENTR), support vector regression (SVR), principal component regression (PCR), and PLSR, and a comparative analysis was done. The best method was found to be PLSR. The accuracy of the PLSR analysis was determined with the coefficients of determination (R<sup>2</sup>) of greater than 0.97 and with root mean square error (RMSE) of less than 0.93 respectively. For the verification of the robustness of the model, the <i>Figure of merit</i> (FOM) of the model was derived with the help of the Net analyte signal (NAS) theory. The current study established that the NIR spectroscopy can be applied to detect and quantify the amount of sudan dye I adulterants added to the turmeric powders with satisfactory accuracy.</p></div>\",\"PeriodicalId\":632,\"journal\":{\"name\":\"Journal of Food Science and Technology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.7010,\"publicationDate\":\"2024-03-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Food Science and Technology\",\"FirstCategoryId\":\"1\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s13197-024-05971-9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Science and Technology","FirstCategoryId":"1","ListUrlMain":"https://link.springer.com/article/10.1007/s13197-024-05971-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
将机器学习技术系统地应用于近红外光谱(NIR)的光谱数据,以找出姜黄粉中的苏丹染料 I 掺杂物。姜黄粉是最常用的香料之一,也是最容易掺假的目标。实验室制备了纯姜黄粉,并在其中添加了茚三酮染料 I 掺杂物。用近红外光谱仪获得了这些掺假混合物的光谱数据,并进行了相应的研究。掺杂物的浓度分别为 1%、5%、10%、15%、20%、25%、30%(重量比)。通过主成分分析(PCA)进行了探索性数据分析,以确定掺杂物的类别。使用偏最小二乘回归(PLSR)模型,通过交叉验证技术对预处理和波长选择进行了优化。在定量分析方面,应用了四种不同的回归技术,即集合树回归(ENTR)、支持向量回归(SVR)、主成分回归(PCR)和 PLSR,并进行了比较分析。结果发现,PLSR 是最好的方法。PLSR 分析的确定系数 (R2) 大于 0.97,均方根误差 (RMSE) 小于 0.93,从而确定了 PLSR 分析的准确性。为了验证模型的稳健性,借助净分析物信号(NAS)理论得出了模型的优点图(FOM)。本研究证实,近红外光谱法可用于检测和定量姜黄粉中添加的茚三酮染料 I 掺杂物的含量,其准确性令人满意。
Statistical machine learning techniques applied to NIR spectral data for rapid detection of sudan dye-I in turmeric powders with optimized pre-processing and wavelength selection
Machine learning techniques were applied systematically to the spectral data of near-infrared (NIR) spectroscopy to find out the sudan dye I adulterants in turmeric powders. Turmeric powder is one of the most commonly used spice and a simple target for adulteration. Pure turmeric powder was prepared at the laboratory and spiked with sudan dye I adulterants. The spectral data of these adulterated mixtures were obtained by NIR spectrometer and investigated accordingly. The concentrations of the adulterants were 1%, 5%, 10%, 15%, 20%, 25%, 30% (w/w) respectively. Exploratory data analysis was done for the visualization of the adulterant classes by principal component analysis (PCA). Optimization of the pre-processing and wavelength selection was done by cross-validation techniques using a partial least squares regression (PLSR) model. For quantitative analysis four different regression techniques were applied namely ensemble tree regression (ENTR), support vector regression (SVR), principal component regression (PCR), and PLSR, and a comparative analysis was done. The best method was found to be PLSR. The accuracy of the PLSR analysis was determined with the coefficients of determination (R2) of greater than 0.97 and with root mean square error (RMSE) of less than 0.93 respectively. For the verification of the robustness of the model, the Figure of merit (FOM) of the model was derived with the help of the Net analyte signal (NAS) theory. The current study established that the NIR spectroscopy can be applied to detect and quantify the amount of sudan dye I adulterants added to the turmeric powders with satisfactory accuracy.