Chenlu Wu, Yanqing Xie, Qiang Xi, Xiangli Han, Zheng Li, Gang Li, Jing Zhao, Ming Liu
{"title":"通过高光谱显微镜成像与机器学习算法相结合,快速、高精度地识别大肠杆菌的活性和失活状态","authors":"Chenlu Wu, Yanqing Xie, Qiang Xi, Xiangli Han, Zheng Li, Gang Li, Jing Zhao, Ming Liu","doi":"10.1016/j.vibspec.2023.103645","DOIUrl":null,"url":null,"abstract":"<p>Rapid identification of the active state of foodborne bacteria is crucial for ensuring the safety and quality control of food or pharmaceutical products. In this study, a combination of hyperspectral microscope imaging (HMI) and machine learning algorithm is employed for the identification of active state of Escherichia coli (E. coli). Hyperspectral microscope images of live, 100℃ heat inactivation and 121℃ high-pressure inactivation of E. coli are collected in wavelength range of 370-1060<!-- --> <!-- -->nm. Savitzky-Golay (SG) smoothing combing with normalization is used for spectra preprocessing. And principal component analysis (PCA) is employed for spectral dimension reduction. Four different regions of interest (ROIs), including the entire bacterial cell ROI (cell), the outer cell wall ROI (cell_r), the membrane structure ROI (cell_w) formed by the cell wall and cell membrane, and the central of the cell ROI (cell_cy), are extracted and used as model input variables to investigate the influence on the modeling results. Five model algorithms, support vector machines (SVM), random forests (RF), k-nearest neighbors (KNN) algorithms, discriminant analysis (DA) classifiers, and long short-term memory (LSTM) neural networks are used and compared. Modeling results with spectral data of cell_r perform better than those with other ROIs. Accuracy of the models with data of the cell_r ROI are as follows: 79.78% for SVM, 95.11% for RF, 91.33% for KNN, 98.22% for DA, and 93.78% for LSTM. DA achieves the highest classification accuracy. The results show that high-temperature inactivation induces changes in bacterial tissue and morphology, resulting in certain spectral differences among bacteria in three different states. The combination of hyperspectral microscope imaging and machine learning algorithm can provide an effective method for identification of active and inactive states of E. coli. Furthermore, the model, constructed with the data of cell_r ROI, exhibits the best performance in identification.</p>","PeriodicalId":23656,"journal":{"name":"Vibrational Spectroscopy","volume":"13 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Rapid and high accurate identification of Escherichia coli active and inactivated state by hyperspectral microscope imaging combing with machine learning algorithm\",\"authors\":\"Chenlu Wu, Yanqing Xie, Qiang Xi, Xiangli Han, Zheng Li, Gang Li, Jing Zhao, Ming Liu\",\"doi\":\"10.1016/j.vibspec.2023.103645\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Rapid identification of the active state of foodborne bacteria is crucial for ensuring the safety and quality control of food or pharmaceutical products. In this study, a combination of hyperspectral microscope imaging (HMI) and machine learning algorithm is employed for the identification of active state of Escherichia coli (E. coli). Hyperspectral microscope images of live, 100℃ heat inactivation and 121℃ high-pressure inactivation of E. coli are collected in wavelength range of 370-1060<!-- --> <!-- -->nm. Savitzky-Golay (SG) smoothing combing with normalization is used for spectra preprocessing. And principal component analysis (PCA) is employed for spectral dimension reduction. Four different regions of interest (ROIs), including the entire bacterial cell ROI (cell), the outer cell wall ROI (cell_r), the membrane structure ROI (cell_w) formed by the cell wall and cell membrane, and the central of the cell ROI (cell_cy), are extracted and used as model input variables to investigate the influence on the modeling results. Five model algorithms, support vector machines (SVM), random forests (RF), k-nearest neighbors (KNN) algorithms, discriminant analysis (DA) classifiers, and long short-term memory (LSTM) neural networks are used and compared. Modeling results with spectral data of cell_r perform better than those with other ROIs. Accuracy of the models with data of the cell_r ROI are as follows: 79.78% for SVM, 95.11% for RF, 91.33% for KNN, 98.22% for DA, and 93.78% for LSTM. DA achieves the highest classification accuracy. The results show that high-temperature inactivation induces changes in bacterial tissue and morphology, resulting in certain spectral differences among bacteria in three different states. The combination of hyperspectral microscope imaging and machine learning algorithm can provide an effective method for identification of active and inactive states of E. coli. Furthermore, the model, constructed with the data of cell_r ROI, exhibits the best performance in identification.</p>\",\"PeriodicalId\":23656,\"journal\":{\"name\":\"Vibrational Spectroscopy\",\"volume\":\"13 1\",\"pages\":\"\"},\"PeriodicalIF\":2.7000,\"publicationDate\":\"2023-12-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Vibrational Spectroscopy\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1016/j.vibspec.2023.103645\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Vibrational Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1016/j.vibspec.2023.103645","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
摘要
快速识别食源性细菌的活性状态对于确保食品或药品的安全和质量控制至关重要。本研究采用高光谱显微成像(HMI)和机器学习算法相结合的方法来识别大肠杆菌(E. coli)的活性状态。在 370-1060 nm 波长范围内采集活体、100℃ 热灭活和 121℃高压灭活大肠杆菌的高光谱显微镜图像。萨维茨基-戈莱(SG)平滑梳理和归一化用于光谱预处理。主成分分析(PCA)用于降低光谱维度。提取四个不同的感兴趣区(ROI),包括整个细菌细胞感兴趣区(cell)、细胞外壁感兴趣区(cell_r)、由细胞壁和细胞膜形成的膜结构感兴趣区(cell_w)以及细胞中心感兴趣区(cell_cy),并将其作为模型输入变量,以研究其对建模结果的影响。使用了支持向量机(SVM)、随机森林(RF)、k-近邻(KNN)算法、判别分析(DA)分类器和长短期记忆(LSTM)神经网络等五种模型算法,并对其进行了比较。使用 cell_r 光谱数据的建模结果优于使用其他 ROI 的结果。使用 cell_r ROI 数据的模型准确率如下:SVM 为 79.78%,RF 为 95.11%,KNN 为 91.33%,DA 为 98.22%,LSTM 为 93.78%。DA 的分类准确率最高。结果表明,高温灭活会引起细菌组织和形态的变化,导致三种不同状态下的细菌存在一定的光谱差异。高光谱显微成像与机器学习算法的结合可为识别大肠杆菌的活性和非活性状态提供一种有效的方法。此外,利用 cell_r ROI 数据构建的模型在识别方面表现最佳。
Rapid and high accurate identification of Escherichia coli active and inactivated state by hyperspectral microscope imaging combing with machine learning algorithm
Rapid identification of the active state of foodborne bacteria is crucial for ensuring the safety and quality control of food or pharmaceutical products. In this study, a combination of hyperspectral microscope imaging (HMI) and machine learning algorithm is employed for the identification of active state of Escherichia coli (E. coli). Hyperspectral microscope images of live, 100℃ heat inactivation and 121℃ high-pressure inactivation of E. coli are collected in wavelength range of 370-1060 nm. Savitzky-Golay (SG) smoothing combing with normalization is used for spectra preprocessing. And principal component analysis (PCA) is employed for spectral dimension reduction. Four different regions of interest (ROIs), including the entire bacterial cell ROI (cell), the outer cell wall ROI (cell_r), the membrane structure ROI (cell_w) formed by the cell wall and cell membrane, and the central of the cell ROI (cell_cy), are extracted and used as model input variables to investigate the influence on the modeling results. Five model algorithms, support vector machines (SVM), random forests (RF), k-nearest neighbors (KNN) algorithms, discriminant analysis (DA) classifiers, and long short-term memory (LSTM) neural networks are used and compared. Modeling results with spectral data of cell_r perform better than those with other ROIs. Accuracy of the models with data of the cell_r ROI are as follows: 79.78% for SVM, 95.11% for RF, 91.33% for KNN, 98.22% for DA, and 93.78% for LSTM. DA achieves the highest classification accuracy. The results show that high-temperature inactivation induces changes in bacterial tissue and morphology, resulting in certain spectral differences among bacteria in three different states. The combination of hyperspectral microscope imaging and machine learning algorithm can provide an effective method for identification of active and inactive states of E. coli. Furthermore, the model, constructed with the data of cell_r ROI, exhibits the best performance in identification.
期刊介绍:
Vibrational Spectroscopy provides a vehicle for the publication of original research that focuses on vibrational spectroscopy. This covers infrared, near-infrared and Raman spectroscopies and publishes papers dealing with developments in applications, theory, techniques and instrumentation.
The topics covered by the journal include:
Sampling techniques,
Vibrational spectroscopy coupled with separation techniques,
Instrumentation (Fourier transform, conventional and laser based),
Data manipulation,
Spectra-structure correlation and group frequencies.
The application areas covered include:
Analytical chemistry,
Bio-organic and bio-inorganic chemistry,
Organic chemistry,
Inorganic chemistry,
Catalysis,
Environmental science,
Industrial chemistry,
Materials science,
Physical chemistry,
Polymer science,
Process control,
Specialized problem solving.