{"title":"近红外光谱与机器学习技术快速测定菊花有效成分含量及品种鉴定","authors":"Ranran Cheng , Yangfei Ding , Dongliang Jiang , Jiuba Zhang , Mengru Wang , Yan Xu , Hongsu Zhao , Xiang Cheng , Deling Wu , Wei Zhang","doi":"10.1016/j.jfca.2025.108334","DOIUrl":null,"url":null,"abstract":"<div><div>In this study, a qualitative discrimination and quantitative prediction model for the rapid and effective identification of <em>Chrysanthemi Flos</em> (CF) was developed utilizing near-infrared spectroscopy (NIRS) in conjunction with ultra-performance liquid chromatography (UPLC) and multivariate algorithms. The contents of six active constituents-chlorogenic acid, luteolin-7-<em>O</em>-<em>β</em>-D-glucoside, 3,5-<em>O</em>-dicaffeoylquinic acid, tilianin, apigenin, and acacetin-in CF were quantified using UPLC. Significant variations were observed in the contents of these components across different CF varieties. Utilizing NIRS data, the subspace clustering algorithm effectively differentiated various CF varieties, achieving an accuracy rate of 95.7 %. Subsequently, the acquired NIRS data underwent preprocessing and feature selection variables, partial least squares regression prediction models for the six active components were successfully established, with R<sup>2</sup> values exceeding 0.8 for both the training and prediction sets. In conclusion, the integration of NIRS with machine learning technology has been demonstrated to be a rapid, effective, and feasible approach for the identification of CF varieties.</div></div>","PeriodicalId":15867,"journal":{"name":"Journal of Food Composition and Analysis","volume":"148 ","pages":"Article 108334"},"PeriodicalIF":4.6000,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Rapid determination and variety identification of active ingredient contents in Chrysanthemi Flos by near-infrared spectroscopy and machine learning\",\"authors\":\"Ranran Cheng , Yangfei Ding , Dongliang Jiang , Jiuba Zhang , Mengru Wang , Yan Xu , Hongsu Zhao , Xiang Cheng , Deling Wu , Wei Zhang\",\"doi\":\"10.1016/j.jfca.2025.108334\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>In this study, a qualitative discrimination and quantitative prediction model for the rapid and effective identification of <em>Chrysanthemi Flos</em> (CF) was developed utilizing near-infrared spectroscopy (NIRS) in conjunction with ultra-performance liquid chromatography (UPLC) and multivariate algorithms. The contents of six active constituents-chlorogenic acid, luteolin-7-<em>O</em>-<em>β</em>-D-glucoside, 3,5-<em>O</em>-dicaffeoylquinic acid, tilianin, apigenin, and acacetin-in CF were quantified using UPLC. Significant variations were observed in the contents of these components across different CF varieties. Utilizing NIRS data, the subspace clustering algorithm effectively differentiated various CF varieties, achieving an accuracy rate of 95.7 %. Subsequently, the acquired NIRS data underwent preprocessing and feature selection variables, partial least squares regression prediction models for the six active components were successfully established, with R<sup>2</sup> values exceeding 0.8 for both the training and prediction sets. In conclusion, the integration of NIRS with machine learning technology has been demonstrated to be a rapid, effective, and feasible approach for the identification of CF varieties.</div></div>\",\"PeriodicalId\":15867,\"journal\":{\"name\":\"Journal of Food Composition and Analysis\",\"volume\":\"148 \",\"pages\":\"Article 108334\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Food Composition and Analysis\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0889157525011500\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Composition and Analysis","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0889157525011500","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
摘要
本研究利用近红外光谱(NIRS)、超高效液相色谱(UPLC)和多元算法,建立了快速有效鉴别菊花(chrysanthemum i Flos, CF)的定性鉴别和定量预测模型。采用超高效液相色谱法(UPLC)定量测定CF中绿原酸、木犀草素-7- o -β- d -葡萄糖苷、3,5- o -二咖啡酰奎宁酸、天青素、芹菜素和棘球虫苷6种有效成分的含量。这些成分的含量在不同品种间存在显著差异。利用近红外光谱数据,子空间聚类算法有效区分了各种CF品种,准确率达到95.7% %。随后,对获取的近红外光谱数据进行预处理和特征选择变量,成功建立了6个有效分量的偏最小二乘回归预测模型,训练集和预测集的R2值均超过0.8。综上所述,近红外光谱与机器学习技术的结合已被证明是一种快速、有效和可行的方法来识别CF品种。
Rapid determination and variety identification of active ingredient contents in Chrysanthemi Flos by near-infrared spectroscopy and machine learning
In this study, a qualitative discrimination and quantitative prediction model for the rapid and effective identification of Chrysanthemi Flos (CF) was developed utilizing near-infrared spectroscopy (NIRS) in conjunction with ultra-performance liquid chromatography (UPLC) and multivariate algorithms. The contents of six active constituents-chlorogenic acid, luteolin-7-O-β-D-glucoside, 3,5-O-dicaffeoylquinic acid, tilianin, apigenin, and acacetin-in CF were quantified using UPLC. Significant variations were observed in the contents of these components across different CF varieties. Utilizing NIRS data, the subspace clustering algorithm effectively differentiated various CF varieties, achieving an accuracy rate of 95.7 %. Subsequently, the acquired NIRS data underwent preprocessing and feature selection variables, partial least squares regression prediction models for the six active components were successfully established, with R2 values exceeding 0.8 for both the training and prediction sets. In conclusion, the integration of NIRS with machine learning technology has been demonstrated to be a rapid, effective, and feasible approach for the identification of CF varieties.
期刊介绍:
The Journal of Food Composition and Analysis publishes manuscripts on scientific aspects of data on the chemical composition of human foods, with particular emphasis on actual data on composition of foods; analytical methods; studies on the manipulation, storage, distribution and use of food composition data; and studies on the statistics, use and distribution of such data and data systems. The Journal''s basis is nutrient composition, with increasing emphasis on bioactive non-nutrient and anti-nutrient components. Papers must provide sufficient description of the food samples, analytical methods, quality control procedures and statistical treatments of the data to permit the end users of the food composition data to evaluate the appropriateness of such data in their projects.
The Journal does not publish papers on: microbiological compounds; sensory quality; aromatics/volatiles in food and wine; essential oils; organoleptic characteristics of food; physical properties; or clinical papers and pharmacology-related papers.