An quality evaluation method based on three-dimensional integration and machine learning: Advanced data processing

IF 4 2区 化学 Q1 BIOCHEMICAL RESEARCH METHODS
Jianglei Zhang, Yu Ren, Jin Zeng, Liuwei Zhang, Ming Cai, Lili Lan, Guoxiang Sun
{"title":"An quality evaluation method based on three-dimensional integration and machine learning: Advanced data processing","authors":"Jianglei Zhang,&nbsp;Yu Ren,&nbsp;Jin Zeng,&nbsp;Liuwei Zhang,&nbsp;Ming Cai,&nbsp;Lili Lan,&nbsp;Guoxiang Sun","doi":"10.1016/j.chroma.2025.465826","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents an innovative approach for the quality evaluation of traditional Chinese medicine (TCM) by integrating three-dimensional (3D) data processing with machine learning, aimed at enhancing the efficiency and accuracy of HPLC-DAD data analysis. Through 3D data integration, multi-dimensional signals from the time and wavelength domains are transformed into two-dimensional data, simplifying the analytical process while ensuring precise quantification of component contents. Building on this foundation, dynamic time warping (DTW) and correlation optimized warping (COW) algorithms were applied to effectively resolve retention time drift across different sample batches, achieving both global and local alignment of chromatographic peak shapes. A Binary Evaluation System (BES), incorporating macro qualitative similarity (<strong><em>S</em></strong><sub>m</sub>) and macro quantitative similarity (<strong><em>P</em></strong><sub>m</sub>), was employed to provide a comprehensive assessment of the quality of TCM samples. Additionally, machine learning models such as Multiple Linear Regression (MLR), Decision Tree Regression (DTR), and Random Forest Regression (RFR) were introduced to further improve the automation and accuracy of the evaluation system. In the analysis of 20 Scutellaria baicalensis samples, the method demonstrated a prediction error range of ±0.2 % for Baicalin content. This approach not only enhances data processing efficiency and reduces experimental resource consumption but also provides a robust theoretical and technical foundation for TCM quality assessment. Ultimately, the results of this study confirm the broad applicability of 3D integration and machine learning in TCM quality control, offering innovative technical support for the modernization of TCM quality evaluation systems.</div></div>","PeriodicalId":347,"journal":{"name":"Journal of Chromatography A","volume":"1747 ","pages":"Article 465826"},"PeriodicalIF":4.0000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chromatography A","FirstCategoryId":"1","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0021967325001748","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

This study presents an innovative approach for the quality evaluation of traditional Chinese medicine (TCM) by integrating three-dimensional (3D) data processing with machine learning, aimed at enhancing the efficiency and accuracy of HPLC-DAD data analysis. Through 3D data integration, multi-dimensional signals from the time and wavelength domains are transformed into two-dimensional data, simplifying the analytical process while ensuring precise quantification of component contents. Building on this foundation, dynamic time warping (DTW) and correlation optimized warping (COW) algorithms were applied to effectively resolve retention time drift across different sample batches, achieving both global and local alignment of chromatographic peak shapes. A Binary Evaluation System (BES), incorporating macro qualitative similarity (Sm) and macro quantitative similarity (Pm), was employed to provide a comprehensive assessment of the quality of TCM samples. Additionally, machine learning models such as Multiple Linear Regression (MLR), Decision Tree Regression (DTR), and Random Forest Regression (RFR) were introduced to further improve the automation and accuracy of the evaluation system. In the analysis of 20 Scutellaria baicalensis samples, the method demonstrated a prediction error range of ±0.2 % for Baicalin content. This approach not only enhances data processing efficiency and reduces experimental resource consumption but also provides a robust theoretical and technical foundation for TCM quality assessment. Ultimately, the results of this study confirm the broad applicability of 3D integration and machine learning in TCM quality control, offering innovative technical support for the modernization of TCM quality evaluation systems.
基于三维集成和机器学习的质量评估方法:高级数据处理
本研究提出了一种将三维数据处理与机器学习相结合的中药质量评价方法,旨在提高HPLC-DAD数据分析的效率和准确性。通过三维数据集成,将时间域和波长域的多维信号转换为二维数据,简化了分析过程,同时保证了组分含量的精确量化。在此基础上,应用动态时间翘曲(DTW)和相关优化翘曲(COW)算法有效解决不同样品批次间的保留时间漂移,实现色谱峰形状的全局和局部对齐。采用宏观定性相似度(Sm)和宏观定量相似度(Pm)相结合的二元评价体系(BES)对中药样品的质量进行综合评价。此外,还引入了多元线性回归(MLR)、决策树回归(DTR)、随机森林回归(RFR)等机器学习模型,进一步提高了评价系统的自动化程度和准确性。在对20份黄芩样品的分析中,该方法对黄芩苷含量的预测误差为±0.2%。该方法不仅提高了数据处理效率,减少了实验资源消耗,而且为中药质量评价提供了坚实的理论和技术基础。最终,本研究结果证实了3D集成和机器学习在中医药质量控制中的广泛适用性,为中医药质量评价体系的现代化提供了创新的技术支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Chromatography A
Journal of Chromatography A 化学-分析化学
CiteScore
7.90
自引率
14.60%
发文量
742
审稿时长
45 days
期刊介绍: The Journal of Chromatography A provides a forum for the publication of original research and critical reviews on all aspects of fundamental and applied separation science. The scope of the journal includes chromatography and related techniques, electromigration techniques (e.g. electrophoresis, electrochromatography), hyphenated and other multi-dimensional techniques, sample preparation, and detection methods such as mass spectrometry. Contributions consist mainly of research papers dealing with the theory of separation methods, instrumental developments and analytical and preparative applications of general interest.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信