Jianglei Zhang, Yu Ren, Jin Zeng, Liuwei Zhang, Ming Cai, Lili Lan, Guoxiang Sun
{"title":"An quality evaluation method based on three-dimensional integration and machine learning: Advanced data processing","authors":"Jianglei Zhang, Yu Ren, Jin Zeng, Liuwei Zhang, Ming Cai, Lili Lan, Guoxiang Sun","doi":"10.1016/j.chroma.2025.465826","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents an innovative approach for the quality evaluation of traditional Chinese medicine (TCM) by integrating three-dimensional (3D) data processing with machine learning, aimed at enhancing the efficiency and accuracy of HPLC-DAD data analysis. Through 3D data integration, multi-dimensional signals from the time and wavelength domains are transformed into two-dimensional data, simplifying the analytical process while ensuring precise quantification of component contents. Building on this foundation, dynamic time warping (DTW) and correlation optimized warping (COW) algorithms were applied to effectively resolve retention time drift across different sample batches, achieving both global and local alignment of chromatographic peak shapes. A Binary Evaluation System (BES), incorporating macro qualitative similarity (<strong><em>S</em></strong><sub>m</sub>) and macro quantitative similarity (<strong><em>P</em></strong><sub>m</sub>), was employed to provide a comprehensive assessment of the quality of TCM samples. Additionally, machine learning models such as Multiple Linear Regression (MLR), Decision Tree Regression (DTR), and Random Forest Regression (RFR) were introduced to further improve the automation and accuracy of the evaluation system. In the analysis of 20 Scutellaria baicalensis samples, the method demonstrated a prediction error range of ±0.2 % for Baicalin content. This approach not only enhances data processing efficiency and reduces experimental resource consumption but also provides a robust theoretical and technical foundation for TCM quality assessment. Ultimately, the results of this study confirm the broad applicability of 3D integration and machine learning in TCM quality control, offering innovative technical support for the modernization of TCM quality evaluation systems.</div></div>","PeriodicalId":347,"journal":{"name":"Journal of Chromatography A","volume":"1747 ","pages":"Article 465826"},"PeriodicalIF":4.0000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chromatography A","FirstCategoryId":"1","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0021967325001748","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
This study presents an innovative approach for the quality evaluation of traditional Chinese medicine (TCM) by integrating three-dimensional (3D) data processing with machine learning, aimed at enhancing the efficiency and accuracy of HPLC-DAD data analysis. Through 3D data integration, multi-dimensional signals from the time and wavelength domains are transformed into two-dimensional data, simplifying the analytical process while ensuring precise quantification of component contents. Building on this foundation, dynamic time warping (DTW) and correlation optimized warping (COW) algorithms were applied to effectively resolve retention time drift across different sample batches, achieving both global and local alignment of chromatographic peak shapes. A Binary Evaluation System (BES), incorporating macro qualitative similarity (Sm) and macro quantitative similarity (Pm), was employed to provide a comprehensive assessment of the quality of TCM samples. Additionally, machine learning models such as Multiple Linear Regression (MLR), Decision Tree Regression (DTR), and Random Forest Regression (RFR) were introduced to further improve the automation and accuracy of the evaluation system. In the analysis of 20 Scutellaria baicalensis samples, the method demonstrated a prediction error range of ±0.2 % for Baicalin content. This approach not only enhances data processing efficiency and reduces experimental resource consumption but also provides a robust theoretical and technical foundation for TCM quality assessment. Ultimately, the results of this study confirm the broad applicability of 3D integration and machine learning in TCM quality control, offering innovative technical support for the modernization of TCM quality evaluation systems.
期刊介绍:
The Journal of Chromatography A provides a forum for the publication of original research and critical reviews on all aspects of fundamental and applied separation science. The scope of the journal includes chromatography and related techniques, electromigration techniques (e.g. electrophoresis, electrochromatography), hyphenated and other multi-dimensional techniques, sample preparation, and detection methods such as mass spectrometry. Contributions consist mainly of research papers dealing with the theory of separation methods, instrumental developments and analytical and preparative applications of general interest.