Data fusion of Fourier transform infrared spectroscopy and high-performance liquid chromatography for the origin identification of different medicinal rhizomes of genus Atractylodes
Hongfei Wu , Mingjun Wang , Zhiming Zeng , Changyun Dai , Feilong Ren , Hongbo Yin , Lu Chen
{"title":"Data fusion of Fourier transform infrared spectroscopy and high-performance liquid chromatography for the origin identification of different medicinal rhizomes of genus Atractylodes","authors":"Hongfei Wu , Mingjun Wang , Zhiming Zeng , Changyun Dai , Feilong Ren , Hongbo Yin , Lu Chen","doi":"10.1016/j.microc.2025.113110","DOIUrl":null,"url":null,"abstract":"<div><div>The rhizomes of the genus <em>Atractylodes</em>, such as <em>A. lancea</em>, <em>A. chinensis</em>, <em>A. japonica</em>, <em>A. coreana</em>, and <em>A. macrocephala</em>, have been extensively utilized as prominent traditional herbal medicines across China, Japan, South Korea and other Asian countries. Due to the close genetic relationships and morphological similarities, confusion and misuse frequently arise. Although various methods exist to identify <em>Atractylodes</em> species based on their chemical profiles, they often focus on a limited subset, overlooking species like <em>A. coreana</em> which are frequently observed in the traditional herbal medicine market in northern China and limiting comprehensive identification. In this study, data fusion of spectral and chromatographic data was used for the first time to identify five different medicinal rhizomes derived from genus <em>Atractylodes</em>. It also serves as a preliminary exploration of the integration of Fourier Transform Infrared Spectroscopy (FTIR), High-Performance Liquid Chromatography (HPLC) fingerprinting, and chemical pattern recognition within the domain of origin identification of traditional herbal medicines. Our study demonstrated that the mid-level data fusion of FTIR and HPLC data using partial least squares-discriminant analysis (PLS-DA) and t-distributed stochastic neighbor embedding (t-SNE) constituted an effective approach for identification. While PLS-DA excels in supervised classification, t-SNE complements it by offering intuitive visualization of high-dimensional data, revealing clustering patterns among the species. The 81 batches of dried rhizomes from five species of <em>Atractylodes</em> were divided into training and prediction sets at a 2:1 ratio, employing the K-S algorithm, achieving a prediction accuracy of 100%. The integration of t-SNE further confirmed the separation achieved by PLS-DA, enhancing the interpretability of the classification results and highlighting the potential of data fusion combined with advanced visualization techniques in distinguishing closely related herbal species. Additionally, the results showed that the chemical differences of <em>Atractylodes</em> among various varieties were mainly reflected in polysaccharides, alkynes, and ketones, the chemical composition of <em>A. macrocephala</em> was very different from that of other species, while <em>A. japonica</em> was close to that of <em>A. coreana</em>. This may indicate the genetic distance among them. It can successfully distinguish the five often-confused medicinal rhizomes of <em>Atractylodes</em>, achieving a prediction accuracy of 100%. This study presents a feasible approach for identifying five closely related medicinal rhizomes of <em>Atractylodes</em> using data fusion, demonstrating its potential in addressing challenges associated with distinguishing morphologically similar herbal medicines.</div></div>","PeriodicalId":391,"journal":{"name":"Microchemical Journal","volume":"211 ","pages":"Article 113110"},"PeriodicalIF":4.9000,"publicationDate":"2025-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microchemical Journal","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0026265X25004643","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0
Abstract
The rhizomes of the genus Atractylodes, such as A. lancea, A. chinensis, A. japonica, A. coreana, and A. macrocephala, have been extensively utilized as prominent traditional herbal medicines across China, Japan, South Korea and other Asian countries. Due to the close genetic relationships and morphological similarities, confusion and misuse frequently arise. Although various methods exist to identify Atractylodes species based on their chemical profiles, they often focus on a limited subset, overlooking species like A. coreana which are frequently observed in the traditional herbal medicine market in northern China and limiting comprehensive identification. In this study, data fusion of spectral and chromatographic data was used for the first time to identify five different medicinal rhizomes derived from genus Atractylodes. It also serves as a preliminary exploration of the integration of Fourier Transform Infrared Spectroscopy (FTIR), High-Performance Liquid Chromatography (HPLC) fingerprinting, and chemical pattern recognition within the domain of origin identification of traditional herbal medicines. Our study demonstrated that the mid-level data fusion of FTIR and HPLC data using partial least squares-discriminant analysis (PLS-DA) and t-distributed stochastic neighbor embedding (t-SNE) constituted an effective approach for identification. While PLS-DA excels in supervised classification, t-SNE complements it by offering intuitive visualization of high-dimensional data, revealing clustering patterns among the species. The 81 batches of dried rhizomes from five species of Atractylodes were divided into training and prediction sets at a 2:1 ratio, employing the K-S algorithm, achieving a prediction accuracy of 100%. The integration of t-SNE further confirmed the separation achieved by PLS-DA, enhancing the interpretability of the classification results and highlighting the potential of data fusion combined with advanced visualization techniques in distinguishing closely related herbal species. Additionally, the results showed that the chemical differences of Atractylodes among various varieties were mainly reflected in polysaccharides, alkynes, and ketones, the chemical composition of A. macrocephala was very different from that of other species, while A. japonica was close to that of A. coreana. This may indicate the genetic distance among them. It can successfully distinguish the five often-confused medicinal rhizomes of Atractylodes, achieving a prediction accuracy of 100%. This study presents a feasible approach for identifying five closely related medicinal rhizomes of Atractylodes using data fusion, demonstrating its potential in addressing challenges associated with distinguishing morphologically similar herbal medicines.
期刊介绍:
The Microchemical Journal is a peer reviewed journal devoted to all aspects and phases of analytical chemistry and chemical analysis. The Microchemical Journal publishes articles which are at the forefront of modern analytical chemistry and cover innovations in the techniques to the finest possible limits. This includes fundamental aspects, instrumentation, new developments, innovative and novel methods and applications including environmental and clinical field.
Traditional classical analytical methods such as spectrophotometry and titrimetry as well as established instrumentation methods such as flame and graphite furnace atomic absorption spectrometry, gas chromatography, and modified glassy or carbon electrode electrochemical methods will be considered, provided they show significant improvements and novelty compared to the established methods.