Integrating metabolite-based molecular networking with database matching and LC-MS-guided targeted isolation for the discovery of novel chemical constituents: application to Euphorbia helioscopia L.
Tianren Wu, Yaling Deng, Weijia Hu, Caihong Bai, Han Yu, Kaifeng Hu
{"title":"Integrating metabolite-based molecular networking with database matching and LC-MS-guided targeted isolation for the discovery of novel chemical constituents: application to Euphorbia helioscopia L.","authors":"Tianren Wu, Yaling Deng, Weijia Hu, Caihong Bai, Han Yu, Kaifeng Hu","doi":"10.1007/s00216-025-05893-1","DOIUrl":null,"url":null,"abstract":"<p><p>Molecular networking (MN) analysis facilitates the targeted discovery of novel constituents and enhances the understanding of natural products. While various molecular networks could reduce the effects of redundant nodes, current research is still limited by the interference from the same and coeluted metabolites, including isotopic peaks, a variety of adduct ions, in-source fragmentations, and dehydration. This research proposes a novel strategy: stratified precursor lists (SPLs)-guided Metabolite-Based Molecular Networking (MBMN), which ensures a high-quality MS<sup>2</sup> spectrum for each metabolite precursor due to the absence of retention time overlap with other coeluted metabolites, and each node represents a unique metabolite. By collecting over 40 MS<sup>2</sup> databases from multiple online platforms and public databases, an integrated MS<sup>2</sup> database (IM2DB) containing more than two million MS<sup>2</sup> fragmentation data was constructed. In addition, a customized MS<sup>1</sup> database (M1DB) of reported compounds was also created. Nodes representing known compounds were annotated compared to the IM2DB and M1DB. Combining with MBMN analysis significantly enhances compound identification and characterization, thereby facilitating the discerning of potential novel constituents. To demonstrate the applicability of this strategy, we selected Euphorbia helioscopia L. as an example. 135 nodes were annotated, and three reference nodes were obtained. From the selected 35 target nodes, 10 purified compounds were isolated and elucidated. Among these, three were identified as novel compounds, while the remaining nine were discovered for the first time in Euphorbia helioscopia L. By using this strategy, we can effectively minimize the interference from redundant nodes and discover potentially new compounds.</p>","PeriodicalId":462,"journal":{"name":"Analytical and Bioanalytical Chemistry","volume":" ","pages":"3685-3702"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical and Bioanalytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s00216-025-05893-1","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/10 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Molecular networking (MN) analysis facilitates the targeted discovery of novel constituents and enhances the understanding of natural products. While various molecular networks could reduce the effects of redundant nodes, current research is still limited by the interference from the same and coeluted metabolites, including isotopic peaks, a variety of adduct ions, in-source fragmentations, and dehydration. This research proposes a novel strategy: stratified precursor lists (SPLs)-guided Metabolite-Based Molecular Networking (MBMN), which ensures a high-quality MS2 spectrum for each metabolite precursor due to the absence of retention time overlap with other coeluted metabolites, and each node represents a unique metabolite. By collecting over 40 MS2 databases from multiple online platforms and public databases, an integrated MS2 database (IM2DB) containing more than two million MS2 fragmentation data was constructed. In addition, a customized MS1 database (M1DB) of reported compounds was also created. Nodes representing known compounds were annotated compared to the IM2DB and M1DB. Combining with MBMN analysis significantly enhances compound identification and characterization, thereby facilitating the discerning of potential novel constituents. To demonstrate the applicability of this strategy, we selected Euphorbia helioscopia L. as an example. 135 nodes were annotated, and three reference nodes were obtained. From the selected 35 target nodes, 10 purified compounds were isolated and elucidated. Among these, three were identified as novel compounds, while the remaining nine were discovered for the first time in Euphorbia helioscopia L. By using this strategy, we can effectively minimize the interference from redundant nodes and discover potentially new compounds.
期刊介绍:
Analytical and Bioanalytical Chemistry’s mission is the rapid publication of excellent and high-impact research articles on fundamental and applied topics of analytical and bioanalytical measurement science. Its scope is broad, and ranges from novel measurement platforms and their characterization to multidisciplinary approaches that effectively address important scientific problems. The Editors encourage submissions presenting innovative analytical research in concept, instrumentation, methods, and/or applications, including: mass spectrometry, spectroscopy, and electroanalysis; advanced separations; analytical strategies in “-omics” and imaging, bioanalysis, and sampling; miniaturized devices, medical diagnostics, sensors; analytical characterization of nano- and biomaterials; chemometrics and advanced data analysis.