Nerilson M. Lima, Luana A. Pereira, Lucas S. Tironi, Matheus P. G. do Carmo, Milbya L. Costa, Renato A. Oliveira, Salva Asghar, Vinicius Fortes da Silva
{"title":"From MS1 to Structure: A Van Krevelen–DBE–Aromaticity-Based Framework for Annotating Specialized Metabolites via High-Resolution Mass Spectrometry","authors":"Nerilson M. Lima, Luana A. Pereira, Lucas S. Tironi, Matheus P. G. do Carmo, Milbya L. Costa, Renato A. Oliveira, Salva Asghar, Vinicius Fortes da Silva","doi":"10.1002/rcm.10145","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Rationale</h3>\n \n <p>Classifying specialized metabolites in untargeted metabolomics remains a major challenge, particularly when relying solely on high-resolution mass spectrometry (HRMS) data at the MS1 level. Traditional approaches using Van Krevelen diagrams often lack sufficient resolution to distinguish structurally similar metabolite classes.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We developed a chemoinformatic framework that combines Van Krevelen analysis (H/C vs. O/C) with double bond equivalent (DBE) calculations to refine metabolite class annotation at Level 3 of the Metabolomics Standards Initiative (MSI). Molecular formulas were retrieved from curated structure databases and natural product repositories, and DBE values were used to refine structural classification. A dataset of over 600 curated molecular formulas representing phenolics, alkaloids, and isoprenoids was analyzed to define class-specific patterns.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The combined use of DBE and Van Krevelen plots enabled improved discrimination between overlapping metabolite classes, including flavonoids, phenolic acids, coumarins, and tannins. Our framework revealed structural trends associated with aromaticity and unsaturation that are not captured by conventional MS1-based tools. It outperforms existing Level 3 annotation strategies that rely on in silico MS/MS fragmentation or substructure matching. A case study using <i>Eugenia jambolana</i> fruit extract validated the method, revealing dominant classes such as flavonoids, phenolic acids, and tannins using only MS1 data.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>This is the first scalable framework to annotate specialized metabolites from MS1 data alone using integrated elemental ratios and structural descriptors. It enhances the annotation confidence for untargeted metabolomics, especially in complex, undercharacterized plant matrices, without requiring MS2 fragmentation.</p>\n </section>\n </div>","PeriodicalId":225,"journal":{"name":"Rapid Communications in Mass Spectrometry","volume":"39 24","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/rcm.10145","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Rapid Communications in Mass Spectrometry","FirstCategoryId":"92","ListUrlMain":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/rcm.10145","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Rationale
Classifying specialized metabolites in untargeted metabolomics remains a major challenge, particularly when relying solely on high-resolution mass spectrometry (HRMS) data at the MS1 level. Traditional approaches using Van Krevelen diagrams often lack sufficient resolution to distinguish structurally similar metabolite classes.
Methods
We developed a chemoinformatic framework that combines Van Krevelen analysis (H/C vs. O/C) with double bond equivalent (DBE) calculations to refine metabolite class annotation at Level 3 of the Metabolomics Standards Initiative (MSI). Molecular formulas were retrieved from curated structure databases and natural product repositories, and DBE values were used to refine structural classification. A dataset of over 600 curated molecular formulas representing phenolics, alkaloids, and isoprenoids was analyzed to define class-specific patterns.
Results
The combined use of DBE and Van Krevelen plots enabled improved discrimination between overlapping metabolite classes, including flavonoids, phenolic acids, coumarins, and tannins. Our framework revealed structural trends associated with aromaticity and unsaturation that are not captured by conventional MS1-based tools. It outperforms existing Level 3 annotation strategies that rely on in silico MS/MS fragmentation or substructure matching. A case study using Eugenia jambolana fruit extract validated the method, revealing dominant classes such as flavonoids, phenolic acids, and tannins using only MS1 data.
Conclusions
This is the first scalable framework to annotate specialized metabolites from MS1 data alone using integrated elemental ratios and structural descriptors. It enhances the annotation confidence for untargeted metabolomics, especially in complex, undercharacterized plant matrices, without requiring MS2 fragmentation.
期刊介绍:
Rapid Communications in Mass Spectrometry is a journal whose aim is the rapid publication of original research results and ideas on all aspects of the science of gas-phase ions; it covers all the associated scientific disciplines. There is no formal limit on paper length ("rapid" is not synonymous with "brief"), but papers should be of a length that is commensurate with the importance and complexity of the results being reported. Contributions may be theoretical or practical in nature; they may deal with methods, techniques and applications, or with the interpretation of results; they may cover any area in science that depends directly on measurements made upon gaseous ions or that is associated with such measurements.