From MS1 to Structure: A Van Krevelen–DBE–Aromaticity-Based Framework for Annotating Specialized Metabolites via High-Resolution Mass Spectrometry

IF 1.7 3区 化学 Q4 BIOCHEMICAL RESEARCH METHODS
Nerilson M. Lima, Luana A. Pereira, Lucas S. Tironi, Matheus P. G. do Carmo, Milbya L. Costa, Renato A. Oliveira, Salva Asghar, Vinicius Fortes da Silva
{"title":"From MS1 to Structure: A Van Krevelen–DBE–Aromaticity-Based Framework for Annotating Specialized Metabolites via High-Resolution Mass Spectrometry","authors":"Nerilson M. Lima,&nbsp;Luana A. Pereira,&nbsp;Lucas S. Tironi,&nbsp;Matheus P. G. do Carmo,&nbsp;Milbya L. Costa,&nbsp;Renato A. Oliveira,&nbsp;Salva Asghar,&nbsp;Vinicius Fortes da Silva","doi":"10.1002/rcm.10145","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Rationale</h3>\n \n <p>Classifying specialized metabolites in untargeted metabolomics remains a major challenge, particularly when relying solely on high-resolution mass spectrometry (HRMS) data at the MS1 level. Traditional approaches using Van Krevelen diagrams often lack sufficient resolution to distinguish structurally similar metabolite classes.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We developed a chemoinformatic framework that combines Van Krevelen analysis (H/C vs. O/C) with double bond equivalent (DBE) calculations to refine metabolite class annotation at Level 3 of the Metabolomics Standards Initiative (MSI). Molecular formulas were retrieved from curated structure databases and natural product repositories, and DBE values were used to refine structural classification. A dataset of over 600 curated molecular formulas representing phenolics, alkaloids, and isoprenoids was analyzed to define class-specific patterns.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>The combined use of DBE and Van Krevelen plots enabled improved discrimination between overlapping metabolite classes, including flavonoids, phenolic acids, coumarins, and tannins. Our framework revealed structural trends associated with aromaticity and unsaturation that are not captured by conventional MS1-based tools. It outperforms existing Level 3 annotation strategies that rely on in silico MS/MS fragmentation or substructure matching. A case study using <i>Eugenia jambolana</i> fruit extract validated the method, revealing dominant classes such as flavonoids, phenolic acids, and tannins using only MS1 data.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>This is the first scalable framework to annotate specialized metabolites from MS1 data alone using integrated elemental ratios and structural descriptors. It enhances the annotation confidence for untargeted metabolomics, especially in complex, undercharacterized plant matrices, without requiring MS2 fragmentation.</p>\n </section>\n </div>","PeriodicalId":225,"journal":{"name":"Rapid Communications in Mass Spectrometry","volume":"39 24","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/rcm.10145","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Rapid Communications in Mass Spectrometry","FirstCategoryId":"92","ListUrlMain":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/10.1002/rcm.10145","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Rationale

Classifying specialized metabolites in untargeted metabolomics remains a major challenge, particularly when relying solely on high-resolution mass spectrometry (HRMS) data at the MS1 level. Traditional approaches using Van Krevelen diagrams often lack sufficient resolution to distinguish structurally similar metabolite classes.

Methods

We developed a chemoinformatic framework that combines Van Krevelen analysis (H/C vs. O/C) with double bond equivalent (DBE) calculations to refine metabolite class annotation at Level 3 of the Metabolomics Standards Initiative (MSI). Molecular formulas were retrieved from curated structure databases and natural product repositories, and DBE values were used to refine structural classification. A dataset of over 600 curated molecular formulas representing phenolics, alkaloids, and isoprenoids was analyzed to define class-specific patterns.

Results

The combined use of DBE and Van Krevelen plots enabled improved discrimination between overlapping metabolite classes, including flavonoids, phenolic acids, coumarins, and tannins. Our framework revealed structural trends associated with aromaticity and unsaturation that are not captured by conventional MS1-based tools. It outperforms existing Level 3 annotation strategies that rely on in silico MS/MS fragmentation or substructure matching. A case study using Eugenia jambolana fruit extract validated the method, revealing dominant classes such as flavonoids, phenolic acids, and tannins using only MS1 data.

Conclusions

This is the first scalable framework to annotate specialized metabolites from MS1 data alone using integrated elemental ratios and structural descriptors. It enhances the annotation confidence for untargeted metabolomics, especially in complex, undercharacterized plant matrices, without requiring MS2 fragmentation.

Abstract Image

Abstract Image

Abstract Image

Abstract Image

从MS1到结构:一个基于Van krevelen - dbe芳香性的框架,通过高分辨率质谱法注释专门的代谢物
在非靶向代谢组学中对特定代谢物进行分类仍然是一个主要挑战,特别是当仅依赖MS1水平的高分辨率质谱(HRMS)数据时。使用Van Krevelen图的传统方法通常缺乏足够的分辨率来区分结构相似的代谢物类别。我们开发了一个化学信息学框架,将Van Krevelen分析(H/C vs. O/C)与双键当量(DBE)计算相结合,以完善代谢组学标准倡议(MSI)第3级的代谢物类别注释。分子式从精心策划的结构数据库和天然产物库中检索,并使用DBE值来细化结构分类。研究人员分析了600多个分子式的数据集,这些分子式代表了酚类、生物碱和类异戊二烯,以确定类特定的模式。结果DBE和Van Krevelen图的联合使用可以提高对重叠代谢物类别的区分,包括黄酮类、酚酸类、香豆素类和单宁类。我们的框架揭示了传统的基于ms1的工具无法捕捉到的与芳香性和不饱和性相关的结构趋势。它优于现有的依赖于计算机MS/MS碎片或子结构匹配的Level 3注释策略。一项使用白桦果提取物的案例研究验证了该方法,仅使用MS1数据就揭示了黄酮类、酚酸类和单宁类等主要类别。这是第一个可扩展的框架,可以单独使用综合元素比率和结构描述符从MS1数据中注释专门的代谢物。它提高了对非靶向代谢组学的注释可信度,特别是在复杂的、未充分表征的植物基质中,而不需要MS2片段化。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.10
自引率
5.00%
发文量
219
审稿时长
2.6 months
期刊介绍: Rapid Communications in Mass Spectrometry is a journal whose aim is the rapid publication of original research results and ideas on all aspects of the science of gas-phase ions; it covers all the associated scientific disciplines. There is no formal limit on paper length ("rapid" is not synonymous with "brief"), but papers should be of a length that is commensurate with the importance and complexity of the results being reported. Contributions may be theoretical or practical in nature; they may deal with methods, techniques and applications, or with the interpretation of results; they may cover any area in science that depends directly on measurements made upon gaseous ions or that is associated with such measurements.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信