基于高分辨率质谱(HRMS) MS/MS数据综合模拟和相似度评分的代谢物注释质量评价

IF 3.8 2区 化学 Q1 BIOCHEMICAL RESEARCH METHODS
Analytical and Bioanalytical Chemistry Pub Date : 2025-06-01 Epub Date: 2025-04-18 DOI:10.1007/s00216-025-05847-7
Yingjiao Shi, Ji Yang, Qianxu Yang, Yipeng Zhang, Zhongda Zeng
{"title":"基于高分辨率质谱(HRMS) MS/MS数据综合模拟和相似度评分的代谢物注释质量评价","authors":"Yingjiao Shi, Ji Yang, Qianxu Yang, Yipeng Zhang, Zhongda Zeng","doi":"10.1007/s00216-025-05847-7","DOIUrl":null,"url":null,"abstract":"<p><p>Metabolite annotation is a critical step in discovery metabolomics, but remains a significant challenge. In this study, the accuracy of metabolite annotation was systematically evaluated by leveraging the proposed strategies for simulation of tandem mass spectrometry (MS/MS) data from high-resolution mass spectrometry (HRMS) and then construction of a large-scale virtual database. Furthermore, various similarity scoring methods were comprehensively compared to assess the performance for annotation. First, three key characteristics that are essential for simulating MS/MS spectra to closely resemble experimental data were identified: (i) the number of mass-to-charge ratio (m/z) features, (ii) the differences between neighboring m/z values, and (iii) the intensity distribution of MS/MS features. These factors were employed to generate representative MS/MS spectra for subsequent study. A meticulously designed virtual MS/MS database was constructed to facilitate accurate annotation assessment, which covered over 100,000 metabolites with diverse structural similarities and differences. To evaluate annotation quality, two simulation strategies on the basis of strong and weak data inference were respectively proposed to replicate MS/MS spectra for unknown metabolites. These simulated spectra were then compared with the virtual database, which provided insights into the expected variations in experimental MS/MS data. Furthermore, eight similarity evaluation methods, including entropy similarity (ES) and weighted dot product (W/DP) algorithms, were rigorously evaluated for their effectiveness in metabolite annotation. The results revealed that some methods, such as ES, exhibited strong resistance to interference and broad adaptability across different MS/MS patterns, whereas others selectively yielded reliable outcomes under specific conditions. This study provided a systematic framework for quality evaluation in metabolite annotation and offered strategies to mitigate false-positive identifications. The findings held great significance for advancing metabolomics research and further improving annotation reliability in complex biological samples.</p>","PeriodicalId":462,"journal":{"name":"Analytical and Bioanalytical Chemistry","volume":" ","pages":"3061-3077"},"PeriodicalIF":3.8000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Quality evaluation of metabolite annotation based on comprehensive simulation of MS/MS data from high-resolution mass spectrometry (HRMS) and similarity scoring.\",\"authors\":\"Yingjiao Shi, Ji Yang, Qianxu Yang, Yipeng Zhang, Zhongda Zeng\",\"doi\":\"10.1007/s00216-025-05847-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Metabolite annotation is a critical step in discovery metabolomics, but remains a significant challenge. In this study, the accuracy of metabolite annotation was systematically evaluated by leveraging the proposed strategies for simulation of tandem mass spectrometry (MS/MS) data from high-resolution mass spectrometry (HRMS) and then construction of a large-scale virtual database. Furthermore, various similarity scoring methods were comprehensively compared to assess the performance for annotation. First, three key characteristics that are essential for simulating MS/MS spectra to closely resemble experimental data were identified: (i) the number of mass-to-charge ratio (m/z) features, (ii) the differences between neighboring m/z values, and (iii) the intensity distribution of MS/MS features. These factors were employed to generate representative MS/MS spectra for subsequent study. A meticulously designed virtual MS/MS database was constructed to facilitate accurate annotation assessment, which covered over 100,000 metabolites with diverse structural similarities and differences. To evaluate annotation quality, two simulation strategies on the basis of strong and weak data inference were respectively proposed to replicate MS/MS spectra for unknown metabolites. These simulated spectra were then compared with the virtual database, which provided insights into the expected variations in experimental MS/MS data. Furthermore, eight similarity evaluation methods, including entropy similarity (ES) and weighted dot product (W/DP) algorithms, were rigorously evaluated for their effectiveness in metabolite annotation. The results revealed that some methods, such as ES, exhibited strong resistance to interference and broad adaptability across different MS/MS patterns, whereas others selectively yielded reliable outcomes under specific conditions. This study provided a systematic framework for quality evaluation in metabolite annotation and offered strategies to mitigate false-positive identifications. The findings held great significance for advancing metabolomics research and further improving annotation reliability in complex biological samples.</p>\",\"PeriodicalId\":462,\"journal\":{\"name\":\"Analytical and Bioanalytical Chemistry\",\"volume\":\" \",\"pages\":\"3061-3077\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytical and Bioanalytical Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1007/s00216-025-05847-7\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/18 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical and Bioanalytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s00216-025-05847-7","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/18 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

代谢物注释是发现代谢组学的关键一步,但仍然是一个重大挑战。本研究采用高分辨率质谱(HRMS)串联质谱(MS/MS)数据模拟策略,构建大规模虚拟数据库,系统评估代谢物注释的准确性。此外,综合比较了各种相似度评分方法,以评估标注的性能。首先,确定了模拟MS/MS光谱与实验数据接近的三个关键特征:(i)质荷比(m/z)特征的数量,(ii)相邻m/z值之间的差异,以及(iii) MS/MS特征的强度分布。利用这些因素生成具有代表性的MS/MS谱图,用于后续研究。构建了一个精心设计的虚拟MS/MS数据库,以方便准确的注释评估,该数据库涵盖了100,000多种具有不同结构相似性和差异性的代谢物。为了评估注释质量,分别提出了基于强数据推断和弱数据推断的两种模拟策略来复制未知代谢物的MS/MS谱。然后将这些模拟光谱与虚拟数据库进行比较,从而深入了解实验MS/MS数据的预期变化。此外,对熵相似度(ES)和加权点积(W/DP)算法等8种相似度评价方法在代谢物注释中的有效性进行了严格评价。结果表明,一些方法,如ES,在不同的MS/MS模式中表现出很强的抗干扰性和广泛的适应性,而其他方法在特定条件下选择性地产生可靠的结果。该研究为代谢物注释的质量评估提供了一个系统框架,并提供了减少假阳性鉴定的策略。这一发现对于推进代谢组学研究,进一步提高复杂生物样本的标注可靠性具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Quality evaluation of metabolite annotation based on comprehensive simulation of MS/MS data from high-resolution mass spectrometry (HRMS) and similarity scoring.

Metabolite annotation is a critical step in discovery metabolomics, but remains a significant challenge. In this study, the accuracy of metabolite annotation was systematically evaluated by leveraging the proposed strategies for simulation of tandem mass spectrometry (MS/MS) data from high-resolution mass spectrometry (HRMS) and then construction of a large-scale virtual database. Furthermore, various similarity scoring methods were comprehensively compared to assess the performance for annotation. First, three key characteristics that are essential for simulating MS/MS spectra to closely resemble experimental data were identified: (i) the number of mass-to-charge ratio (m/z) features, (ii) the differences between neighboring m/z values, and (iii) the intensity distribution of MS/MS features. These factors were employed to generate representative MS/MS spectra for subsequent study. A meticulously designed virtual MS/MS database was constructed to facilitate accurate annotation assessment, which covered over 100,000 metabolites with diverse structural similarities and differences. To evaluate annotation quality, two simulation strategies on the basis of strong and weak data inference were respectively proposed to replicate MS/MS spectra for unknown metabolites. These simulated spectra were then compared with the virtual database, which provided insights into the expected variations in experimental MS/MS data. Furthermore, eight similarity evaluation methods, including entropy similarity (ES) and weighted dot product (W/DP) algorithms, were rigorously evaluated for their effectiveness in metabolite annotation. The results revealed that some methods, such as ES, exhibited strong resistance to interference and broad adaptability across different MS/MS patterns, whereas others selectively yielded reliable outcomes under specific conditions. This study provided a systematic framework for quality evaluation in metabolite annotation and offered strategies to mitigate false-positive identifications. The findings held great significance for advancing metabolomics research and further improving annotation reliability in complex biological samples.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
8.00
自引率
4.70%
发文量
638
审稿时长
2.1 months
期刊介绍: Analytical and Bioanalytical Chemistry’s mission is the rapid publication of excellent and high-impact research articles on fundamental and applied topics of analytical and bioanalytical measurement science. Its scope is broad, and ranges from novel measurement platforms and their characterization to multidisciplinary approaches that effectively address important scientific problems. The Editors encourage submissions presenting innovative analytical research in concept, instrumentation, methods, and/or applications, including: mass spectrometry, spectroscopy, and electroanalysis; advanced separations; analytical strategies in “-omics” and imaging, bioanalysis, and sampling; miniaturized devices, medical diagnostics, sensors; analytical characterization of nano- and biomaterials; chemometrics and advanced data analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信