NMRFinder: a novel method for 1D 1H-NMR metabolite annotation.

Sara Cardoso, Débora Cabral, Marcelo Maraschin, Miguel Rocha
{"title":"NMRFinder: a novel method for 1D <sup>1</sup>H-NMR metabolite annotation.","authors":"Sara Cardoso,&nbsp;Débora Cabral,&nbsp;Marcelo Maraschin,&nbsp;Miguel Rocha","doi":"10.1007/s11306-021-01772-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Methods for the automated and accurate identification of metabolites in 1D <sup>1</sup>H-NMR samples are crucial, but this is still an unsolved problem. Most available tools are mainly focused on metabolite quantification, thus limiting the number of metabolites that can be identified. Also, most only use reference spectra obtained under the same specific conditions of the target sample, limiting the use of available knowledge.</p><p><strong>Objectives: </strong>The main goal of this work was to develop novel methods to perform metabolite annotation from 1D <sup>1</sup>H-NMR peaks with enhanced reliability, to aid the users in metabolite identification. An essential step was to construct a vast and up-do-date library of reference 1D <sup>1</sup>H-NMR peak lists collected under distinct experimental conditions.</p><p><strong>Methods: </strong>Three different algorithms were evaluated for their capacity to correctly annotate metabolites present in both synthetic and real samples and compared to publicly available tools. The best proposed method was evaluated in a plethora of scenarios, including missing references, missing peaks and peak shifts, to assess its annotation accuracy, precision and recall.</p><p><strong>Results: </strong>We gathered 1816 peak lists for 1387 different metabolites from several sources across different conditions for our reference library. A new method, NMRFinder, is proposed and allows matching 1D <sup>1</sup>H-NMR samples with all the reference peak lists in the library, regardless of acquisition conditions. Metabolites are scored according to the number of peaks matching the samples, how unique their peaks are in the library and how close the spectrum acquisition conditions are in relation to those of the samples. Results show a true positive rate of 0.984 when analysing computationally created samples, while 71.8% of the metabolites were annotated when analysing samples from previously identified public datasets.</p><p><strong>Conclusion: </strong>NMRFinder performs metabolite annotation reliably and outperforms previous methods, being of great value in helping the user to ultimately identify metabolites. It is implemented in the R package specmine.</p>","PeriodicalId":144887,"journal":{"name":"Metabolomics : Official journal of the Metabolomic Society","volume":" ","pages":"21"},"PeriodicalIF":0.0000,"publicationDate":"2021-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s11306-021-01772-9","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Metabolomics : Official journal of the Metabolomic Society","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s11306-021-01772-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Introduction: Methods for the automated and accurate identification of metabolites in 1D 1H-NMR samples are crucial, but this is still an unsolved problem. Most available tools are mainly focused on metabolite quantification, thus limiting the number of metabolites that can be identified. Also, most only use reference spectra obtained under the same specific conditions of the target sample, limiting the use of available knowledge.

Objectives: The main goal of this work was to develop novel methods to perform metabolite annotation from 1D 1H-NMR peaks with enhanced reliability, to aid the users in metabolite identification. An essential step was to construct a vast and up-do-date library of reference 1D 1H-NMR peak lists collected under distinct experimental conditions.

Methods: Three different algorithms were evaluated for their capacity to correctly annotate metabolites present in both synthetic and real samples and compared to publicly available tools. The best proposed method was evaluated in a plethora of scenarios, including missing references, missing peaks and peak shifts, to assess its annotation accuracy, precision and recall.

Results: We gathered 1816 peak lists for 1387 different metabolites from several sources across different conditions for our reference library. A new method, NMRFinder, is proposed and allows matching 1D 1H-NMR samples with all the reference peak lists in the library, regardless of acquisition conditions. Metabolites are scored according to the number of peaks matching the samples, how unique their peaks are in the library and how close the spectrum acquisition conditions are in relation to those of the samples. Results show a true positive rate of 0.984 when analysing computationally created samples, while 71.8% of the metabolites were annotated when analysing samples from previously identified public datasets.

Conclusion: NMRFinder performs metabolite annotation reliably and outperforms previous methods, being of great value in helping the user to ultimately identify metabolites. It is implemented in the R package specmine.

NMRFinder:一种新的1D 1H-NMR代谢物注释方法。
1D 1H-NMR样品中代谢物的自动准确鉴定方法至关重要,但这仍然是一个未解决的问题。大多数可用的工具主要集中在代谢物的量化上,从而限制了可以识别的代谢物的数量。此外,大多数只使用在目标样品的相同特定条件下获得的参考光谱,限制了现有知识的使用。目的:本工作的主要目的是开发新的方法,从1D 1H-NMR峰进行代谢物注释,提高可靠性,以帮助用户识别代谢物。一个重要的步骤是建立一个庞大的、最新的参考1D 1H-NMR峰表库,这些峰表是在不同的实验条件下收集的。方法:评估了三种不同的算法正确注释合成和真实样品中代谢物的能力,并与公开可用的工具进行了比较。在缺失参考文献、缺失峰和峰移位等多种情况下,对所提出的最佳方法进行了评估,以评估其标注的准确性、精密度和召回率。结果:我们收集了1387种不同代谢物在不同条件下的1816个峰表作为参考库。提出了一种新的方法NMRFinder,该方法允许将1D 1H-NMR样品与库中的所有参考峰表进行匹配,而不管采集条件如何。代谢物根据与样品匹配的峰的数量、它们的峰在文库中的独特程度以及与样品的光谱获取条件的接近程度进行评分。结果显示,在分析计算生成的样本时,真阳性率为0.984,而在分析先前确定的公共数据集的样本时,71.8%的代谢物被注释。结论:NMRFinder对代谢物进行了可靠的标注,优于以往的方法,在帮助用户最终识别代谢物方面具有重要价值。它在R包规范中实现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信