Julien Cordonnier, Dr. Simon Remy, Prof. Dr. Jean-Hugues Renault, Dr. Jean-Marc Nuzillard
{"title":"Versa DB: Assisting 13C NMR and MS/MS Joint Data Annotation Through On-Demand Databases","authors":"Julien Cordonnier, Dr. Simon Remy, Prof. Dr. Jean-Hugues Renault, Dr. Jean-Marc Nuzillard","doi":"10.1002/cmtd.202300020","DOIUrl":null,"url":null,"abstract":"<p>Compound identification in complex mixtures by NMR and MS is best achieved through experimental databases (DB) mining. Experimental DB frequently show limitations regarding their completeness, availability or data quality, thus making predicted database of increasing common use. Querying large databases may lead to select unlikely structure candidates. Two approaches to dereplication are thus possible: filtering of a large DB before search or scoring of the results after a large scale search. The present work relies on the former approach. As far as we know, nmrshiftdb2 is the only open-source <sup>13</sup>NMR chemical shift predictor that can be freely operated in batch mode. CFM-ID 4.0 is one of the best-performing open-source tools for ESI-MS/MS spectra prediction. LOTUS is a freely usable and comprehensive collection of secondary metabolites. Integrating the open source database and software LOTUS, CFM-ID, and nmrshiftdb2 in a dereplication workflow requires presently programming skills, owing to the diversity of data encoding and processing procedures. A graphical user interface that integrates seamlessly chemical structure collection, spectral data prediction and database building still does not exist, as far as we know. The present work proposes a stand–alone software tool that assists the identification of mixture components in a simple way.</p>","PeriodicalId":72562,"journal":{"name":"Chemistry methods : new approaches to solving problems in chemistry","volume":null,"pages":null},"PeriodicalIF":6.1000,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cmtd.202300020","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemistry methods : new approaches to solving problems in chemistry","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cmtd.202300020","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Compound identification in complex mixtures by NMR and MS is best achieved through experimental databases (DB) mining. Experimental DB frequently show limitations regarding their completeness, availability or data quality, thus making predicted database of increasing common use. Querying large databases may lead to select unlikely structure candidates. Two approaches to dereplication are thus possible: filtering of a large DB before search or scoring of the results after a large scale search. The present work relies on the former approach. As far as we know, nmrshiftdb2 is the only open-source 13NMR chemical shift predictor that can be freely operated in batch mode. CFM-ID 4.0 is one of the best-performing open-source tools for ESI-MS/MS spectra prediction. LOTUS is a freely usable and comprehensive collection of secondary metabolites. Integrating the open source database and software LOTUS, CFM-ID, and nmrshiftdb2 in a dereplication workflow requires presently programming skills, owing to the diversity of data encoding and processing procedures. A graphical user interface that integrates seamlessly chemical structure collection, spectral data prediction and database building still does not exist, as far as we know. The present work proposes a stand–alone software tool that assists the identification of mixture components in a simple way.