NAPROC-13 的化学信息学特征:天然产品 13C NMR 去复制数据库

José L., Medina-Franco, Juan F., Avellaneda-Tamayo, Naicolette A., Agudo-Muñoz, Javier E., Sánchez-Galán, José Luis, López-Pérez
{"title":"NAPROC-13 的化学信息学特征:天然产品 13C NMR 去复制数据库","authors":"José L., Medina-Franco, Juan F., Avellaneda-Tamayo, Naicolette A., Agudo-Muñoz, Javier E., Sánchez-Galán, José Luis, López-Pérez","doi":"10.26434/chemrxiv-2024-spksf-v2","DOIUrl":null,"url":null,"abstract":"Natural products (NPs) are secondary metabolites of natural origin with broad applications across various human activities, particularly discovering bioactive compounds. Structural elucidation of new NPs entails significant cost and effort. On the other hand, the dereplication of known compounds is crucial for the early exclusion of irrelevant compounds in contemporary pharmaceutical research. NAPROC-13 stands out as a publicly accessible database, providing structural and 13C NMR spectroscopic information for over 25,000 compounds, rendering it a pivotal resource in natural product (NP) research, favoring open science. This study seeks to quantitatively analyze the chemical content, structural diversity, and chemical space coverage of NPs within NAPROC-13, compared to FDA-approved drugs and a very diverse subset of NPs, UNPD-A. Findings indicated that NPs in NAPROC-13 exhibit comparable properties to those in UNPD-A, albeit showcasing a notably diverse array of structural content, scaffolds, ring systems of pharmaceutical interest, and molecular fragments. NAPROC-13 covers a specific region of the chemical multiverse (a generalization of the chemical space from different chemical representations) regarding physicochemical properties and a region as broad as UNPD-A in terms of structural features represented by fingerprints.","PeriodicalId":9813,"journal":{"name":"ChemRxiv","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Chemoinformatic characterization of NAPROC-13: A database for natural product 13C NMR dereplication\",\"authors\":\"José L., Medina-Franco, Juan F., Avellaneda-Tamayo, Naicolette A., Agudo-Muñoz, Javier E., Sánchez-Galán, José Luis, López-Pérez\",\"doi\":\"10.26434/chemrxiv-2024-spksf-v2\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Natural products (NPs) are secondary metabolites of natural origin with broad applications across various human activities, particularly discovering bioactive compounds. Structural elucidation of new NPs entails significant cost and effort. On the other hand, the dereplication of known compounds is crucial for the early exclusion of irrelevant compounds in contemporary pharmaceutical research. NAPROC-13 stands out as a publicly accessible database, providing structural and 13C NMR spectroscopic information for over 25,000 compounds, rendering it a pivotal resource in natural product (NP) research, favoring open science. This study seeks to quantitatively analyze the chemical content, structural diversity, and chemical space coverage of NPs within NAPROC-13, compared to FDA-approved drugs and a very diverse subset of NPs, UNPD-A. Findings indicated that NPs in NAPROC-13 exhibit comparable properties to those in UNPD-A, albeit showcasing a notably diverse array of structural content, scaffolds, ring systems of pharmaceutical interest, and molecular fragments. NAPROC-13 covers a specific region of the chemical multiverse (a generalization of the chemical space from different chemical representations) regarding physicochemical properties and a region as broad as UNPD-A in terms of structural features represented by fingerprints.\",\"PeriodicalId\":9813,\"journal\":{\"name\":\"ChemRxiv\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ChemRxiv\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.26434/chemrxiv-2024-spksf-v2\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ChemRxiv","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.26434/chemrxiv-2024-spksf-v2","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

天然产物(NPs)是源于自然的次级代谢产物,在人类的各种活动中有着广泛的应用,特别是在发现生物活性化合物方面。对新的 NPs 进行结构阐释需要花费大量的成本和精力。另一方面,在当代药物研究中,已知化合物的去复制对于尽早排除无关化合物至关重要。NAPROC-13 是一个可公开访问的数据库,提供 25,000 多种化合物的结构和 13C NMR 光谱信息,是天然产物(NP)研究中的重要资源,有利于开放科学的发展。本研究旨在定量分析 NAPROC-13 中 NPs 的化学成分、结构多样性和化学空间覆盖率,并与 FDA 批准的药物和非常多样化的 NPs 子集 UNPD-A 进行比较。研究结果表明,NAPROC-13 中的 NPs 与 UNPD-A 中的 NPs 具有相似的特性,但在结构内容、支架、药物环系统和分子片段方面却呈现出明显的多样性。就理化性质而言,NAPROC-13 涵盖了化学多元宇宙(从不同化学表征中概括出的化学空间)的一个特定区域,而就指纹所代表的结构特征而言,该区域与 UNPD-A 一样宽广。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Chemoinformatic characterization of NAPROC-13: A database for natural product 13C NMR dereplication
Natural products (NPs) are secondary metabolites of natural origin with broad applications across various human activities, particularly discovering bioactive compounds. Structural elucidation of new NPs entails significant cost and effort. On the other hand, the dereplication of known compounds is crucial for the early exclusion of irrelevant compounds in contemporary pharmaceutical research. NAPROC-13 stands out as a publicly accessible database, providing structural and 13C NMR spectroscopic information for over 25,000 compounds, rendering it a pivotal resource in natural product (NP) research, favoring open science. This study seeks to quantitatively analyze the chemical content, structural diversity, and chemical space coverage of NPs within NAPROC-13, compared to FDA-approved drugs and a very diverse subset of NPs, UNPD-A. Findings indicated that NPs in NAPROC-13 exhibit comparable properties to those in UNPD-A, albeit showcasing a notably diverse array of structural content, scaffolds, ring systems of pharmaceutical interest, and molecular fragments. NAPROC-13 covers a specific region of the chemical multiverse (a generalization of the chemical space from different chemical representations) regarding physicochemical properties and a region as broad as UNPD-A in terms of structural features represented by fingerprints.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信