Adding open spectral data to MassBank and PubChem using open source tools to support non-targeted exposomics of mixtures

IF 4.3 | Zone 3 (Environmental Science and Ecology) | JCR Q1 (Chemistry, Analytical)
Anjana Elapavalore, Todor Kondić, Randolph R. Singh, Benjamin A. Shoemaker, Paul A. Thiessen, Jian Zhang, Evan E. Bolton and Emma L. Schymanski
Environmental Science: Processes & Impacts, 11, 1788-1801. Published 2023-07-10. DOI: 10.1039/D3EM00181D
Article page: https://pubs.rsc.org/en/content/articlelanding/2023/em/d3em00181d
Full-text PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10648001/pdf/

Abstract


The term “exposome” is defined as a comprehensive study of life-course environmental exposures and the associated biological responses. Humans are exposed to many different chemicals, which can pose a major threat to the well-being of humanity. Targeted or non-targeted mass spectrometry techniques are widely used to identify and characterize various environmental stressors when linking exposures to human health. However, identification remains challenging due to the huge chemical space applicable to exposomics, combined with the lack of sufficient relevant entries in spectral libraries. Addressing these challenges requires cheminformatics tools and database resources to share curated open spectral data on chemicals to improve the identification of chemicals in exposomics studies. This article describes efforts to contribute spectra relevant for exposomics to the open mass spectral library MassBank (https://www.massbank.eu) using various open source software efforts, including the R packages RMassBank and Shinyscreen. The experimental spectra were obtained from ten mixtures containing toxicologically relevant chemicals from the US Environmental Protection Agency (EPA) Non-Targeted Analysis Collaborative Trial (ENTACT). Following processing and curation, 5582 spectra from 783 of the 1268 ENTACT compounds were added to MassBank, and through this to other open spectral libraries (e.g., MoNA, GNPS) for community benefit. Additionally, an automated deposition and annotation workflow was developed with PubChem to enable the display of all MassBank mass spectra in PubChem, which is rerun with each MassBank release. The new spectral records have already been used in several studies to increase the confidence in identification in non-target small molecule identification workflows applied to environmental and exposomics research.
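
The counts quoted in the abstract can be turned into two simple summary figures: the share of ENTACT compounds that ended up with at least one spectrum in MassBank, and the average number of spectra per covered compound. The following is a minimal sketch of that arithmetic only. The constant and function names are illustrative assumptions and are not taken from RMassBank, Shinyscreen, or any MassBank or PubChem API.

```python
# Counts taken directly from the abstract; all names here are
# illustrative and not part of any MassBank/PubChem tooling.
SPECTRA_ADDED = 5582            # spectra added to MassBank
COMPOUNDS_WITH_SPECTRA = 783    # ENTACT compounds with at least one spectrum
ENTACT_COMPOUNDS_TOTAL = 1268   # ENTACT compounds considered


def coverage_fraction(covered: int, total: int) -> float:
    """Fraction of compounds that received at least one spectrum."""
    if total <= 0:
        raise ValueError("total must be positive")
    return covered / total


def mean_spectra_per_compound(spectra: int, compounds: int) -> float:
    """Average number of spectra per covered compound."""
    if compounds <= 0:
        raise ValueError("compounds must be positive")
    return spectra / compounds


coverage = coverage_fraction(COMPOUNDS_WITH_SPECTRA, ENTACT_COMPOUNDS_TOTAL)
mean_spectra = mean_spectra_per_compound(SPECTRA_ADDED, COMPOUNDS_WITH_SPECTRA)

if __name__ == "__main__":
    print(f"Compound coverage: {coverage:.1%}")
    print(f"Mean spectra per covered compound: {mean_spectra:.2f}")
```

Under these numbers, a little under two thirds of the ENTACT compounds are covered, with about seven spectra per covered compound on average. The abstract does not state how those spectra are distributed across compounds, so the average should be read as a rough indicator only.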

Source journal
Environmental Science: Processes & Impacts (Chemistry, Analytical; Environmental Sciences)
CiteScore: 9.50
Self-citation rate: 3.60%
Articles published: 202
Review time: 1 month
About the journal: Environmental Science: Processes & Impacts publishes high quality papers in all areas of the environmental chemical sciences, including chemistry of the air, water, soil and sediment. We welcome studies on the environmental fate and effects of anthropogenic and naturally occurring contaminants, both chemical and microbiological, as well as related natural element cycling processes.