In silico structures, mass spectra and retention indices database development for purposes of chemical weapons convention

IF 1.6 3区 化学 Q3 PHYSICS, ATOMIC, MOLECULAR & CHEMICAL
Albert Kireev , Sergey Osipenko , Liudmila Borisova , Evgeny Nikolaev , Yury Kostyukevich
{"title":"In silico structures, mass spectra and retention indices database development for purposes of chemical weapons convention","authors":"Albert Kireev ,&nbsp;Sergey Osipenko ,&nbsp;Liudmila Borisova ,&nbsp;Evgeny Nikolaev ,&nbsp;Yury Kostyukevich","doi":"10.1016/j.ijms.2024.117250","DOIUrl":null,"url":null,"abstract":"<div><p>The Chemical Weapons Convention (CWC) is a science-based international treaty for the disarmament and non-proliferation of chemical weapons. However, reference Orgaization for prohibition of chemical weapons (OPCW) central analytical database (OCAD) of structures with mass spectra (MS) and retention indices (RI) includes only minor part of all possible chemical species defined in the Schedules of CWC. In this work we employed OCAD <em>in silico</em> augmentation based on chemoinformatics approach for chemical structures enumeration, MS data generation based on message passing neural network and based on P-alkyl molecular pairs RI prediction to support the verification activities as provided for in the CWC. Enumerated 879 noncyclic and 5270 monocyclic alcohols became the basis for generating hundreds of thousands molecules of Schedule 1 toxic chemicals like Sarin, Tabun, VX and Novichok. Trained on ordinary and neutral loss electron ionization mass spectrometry (EI-MS) a message-passing neural network (MPNN) outperformed other quantum chemistry and machine learning methods. Generated by this MPNN <em>in silico</em> EI-MS are very similar to the library's spectra and allowed to reach desired match factor above 800 within a scale of 0–1000. Statistical data for molecular pairs based on P-alkyl fragments was collected and used to predict RIs within desired 20 RI window for some toxic chemicals of Schedule 1.A.01, for which in the current OCAD version RIs are absent.</p></div>","PeriodicalId":338,"journal":{"name":"International Journal of Mass Spectrometry","volume":null,"pages":null},"PeriodicalIF":1.6000,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Mass Spectrometry","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1387380624000617","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PHYSICS, ATOMIC, MOLECULAR & CHEMICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The Chemical Weapons Convention (CWC) is a science-based international treaty for the disarmament and non-proliferation of chemical weapons. However, reference Orgaization for prohibition of chemical weapons (OPCW) central analytical database (OCAD) of structures with mass spectra (MS) and retention indices (RI) includes only minor part of all possible chemical species defined in the Schedules of CWC. In this work we employed OCAD in silico augmentation based on chemoinformatics approach for chemical structures enumeration, MS data generation based on message passing neural network and based on P-alkyl molecular pairs RI prediction to support the verification activities as provided for in the CWC. Enumerated 879 noncyclic and 5270 monocyclic alcohols became the basis for generating hundreds of thousands molecules of Schedule 1 toxic chemicals like Sarin, Tabun, VX and Novichok. Trained on ordinary and neutral loss electron ionization mass spectrometry (EI-MS) a message-passing neural network (MPNN) outperformed other quantum chemistry and machine learning methods. Generated by this MPNN in silico EI-MS are very similar to the library's spectra and allowed to reach desired match factor above 800 within a scale of 0–1000. Statistical data for molecular pairs based on P-alkyl fragments was collected and used to predict RIs within desired 20 RI window for some toxic chemicals of Schedule 1.A.01, for which in the current OCAD version RIs are absent.

Abstract Image

为化学武器公约开发硅学结构、质谱和保留指数数据库
化学武器公约》(CWC)是一项以科学为基础的化学武器裁军和不扩散国际条约。然而,参考禁止化学武器组织(OPCW)中央分析数据库(OCAD)的质谱(MS)和保留指数(RI)结构只包括《化学武器公约》附表中定义的所有可能化学物种的一小部分。在这项工作中,我们利用基于化学信息学方法的 OCAD 进行了化学结构枚举、基于消息传递神经网络的质谱数据生成和基于对烷基分子对保留指数预测的硅学增强,以支持《化学武器公约》规定的验证活动。枚举出的 879 个非环醇和 5270 个单环醇成为生成沙林、塔崩、VX 和诺维乔克等附表 1 有毒化学品数十万个分子的基础。经过普通和中性损失电子电离质谱(EI-MS)训练的信息传递神经网络(MPNN)的性能优于其他量子化学和机器学习方法。由这种 MPNN 生成的硅学 EI-MS 与库中的光谱非常相似,在 0-1000 的范围内达到了 800 以上的理想匹配系数。收集了基于 P-烷基片段的分子对的统计数据,并用于预测附表 1.A.01 中某些有毒化学物质在所需的 20 个 RI 窗口内的 RI。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.60
自引率
5.60%
发文量
145
审稿时长
71 days
期刊介绍: The journal invites papers that advance the field of mass spectrometry by exploring fundamental aspects of ion processes using both the experimental and theoretical approaches, developing new instrumentation and experimental strategies for chemical analysis using mass spectrometry, developing new computational strategies for data interpretation and integration, reporting new applications of mass spectrometry and hyphenated techniques in biology, chemistry, geology, and physics. Papers, in which standard mass spectrometry techniques are used for analysis will not be considered. IJMS publishes full-length articles, short communications, reviews, and feature articles including young scientist features.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信