质谱预测的边缘映射:Xeno氨基酸的机器学习EIMS预测评估

IF 6.7 1区 化学 Q1 CHEMISTRY, ANALYTICAL
Sean M. Brown*, Evan Allgair and Robin Kryštůfek, 
{"title":"质谱预测的边缘映射:Xeno氨基酸的机器学习EIMS预测评估","authors":"Sean M. Brown*,&nbsp;Evan Allgair and Robin Kryštůfek,&nbsp;","doi":"10.1021/acs.analchem.5c0028610.1021/acs.analchem.5c00286","DOIUrl":null,"url":null,"abstract":"<p >Mass spectrometry is one of the most effective analytical methods for unknown compound identification. By comparing observed <i>m</i>/<i>z</i> spectra with a database of experimentally determined spectra, this process identifies compound(s) in any given sample. Unknown sample identification is thus limited to whatever has been experimentally determined. To address the reliance on experimentally determined signatures, multiple state-of-the-art MS spectra prediction algorithms have been developed within the past half decade. Here we evaluate the accuracy of the NEIMS spectral prediction algorithm. We focus our analyses on monosubstituted α-amino acids given their significance as important targets for astrobiology, synthetic biology, and diverse biomedical applications. Our general intent is to inform those using generated spectra for detection of unknown biomolecules. We find predicted spectra are inaccurate for amino acids beyond the algorithms training data. Interestingly, these inaccuracies are not explained by physicochemical differences or the derivatization state of the amino acids measured. We thus highlight the need to improve both current machine learning based approaches and further optimization of <i>ab initio</i> spectral prediction algorithms so as to expand databases for structures beyond what is currently experimentally possible, even including theoretical molecules.</p>","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":"97 19","pages":"10282–10288 10282–10288"},"PeriodicalIF":6.7000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acs.analchem.5c00286","citationCount":"0","resultStr":"{\"title\":\"Mapping the Edges of Mass Spectral Prediction: Evaluation of Machine Learning EIMS Prediction for Xeno Amino Acids\",\"authors\":\"Sean M. Brown*,&nbsp;Evan Allgair and Robin Kryštůfek,&nbsp;\",\"doi\":\"10.1021/acs.analchem.5c0028610.1021/acs.analchem.5c00286\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Mass spectrometry is one of the most effective analytical methods for unknown compound identification. By comparing observed <i>m</i>/<i>z</i> spectra with a database of experimentally determined spectra, this process identifies compound(s) in any given sample. Unknown sample identification is thus limited to whatever has been experimentally determined. To address the reliance on experimentally determined signatures, multiple state-of-the-art MS spectra prediction algorithms have been developed within the past half decade. Here we evaluate the accuracy of the NEIMS spectral prediction algorithm. We focus our analyses on monosubstituted α-amino acids given their significance as important targets for astrobiology, synthetic biology, and diverse biomedical applications. Our general intent is to inform those using generated spectra for detection of unknown biomolecules. We find predicted spectra are inaccurate for amino acids beyond the algorithms training data. Interestingly, these inaccuracies are not explained by physicochemical differences or the derivatization state of the amino acids measured. We thus highlight the need to improve both current machine learning based approaches and further optimization of <i>ab initio</i> spectral prediction algorithms so as to expand databases for structures beyond what is currently experimentally possible, even including theoretical molecules.</p>\",\"PeriodicalId\":27,\"journal\":{\"name\":\"Analytical Chemistry\",\"volume\":\"97 19\",\"pages\":\"10282–10288 10282–10288\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-05-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/epdf/10.1021/acs.analchem.5c00286\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Analytical Chemistry\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.analchem.5c00286\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, ANALYTICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.analchem.5c00286","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

质谱法是鉴定未知化合物最有效的分析方法之一。通过将观察到的m/z光谱与实验确定的光谱数据库进行比较,该过程可以识别任何给定样品中的化合物。因此,未知样品的鉴定仅限于实验确定的样品。为了解决对实验确定的特征的依赖,在过去的五年里,已经开发了多种最先进的质谱预测算法。本文对NEIMS光谱预测算法的精度进行了评价。鉴于单取代α-氨基酸作为天体生物学、合成生物学和多种生物医学应用的重要靶点,我们将重点分析它们。我们的总体目的是告知那些使用生成的光谱检测未知生物分子。我们发现预测光谱是不准确的氨基酸超出算法训练数据。有趣的是,这些不准确性不能用物理化学差异或所测氨基酸的衍生化状态来解释。因此,我们强调需要改进当前基于机器学习的方法和进一步优化从头算光谱预测算法,以便扩展数据库,以超越目前实验可能的结构,甚至包括理论分子。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Mapping the Edges of Mass Spectral Prediction: Evaluation of Machine Learning EIMS Prediction for Xeno Amino Acids

Mass spectrometry is one of the most effective analytical methods for unknown compound identification. By comparing observed m/z spectra with a database of experimentally determined spectra, this process identifies compound(s) in any given sample. Unknown sample identification is thus limited to whatever has been experimentally determined. To address the reliance on experimentally determined signatures, multiple state-of-the-art MS spectra prediction algorithms have been developed within the past half decade. Here we evaluate the accuracy of the NEIMS spectral prediction algorithm. We focus our analyses on monosubstituted α-amino acids given their significance as important targets for astrobiology, synthetic biology, and diverse biomedical applications. Our general intent is to inform those using generated spectra for detection of unknown biomolecules. We find predicted spectra are inaccurate for amino acids beyond the algorithms training data. Interestingly, these inaccuracies are not explained by physicochemical differences or the derivatization state of the amino acids measured. We thus highlight the need to improve both current machine learning based approaches and further optimization of ab initio spectral prediction algorithms so as to expand databases for structures beyond what is currently experimentally possible, even including theoretical molecules.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Analytical Chemistry
Analytical Chemistry 化学-分析化学
CiteScore
12.10
自引率
12.20%
发文量
1949
审稿时长
1.4 months
期刊介绍: Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信