IF 6.7 1区 化学 Q1 CHEMISTRY, ANALYTICAL
Cailum M. K. Stienstra, Emir Nazdrajić, W. Scott Hopkins
{"title":"From Reverse Phase Chromatography to HILIC: Graph Transformers Power Method-Independent Machine Learning of Retention Times","authors":"Cailum M. K. Stienstra, Emir Nazdrajić, W. Scott Hopkins","doi":"10.1021/acs.analchem.4c05859","DOIUrl":null,"url":null,"abstract":"Liquid chromatography (LC) is a cornerstone of analytical separations, but comparing the retention times (RTs) across different LC methods is challenging because of variations in experimental parameters such as column type and solvent gradient. Nevertheless, RTs are powerful metrics in tandem mass spectrometry (MS<sup>2</sup>) that can reduce false positive rates for metabolite annotation, differentiate isobaric species, and improve peptide identification. Here, we present Graphormer-RT, a novel graph transformer that performs the first single-model method-independent prediction of RTs. We use the RepoRT data set, which contains 142,688 reverse phase (RP) RTs (from 191 methods) and 4,373 HILIC RTs (from 49 methods). Our best RP model (trained and tested on 191 methods) achieved a test set mean average error (MAE) of 29.3 ± 0.6 s, comparable performance to the state-of-the-art model which was only trained on a single LC method. Our best-performing HILIC model achieved a test MAE = 42.4 ± 2.9 s. We expect that Graphormer-RT can be used as an LC “foundation model”, where transfer learning can reduce the amount of training data needed for highly accurate “specialist” models applied to method-specific RP and HILIC tasks. These frameworks could enable the machine optimization of automated LC workflows, improved filtration of candidate structures using predicted RTs, and the <i>in silico</i> annotation of unknown analytes in LC-MS<sup>2</sup> measurements.","PeriodicalId":27,"journal":{"name":"Analytical Chemistry","volume":"15 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1021/acs.analchem.4c05859","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

摘要

液相色谱法(LC)是分析分离的基石,但由于实验参数(如色谱柱类型和溶剂梯度)的变化,比较不同 LC 方法的保留时间(RTs)具有挑战性。然而,保留时间是串联质谱(MS2)的强大指标,可以降低代谢物注释的假阳性率,区分等位物种,并提高肽的鉴定能力。在此,我们介绍一种新型图转换器 Graphormer-RT,它首次实现了与单一模型方法无关的 RT 预测。我们使用的 RepoRT 数据集包含 142688 个反相 (RP) RT(来自 191 种方法)和 4373 个 HILIC RT(来自 49 种方法)。我们的最佳 RP 模型(在 191 种方法上进行了训练和测试)在测试集上的平均误差 (MAE) 为 29.3 ± 0.6 s,与只在一种 LC 方法上进行训练的最先进模型性能相当。我们预计,Graphormer-RT 可用作 LC "基础模型",其中迁移学习可减少应用于特定方法 RP 和 HILIC 任务的高精度 "专业 "模型所需的训练数据量。这些框架可以实现自动液相色谱工作流程的机器优化,利用预测的 RTs 改进候选结构的过滤,以及在 LC-MS2 测量中对未知分析物进行硅标注。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

From Reverse Phase Chromatography to HILIC: Graph Transformers Power Method-Independent Machine Learning of Retention Times

From Reverse Phase Chromatography to HILIC: Graph Transformers Power Method-Independent Machine Learning of Retention Times
Liquid chromatography (LC) is a cornerstone of analytical separations, but comparing the retention times (RTs) across different LC methods is challenging because of variations in experimental parameters such as column type and solvent gradient. Nevertheless, RTs are powerful metrics in tandem mass spectrometry (MS2) that can reduce false positive rates for metabolite annotation, differentiate isobaric species, and improve peptide identification. Here, we present Graphormer-RT, a novel graph transformer that performs the first single-model method-independent prediction of RTs. We use the RepoRT data set, which contains 142,688 reverse phase (RP) RTs (from 191 methods) and 4,373 HILIC RTs (from 49 methods). Our best RP model (trained and tested on 191 methods) achieved a test set mean average error (MAE) of 29.3 ± 0.6 s, comparable performance to the state-of-the-art model which was only trained on a single LC method. Our best-performing HILIC model achieved a test MAE = 42.4 ± 2.9 s. We expect that Graphormer-RT can be used as an LC “foundation model”, where transfer learning can reduce the amount of training data needed for highly accurate “specialist” models applied to method-specific RP and HILIC tasks. These frameworks could enable the machine optimization of automated LC workflows, improved filtration of candidate structures using predicted RTs, and the in silico annotation of unknown analytes in LC-MS2 measurements.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Analytical Chemistry
Analytical Chemistry 化学-分析化学
CiteScore
12.10
自引率
12.20%
发文量
1949
审稿时长
1.4 months
期刊介绍: Analytical Chemistry, a peer-reviewed research journal, focuses on disseminating new and original knowledge across all branches of analytical chemistry. Fundamental articles may explore general principles of chemical measurement science and need not directly address existing or potential analytical methodology. They can be entirely theoretical or report experimental results. Contributions may cover various phases of analytical operations, including sampling, bioanalysis, electrochemistry, mass spectrometry, microscale and nanoscale systems, environmental analysis, separations, spectroscopy, chemical reactions and selectivity, instrumentation, imaging, surface analysis, and data processing. Papers discussing known analytical methods should present a significant, original application of the method, a notable improvement, or results on an important analyte.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信