Deciphering Important Metabolic Pathways through Reaction Pair Prediction with a Siamese Transformer Network

IF 6.1 Q1 CHEMISTRY, MULTIDISCIPLINARY
Han Bao, Xinxin Wang, Jinhui Zhao, Bin Wang, Xinjie Zhao, Chunxia Zhao, Xin Lu, Guowang Xu
{"title":"Deciphering Important Metabolic Pathways through Reaction Pair Prediction with a Siamese Transformer Network","authors":"Han Bao,&nbsp;Xinxin Wang,&nbsp;Jinhui Zhao,&nbsp;Bin Wang,&nbsp;Xinjie Zhao,&nbsp;Chunxia Zhao,&nbsp;Xin Lu,&nbsp;Guowang Xu","doi":"10.1002/cmtd.202400064","DOIUrl":null,"url":null,"abstract":"<p>Important pathway identification is essential for unraveling biological mechanisms in functional metabolomics. However, current pathway enrichment is often biased toward well-characterized pathways due to low annotation rates in untargeted metabolomics and incomplete pathway coverage. It leads to potential misinterpretation of metabolomics data. Herein, Siamese transformer reaction pair (STRP) prediction, an approach for important pathway exploration in metabolomics, is introduced. STRP leverages a weight-sharing Siamese network and a multihead attention Transformer encoder to predict metabolic reaction pairs, utilizing molecular fingerprints derived from either known metabolites or tandem mass spectra of unannotated metabolic features. Pathway labels are then deduced for metabolic features from known pathway metabolites within the reaction pairs. STRP can achieve crossvalidation metrics of 98.10%/98.13% accuracy, 97.98%/98.01% precision, 97.94%/97.97% recall, 97.96%/97.99% F1 score, and 99.56%/99.57% area under the receiver operating characteristic curve of spectral pairs in ESI<sup>+</sup>/ESI<sup>−</sup> modes. It is applied to metabolomics datasets from prostate cancer and diabetic retinopathy. STRP successfully identifies and interprets important metabolic pathways, demonstrating its robust utility for important pathway identification. Besides, STRP-based molecular network showcases potential application in metabolome annotation. This approach reveals a significant advancement in leveraging high-resolution mass spectrometry-based metabolomics data, with the potential to transform understanding of complex biological processes.</p>","PeriodicalId":72562,"journal":{"name":"Chemistry methods : new approaches to solving problems in chemistry","volume":"5 6","pages":""},"PeriodicalIF":6.1000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cmtd.202400064","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemistry methods : new approaches to solving problems in chemistry","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cmtd.202400064","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Important pathway identification is essential for unraveling biological mechanisms in functional metabolomics. However, current pathway enrichment is often biased toward well-characterized pathways due to low annotation rates in untargeted metabolomics and incomplete pathway coverage. It leads to potential misinterpretation of metabolomics data. Herein, Siamese transformer reaction pair (STRP) prediction, an approach for important pathway exploration in metabolomics, is introduced. STRP leverages a weight-sharing Siamese network and a multihead attention Transformer encoder to predict metabolic reaction pairs, utilizing molecular fingerprints derived from either known metabolites or tandem mass spectra of unannotated metabolic features. Pathway labels are then deduced for metabolic features from known pathway metabolites within the reaction pairs. STRP can achieve crossvalidation metrics of 98.10%/98.13% accuracy, 97.98%/98.01% precision, 97.94%/97.97% recall, 97.96%/97.99% F1 score, and 99.56%/99.57% area under the receiver operating characteristic curve of spectral pairs in ESI+/ESI modes. It is applied to metabolomics datasets from prostate cancer and diabetic retinopathy. STRP successfully identifies and interprets important metabolic pathways, demonstrating its robust utility for important pathway identification. Besides, STRP-based molecular network showcases potential application in metabolome annotation. This approach reveals a significant advancement in leveraging high-resolution mass spectrometry-based metabolomics data, with the potential to transform understanding of complex biological processes.

通过暹罗变压器网络的反应对预测来破译重要的代谢途径
重要的途径鉴定是揭示功能代谢组学生物学机制的必要条件。然而,由于非靶向代谢组学的低注释率和不完整的途径覆盖,目前的途径富集往往偏向于特征良好的途径。这可能导致对代谢组学数据的误解。本文介绍了Siamese transformer reaction pair (STRP)预测这一代谢组学中重要的途径探索方法。STRP利用权重共享暹罗网络和多头注意力转换器编码器来预测代谢反应对,利用来自已知代谢物的分子指纹或未注释代谢特征的串联质谱。然后从反应对内已知的途径代谢物中推断出代谢特征的途径标签。在ESI+/ESI−模式下,STRP可以实现98.10%/98.13%的准确度、97.98%/98.01%的精密度、97.94%/97.97%的召回率、97.96%/97.99%的F1评分和99.56%/99.57%的光谱对接收者工作特征曲线下面积的交叉验证指标。它应用于前列腺癌和糖尿病视网膜病变的代谢组学数据集。STRP成功地识别和解释了重要的代谢途径,证明了其在重要途径识别方面的强大实用性。此外,基于strp的分子网络在代谢组注释方面也有潜在的应用前景。这种方法揭示了利用基于高分辨率质谱的代谢组学数据的重大进步,有可能改变对复杂生物过程的理解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.30
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信