利用原子流匹配设计配体结合蛋白

Junqi Liu, Shaoning Li, Chence Shi, Zhi Yang, Jian Tang
{"title":"利用原子流匹配设计配体结合蛋白","authors":"Junqi Liu, Shaoning Li, Chence Shi, Zhi Yang, Jian Tang","doi":"arxiv-2409.12080","DOIUrl":null,"url":null,"abstract":"Designing novel proteins that bind to small molecules is a long-standing\nchallenge in computational biology, with applications in developing catalysts,\nbiosensors, and more. Current computational methods rely on the assumption that\nthe binding pose of the target molecule is known, which is not always feasible,\nas conformations of novel targets are often unknown and tend to change upon\nbinding. In this work, we formulate proteins and molecules as unified\nbiotokens, and present AtomFlow, a novel deep generative model under the\nflow-matching framework for the design of ligand-binding proteins from the 2D\ntarget molecular graph alone. Operating on representative atoms of biotokens,\nAtomFlow captures the flexibility of ligands and generates ligand conformations\nand protein backbone structures iteratively. We consider the multi-scale nature\nof biotokens and demonstrate that AtomFlow can be effectively trained on a\nsubset of structures from the Protein Data Bank, by matching flow vector field\nusing an SE(3) equivariant structure prediction network. Experimental results\nshow that our method can generate high fidelity ligand-binding proteins and\nachieve performance comparable to the state-of-the-art model RFDiffusionAA,\nwhile not requiring bound ligand structures. As a general framework, AtomFlow\nholds the potential to be applied to various biomolecule generation tasks in\nthe future.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"40 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design of Ligand-Binding Proteins with Atomic Flow Matching\",\"authors\":\"Junqi Liu, Shaoning Li, Chence Shi, Zhi Yang, Jian Tang\",\"doi\":\"arxiv-2409.12080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Designing novel proteins that bind to small molecules is a long-standing\\nchallenge in computational biology, with applications in developing catalysts,\\nbiosensors, and more. Current computational methods rely on the assumption that\\nthe binding pose of the target molecule is known, which is not always feasible,\\nas conformations of novel targets are often unknown and tend to change upon\\nbinding. In this work, we formulate proteins and molecules as unified\\nbiotokens, and present AtomFlow, a novel deep generative model under the\\nflow-matching framework for the design of ligand-binding proteins from the 2D\\ntarget molecular graph alone. Operating on representative atoms of biotokens,\\nAtomFlow captures the flexibility of ligands and generates ligand conformations\\nand protein backbone structures iteratively. We consider the multi-scale nature\\nof biotokens and demonstrate that AtomFlow can be effectively trained on a\\nsubset of structures from the Protein Data Bank, by matching flow vector field\\nusing an SE(3) equivariant structure prediction network. Experimental results\\nshow that our method can generate high fidelity ligand-binding proteins and\\nachieve performance comparable to the state-of-the-art model RFDiffusionAA,\\nwhile not requiring bound ligand structures. As a general framework, AtomFlow\\nholds the potential to be applied to various biomolecule generation tasks in\\nthe future.\",\"PeriodicalId\":501022,\"journal\":{\"name\":\"arXiv - QuanBio - Biomolecules\",\"volume\":\"40 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Biomolecules\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.12080\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.12080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

设计能与小分子结合的新型蛋白质是计算生物学中一个长期存在的挑战,它可应用于开发催化剂、生物传感器等。目前的计算方法依赖于目标分子的结合姿态是已知的这一假设,但这并不总是可行的,因为新型目标分子的构象通常是未知的,而且在结合时往往会发生变化。在这项工作中,我们将蛋白质和分子表述为统一的生物托肯(biotokens),并提出了AtomFlow--一种新颖的深度生成模型,在流匹配框架下,仅从二维目标分子图设计配体结合蛋白质。AtomFlow 在生物试剂的代表性原子上运行,捕捉配体的灵活性,迭代生成配体构象和蛋白质骨架结构。我们考虑了生物试剂的多尺度性质,并证明 AtomFlow 可以通过使用 SE(3) 等变结构预测网络匹配流矢量场,在蛋白质数据库的结构子集上进行有效训练。实验结果表明,我们的方法可以生成高保真配体结合蛋白,其性能可与最先进的模型 RFDiffusionAA 相媲美,而且不需要结合配体结构。作为一个通用框架,AtomFlow 有潜力在未来应用于各种生物大分子生成任务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Design of Ligand-Binding Proteins with Atomic Flow Matching
Designing novel proteins that bind to small molecules is a long-standing challenge in computational biology, with applications in developing catalysts, biosensors, and more. Current computational methods rely on the assumption that the binding pose of the target molecule is known, which is not always feasible, as conformations of novel targets are often unknown and tend to change upon binding. In this work, we formulate proteins and molecules as unified biotokens, and present AtomFlow, a novel deep generative model under the flow-matching framework for the design of ligand-binding proteins from the 2D target molecular graph alone. Operating on representative atoms of biotokens, AtomFlow captures the flexibility of ligands and generates ligand conformations and protein backbone structures iteratively. We consider the multi-scale nature of biotokens and demonstrate that AtomFlow can be effectively trained on a subset of structures from the Protein Data Bank, by matching flow vector field using an SE(3) equivariant structure prediction network. Experimental results show that our method can generate high fidelity ligand-binding proteins and achieve performance comparable to the state-of-the-art model RFDiffusionAA, while not requiring bound ligand structures. As a general framework, AtomFlow holds the potential to be applied to various biomolecule generation tasks in the future.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信