{"title":"利用原子流匹配设计配体结合蛋白","authors":"Junqi Liu, Shaoning Li, Chence Shi, Zhi Yang, Jian Tang","doi":"arxiv-2409.12080","DOIUrl":null,"url":null,"abstract":"Designing novel proteins that bind to small molecules is a long-standing\nchallenge in computational biology, with applications in developing catalysts,\nbiosensors, and more. Current computational methods rely on the assumption that\nthe binding pose of the target molecule is known, which is not always feasible,\nas conformations of novel targets are often unknown and tend to change upon\nbinding. In this work, we formulate proteins and molecules as unified\nbiotokens, and present AtomFlow, a novel deep generative model under the\nflow-matching framework for the design of ligand-binding proteins from the 2D\ntarget molecular graph alone. Operating on representative atoms of biotokens,\nAtomFlow captures the flexibility of ligands and generates ligand conformations\nand protein backbone structures iteratively. We consider the multi-scale nature\nof biotokens and demonstrate that AtomFlow can be effectively trained on a\nsubset of structures from the Protein Data Bank, by matching flow vector field\nusing an SE(3) equivariant structure prediction network. Experimental results\nshow that our method can generate high fidelity ligand-binding proteins and\nachieve performance comparable to the state-of-the-art model RFDiffusionAA,\nwhile not requiring bound ligand structures. As a general framework, AtomFlow\nholds the potential to be applied to various biomolecule generation tasks in\nthe future.","PeriodicalId":501022,"journal":{"name":"arXiv - QuanBio - Biomolecules","volume":"40 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design of Ligand-Binding Proteins with Atomic Flow Matching\",\"authors\":\"Junqi Liu, Shaoning Li, Chence Shi, Zhi Yang, Jian Tang\",\"doi\":\"arxiv-2409.12080\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Designing novel proteins that bind to small molecules is a long-standing\\nchallenge in computational biology, with applications in developing catalysts,\\nbiosensors, and more. Current computational methods rely on the assumption that\\nthe binding pose of the target molecule is known, which is not always feasible,\\nas conformations of novel targets are often unknown and tend to change upon\\nbinding. In this work, we formulate proteins and molecules as unified\\nbiotokens, and present AtomFlow, a novel deep generative model under the\\nflow-matching framework for the design of ligand-binding proteins from the 2D\\ntarget molecular graph alone. Operating on representative atoms of biotokens,\\nAtomFlow captures the flexibility of ligands and generates ligand conformations\\nand protein backbone structures iteratively. We consider the multi-scale nature\\nof biotokens and demonstrate that AtomFlow can be effectively trained on a\\nsubset of structures from the Protein Data Bank, by matching flow vector field\\nusing an SE(3) equivariant structure prediction network. Experimental results\\nshow that our method can generate high fidelity ligand-binding proteins and\\nachieve performance comparable to the state-of-the-art model RFDiffusionAA,\\nwhile not requiring bound ligand structures. As a general framework, AtomFlow\\nholds the potential to be applied to various biomolecule generation tasks in\\nthe future.\",\"PeriodicalId\":501022,\"journal\":{\"name\":\"arXiv - QuanBio - Biomolecules\",\"volume\":\"40 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - QuanBio - Biomolecules\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.12080\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - QuanBio - Biomolecules","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.12080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Design of Ligand-Binding Proteins with Atomic Flow Matching
Designing novel proteins that bind to small molecules is a long-standing
challenge in computational biology, with applications in developing catalysts,
biosensors, and more. Current computational methods rely on the assumption that
the binding pose of the target molecule is known, which is not always feasible,
as conformations of novel targets are often unknown and tend to change upon
binding. In this work, we formulate proteins and molecules as unified
biotokens, and present AtomFlow, a novel deep generative model under the
flow-matching framework for the design of ligand-binding proteins from the 2D
target molecular graph alone. Operating on representative atoms of biotokens,
AtomFlow captures the flexibility of ligands and generates ligand conformations
and protein backbone structures iteratively. We consider the multi-scale nature
of biotokens and demonstrate that AtomFlow can be effectively trained on a
subset of structures from the Protein Data Bank, by matching flow vector field
using an SE(3) equivariant structure prediction network. Experimental results
show that our method can generate high fidelity ligand-binding proteins and
achieve performance comparable to the state-of-the-art model RFDiffusionAA,
while not requiring bound ligand structures. As a general framework, AtomFlow
holds the potential to be applied to various biomolecule generation tasks in
the future.