通过深度学习优化的氨基酰基- trna合成酶酶表现为准种。

IF 2.3 2区 物理与天体物理 Q3 CHEMISTRY, PHYSICAL
Structural Dynamics-Us Pub Date : 2025-04-25 eCollection Date: 2025-03-01 DOI:10.1063/4.0000294
Sourav Kumar Patra, Nicholas Randolph, Brian Kuhlman, Henry Dieckhaus, Laurie Betts, Jordan Douglas, Peter R Wills, Charles W Carter
{"title":"通过深度学习优化的氨基酰基- trna合成酶酶表现为准种。","authors":"Sourav Kumar Patra, Nicholas Randolph, Brian Kuhlman, Henry Dieckhaus, Laurie Betts, Jordan Douglas, Peter R Wills, Charles W Carter","doi":"10.1063/4.0000294","DOIUrl":null,"url":null,"abstract":"<p><p>Protein design plays a key role in our efforts to work out how genetic coding began. That effort entails urzymes. Urzymes are small, conserved excerpts from full-length aminoacyl-tRNA synthetases that remain active. Urzymes require design to connect disjoint pieces and repair naked nonpolar patches created by removing large domains. Rosetta allowed us to create the first urzymes, but those urzymes were only sparingly soluble. We could measure activity, but it was hard to concentrate those samples to levels required for structural biology. Here, we used the deep learning algorithms ProteinMPNN and AlphaFold2 to redesign a set of optimized LeuAC urzymes derived from leucyl-tRNA synthetase. We select a balanced, representative subset of eight variants for testing using principal component analysis. Most tested variants are much more soluble than the original LeuAC. They also span a range of catalytic proficiency and amino acid specificity. The data enable detailed statistical analyses of the sources of both solubility and specificity. In that way, we show how to begin to unwrap the elements of protein chemistry that were hidden within the neural networks. Deep learning networks have thus helped us surmount several vexing obstacles to further investigations into the nature of ancestral proteins. Finally, we discuss how the eight variants might resemble a sample drawn from a population similar to one subject to natural selection.</p>","PeriodicalId":48683,"journal":{"name":"Structural Dynamics-Us","volume":"12 2","pages":"024701"},"PeriodicalIF":2.3000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033045/pdf/","citationCount":"0","resultStr":"{\"title\":\"Aminoacyl-tRNA synthetase urzymes optimized by deep learning behave as a quasispecies.\",\"authors\":\"Sourav Kumar Patra, Nicholas Randolph, Brian Kuhlman, Henry Dieckhaus, Laurie Betts, Jordan Douglas, Peter R Wills, Charles W Carter\",\"doi\":\"10.1063/4.0000294\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Protein design plays a key role in our efforts to work out how genetic coding began. That effort entails urzymes. Urzymes are small, conserved excerpts from full-length aminoacyl-tRNA synthetases that remain active. Urzymes require design to connect disjoint pieces and repair naked nonpolar patches created by removing large domains. Rosetta allowed us to create the first urzymes, but those urzymes were only sparingly soluble. We could measure activity, but it was hard to concentrate those samples to levels required for structural biology. Here, we used the deep learning algorithms ProteinMPNN and AlphaFold2 to redesign a set of optimized LeuAC urzymes derived from leucyl-tRNA synthetase. We select a balanced, representative subset of eight variants for testing using principal component analysis. Most tested variants are much more soluble than the original LeuAC. They also span a range of catalytic proficiency and amino acid specificity. The data enable detailed statistical analyses of the sources of both solubility and specificity. In that way, we show how to begin to unwrap the elements of protein chemistry that were hidden within the neural networks. Deep learning networks have thus helped us surmount several vexing obstacles to further investigations into the nature of ancestral proteins. Finally, we discuss how the eight variants might resemble a sample drawn from a population similar to one subject to natural selection.</p>\",\"PeriodicalId\":48683,\"journal\":{\"name\":\"Structural Dynamics-Us\",\"volume\":\"12 2\",\"pages\":\"024701\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2025-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12033045/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Structural Dynamics-Us\",\"FirstCategoryId\":\"101\",\"ListUrlMain\":\"https://doi.org/10.1063/4.0000294\",\"RegionNum\":2,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, PHYSICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Structural Dynamics-Us","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1063/4.0000294","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"CHEMISTRY, PHYSICAL","Score":null,"Total":0}
引用次数: 0

摘要

蛋白质设计在我们研究基因编码如何开始的过程中起着关键作用。这种努力需要酶。Urzymes是全长氨基酰trna合成酶的小而保守的片段,仍然具有活性。Urzymes需要设计来连接不连接的片段,并修复通过去除大结构域而产生的裸露的非极性补丁。罗塞塔号让我们创造了第一批酶,但这些酶只能少量溶解。我们可以测量活性,但很难将这些样本集中到结构生物学所需的水平。在这里,我们使用深度学习算法ProteinMPNN和AlphaFold2重新设计了一组优化的LeuAC酶,这些酶来自于亮氨酸- trna合成酶。我们选择一个平衡的,有代表性的八个变量子集,使用主成分分析进行测试。大多数测试的变种比原来的LeuAC更易溶解。它们还跨越了催化能力和氨基酸特异性的范围。这些数据可以对溶解度和特异性的来源进行详细的统计分析。通过这种方式,我们展示了如何开始揭开隐藏在神经网络中的蛋白质化学元素。因此,深度学习网络帮助我们克服了几个令人烦恼的障碍,从而进一步研究了祖先蛋白质的本质。最后,我们讨论了这八种变体如何类似于自然选择中从类似于一个主体的群体中抽取的样本。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Aminoacyl-tRNA synthetase urzymes optimized by deep learning behave as a quasispecies.

Protein design plays a key role in our efforts to work out how genetic coding began. That effort entails urzymes. Urzymes are small, conserved excerpts from full-length aminoacyl-tRNA synthetases that remain active. Urzymes require design to connect disjoint pieces and repair naked nonpolar patches created by removing large domains. Rosetta allowed us to create the first urzymes, but those urzymes were only sparingly soluble. We could measure activity, but it was hard to concentrate those samples to levels required for structural biology. Here, we used the deep learning algorithms ProteinMPNN and AlphaFold2 to redesign a set of optimized LeuAC urzymes derived from leucyl-tRNA synthetase. We select a balanced, representative subset of eight variants for testing using principal component analysis. Most tested variants are much more soluble than the original LeuAC. They also span a range of catalytic proficiency and amino acid specificity. The data enable detailed statistical analyses of the sources of both solubility and specificity. In that way, we show how to begin to unwrap the elements of protein chemistry that were hidden within the neural networks. Deep learning networks have thus helped us surmount several vexing obstacles to further investigations into the nature of ancestral proteins. Finally, we discuss how the eight variants might resemble a sample drawn from a population similar to one subject to natural selection.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Structural Dynamics-Us
Structural Dynamics-Us CHEMISTRY, PHYSICALPHYSICS, ATOMIC, MOLECU-PHYSICS, ATOMIC, MOLECULAR & CHEMICAL
CiteScore
5.50
自引率
3.60%
发文量
24
审稿时长
16 weeks
期刊介绍: Structural Dynamics focuses on the recent developments in experimental and theoretical methods and techniques that allow a visualization of the electronic and geometric structural changes in real time of chemical, biological, and condensed-matter systems. The community of scientists and engineers working on structural dynamics in such diverse systems often use similar instrumentation and methods. The journal welcomes articles dealing with fundamental problems of electronic and structural dynamics that are tackled by new methods, such as: Time-resolved X-ray and electron diffraction and scattering, Coherent diffractive imaging, Time-resolved X-ray spectroscopies (absorption, emission, resonant inelastic scattering, etc.), Time-resolved electron energy loss spectroscopy (EELS) and electron microscopy, Time-resolved photoelectron spectroscopies (UPS, XPS, ARPES, etc.), Multidimensional spectroscopies in the infrared, the visible and the ultraviolet, Nonlinear spectroscopies in the VUV, the soft and the hard X-ray domains, Theory and computational methods and algorithms for the analysis and description of structuraldynamics and their associated experimental signals. These new methods are enabled by new instrumentation, such as: X-ray free electron lasers, which provide flux, coherence, and time resolution, New sources of ultrashort electron pulses, New sources of ultrashort vacuum ultraviolet (VUV) to hard X-ray pulses, such as high-harmonic generation (HHG) sources or plasma-based sources, New sources of ultrashort infrared and terahertz (THz) radiation, New detectors for X-rays and electrons, New sample handling and delivery schemes, New computational capabilities.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信