在阿尔法fold时代追求远程进化配对

IF 3.2 4区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
Biopolymers Pub Date : 2023-02-08 DOI:10.1002/bip.23530
Theodoros K. Karamanos
{"title":"在阿尔法fold时代追求远程进化配对","authors":"Theodoros K. Karamanos","doi":"10.1002/bip.23530","DOIUrl":null,"url":null,"abstract":"<p>Coevolution between protein residues is normally interpreted as direct contact. However, the evolutionary record of a protein sequence contains rich information that may include long-range functional couplings, couplings that report on homo-oligomeric states or even conformational changes. Due to the complexity of the sequence space and the lack of structural information on various members of a protein family, it has been difficult to effectively mine the additional information encoded in a multiple sequence alignment (MSA). Here, taking advantage of the recent release of the AlphaFold (AF) database we attempt to identify coevolutionary couplings that cannot be explained simply by spatial proximity. We propose a simple computational method that performs direct coupling analysis on a MSA and searches for couplings that are not satisfied in any of the AF models of members of the identified protein family. Application of this method on 2012 protein families suggests that ~12% of the total identified coevolving residue pairs are spatially distant and more likely to be disordered than their contacting counterparts. We expect that this analysis will help improve the quality of coevolutionary distance restraints used for structure determination and will be useful in identifying potentially functional/allosteric cross-talk between distant residues.</p>","PeriodicalId":8866,"journal":{"name":"Biopolymers","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2023-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bip.23530","citationCount":"2","resultStr":"{\"title\":\"Chasing long-range evolutionary couplings in the AlphaFold era\",\"authors\":\"Theodoros K. Karamanos\",\"doi\":\"10.1002/bip.23530\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Coevolution between protein residues is normally interpreted as direct contact. However, the evolutionary record of a protein sequence contains rich information that may include long-range functional couplings, couplings that report on homo-oligomeric states or even conformational changes. Due to the complexity of the sequence space and the lack of structural information on various members of a protein family, it has been difficult to effectively mine the additional information encoded in a multiple sequence alignment (MSA). Here, taking advantage of the recent release of the AlphaFold (AF) database we attempt to identify coevolutionary couplings that cannot be explained simply by spatial proximity. We propose a simple computational method that performs direct coupling analysis on a MSA and searches for couplings that are not satisfied in any of the AF models of members of the identified protein family. Application of this method on 2012 protein families suggests that ~12% of the total identified coevolving residue pairs are spatially distant and more likely to be disordered than their contacting counterparts. We expect that this analysis will help improve the quality of coevolutionary distance restraints used for structure determination and will be useful in identifying potentially functional/allosteric cross-talk between distant residues.</p>\",\"PeriodicalId\":8866,\"journal\":{\"name\":\"Biopolymers\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2023-02-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/bip.23530\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biopolymers\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/bip.23530\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biopolymers","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/bip.23530","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 2

摘要

蛋白质残基之间的共同进化通常被解释为直接接触。然而,蛋白质序列的进化记录包含丰富的信息,可能包括远程功能偶联,偶联报告同源寡聚态甚至构象变化。由于序列空间的复杂性和缺乏蛋白质家族各成员的结构信息,难以有效地挖掘多序列比对(MSA)中编码的附加信息。在这里,利用最近发布的AlphaFold (AF)数据库,我们试图识别不能简单地用空间接近来解释的共同进化耦合。我们提出了一种简单的计算方法,对MSA进行直接耦合分析,并搜索在鉴定蛋白家族成员的任何AF模型中不满足的耦合。该方法对2012个蛋白质家族的应用表明,在已鉴定的共进化残基对中,约12%的残基在空间上距离较远,且比其相邻的残基更容易无序。我们期望这一分析将有助于提高用于结构测定的共进化距离限制的质量,并将有助于识别远距离残基之间潜在的功能/变构串扰。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Chasing long-range evolutionary couplings in the AlphaFold era

Chasing long-range evolutionary couplings in the AlphaFold era

Coevolution between protein residues is normally interpreted as direct contact. However, the evolutionary record of a protein sequence contains rich information that may include long-range functional couplings, couplings that report on homo-oligomeric states or even conformational changes. Due to the complexity of the sequence space and the lack of structural information on various members of a protein family, it has been difficult to effectively mine the additional information encoded in a multiple sequence alignment (MSA). Here, taking advantage of the recent release of the AlphaFold (AF) database we attempt to identify coevolutionary couplings that cannot be explained simply by spatial proximity. We propose a simple computational method that performs direct coupling analysis on a MSA and searches for couplings that are not satisfied in any of the AF models of members of the identified protein family. Application of this method on 2012 protein families suggests that ~12% of the total identified coevolving residue pairs are spatially distant and more likely to be disordered than their contacting counterparts. We expect that this analysis will help improve the quality of coevolutionary distance restraints used for structure determination and will be useful in identifying potentially functional/allosteric cross-talk between distant residues.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biopolymers
Biopolymers 生物-生化与分子生物学
CiteScore
5.30
自引率
0.00%
发文量
48
审稿时长
3 months
期刊介绍: Founded in 1963, Biopolymers publishes strictly peer-reviewed papers examining naturally occurring and synthetic biological macromolecules. By including experimental and theoretical studies on the fundamental behaviour as well as applications of biopolymers, the journal serves the interdisciplinary biochemical, biophysical, biomaterials and biomedical research communities.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信