Structural and functional studies of S-adenosyl-L-methionine binding proteins: a ligand-centric approach

IF 2.222 Q3 Biochemistry, Genetics and Molecular Biology
Rajaram Gana, Shruti Rao, Hongzhan Huang, Cathy Wu, Sona Vasudevan
{"title":"Structural and functional studies of S-adenosyl-L-methionine binding proteins: a ligand-centric approach","authors":"Rajaram Gana,&nbsp;Shruti Rao,&nbsp;Hongzhan Huang,&nbsp;Cathy Wu,&nbsp;Sona Vasudevan","doi":"10.1186/1472-6807-13-6","DOIUrl":null,"url":null,"abstract":"<p>The post-genomic era poses several challenges. The biggest is the identification of biochemical function for protein sequences and structures resulting from genomic initiatives. Most sequences lack a characterized function and are annotated as hypothetical or uncharacterized. While homology-based methods are useful, and work well for sequences with sequence identities above 50%, they fail for sequences in the twilight zone (&lt;30%) of sequence identity. For cases where sequence methods fail, structural approaches are often used, based on the premise that structure preserves function for longer evolutionary time-frames than sequence alone. It is now clear that no single method can be used successfully for functional inference. Given the growing need for functional assignments, we describe here a systematic new approach, designated ligand-centric, which is primarily based on analysis of ligand-bound/unbound structures in the PDB. Results of applying our approach to S-adenosyl-L-methionine (SAM) binding proteins are presented.</p><p>Our analysis included 1,224 structures that belong to 172 unique families of the Protein Information Resource Superfamily system. Our ligand-centric approach was divided into four levels: residue, protein/domain, ligand, and family levels. The residue level included the identification of conserved binding site residues based on structure-guided sequence alignments of representative members of a family, and the identification of conserved structural motifs. The protein/domain level included structural classification of proteins, Pfam domains, domain architectures, and protein topologies. The ligand level included ligand conformations, ribose sugar puckering, and the identification of conserved ligand-atom interactions. The family level included phylogenetic analysis.</p><p>We found that SAM bound to a total of 18 different fold types (I-XVIII). We identified 4 new fold types and 11 additional topological arrangements of strands within the well-studied Rossmann fold Methyltransferases (MTases). This extends the existing structural classification of SAM binding proteins. A striking correlation between fold type and the conformation of the bound SAM (classified as types) was found across the 18 fold types. Several site-specific rules were created for the assignment of functional residues to families and proteins that do not have a bound SAM or a solved structure.</p>","PeriodicalId":498,"journal":{"name":"BMC Structural Biology","volume":"13 1","pages":""},"PeriodicalIF":2.2220,"publicationDate":"2013-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/1472-6807-13-6","citationCount":"44","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Structural Biology","FirstCategoryId":"1085","ListUrlMain":"https://link.springer.com/article/10.1186/1472-6807-13-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 44

Abstract

The post-genomic era poses several challenges. The biggest is the identification of biochemical function for protein sequences and structures resulting from genomic initiatives. Most sequences lack a characterized function and are annotated as hypothetical or uncharacterized. While homology-based methods are useful, and work well for sequences with sequence identities above 50%, they fail for sequences in the twilight zone (<30%) of sequence identity. For cases where sequence methods fail, structural approaches are often used, based on the premise that structure preserves function for longer evolutionary time-frames than sequence alone. It is now clear that no single method can be used successfully for functional inference. Given the growing need for functional assignments, we describe here a systematic new approach, designated ligand-centric, which is primarily based on analysis of ligand-bound/unbound structures in the PDB. Results of applying our approach to S-adenosyl-L-methionine (SAM) binding proteins are presented.

Our analysis included 1,224 structures that belong to 172 unique families of the Protein Information Resource Superfamily system. Our ligand-centric approach was divided into four levels: residue, protein/domain, ligand, and family levels. The residue level included the identification of conserved binding site residues based on structure-guided sequence alignments of representative members of a family, and the identification of conserved structural motifs. The protein/domain level included structural classification of proteins, Pfam domains, domain architectures, and protein topologies. The ligand level included ligand conformations, ribose sugar puckering, and the identification of conserved ligand-atom interactions. The family level included phylogenetic analysis.

We found that SAM bound to a total of 18 different fold types (I-XVIII). We identified 4 new fold types and 11 additional topological arrangements of strands within the well-studied Rossmann fold Methyltransferases (MTases). This extends the existing structural classification of SAM binding proteins. A striking correlation between fold type and the conformation of the bound SAM (classified as types) was found across the 18 fold types. Several site-specific rules were created for the assignment of functional residues to families and proteins that do not have a bound SAM or a solved structure.

Abstract Image

s -腺苷- l-蛋氨酸结合蛋白的结构和功能研究:以配体为中心的方法
后基因组时代带来了几个挑战。最大的进步是鉴定由基因组计划产生的蛋白质序列和结构的生化功能。大多数序列缺乏特征函数,被注释为假设的或未特征的。虽然基于同源性的方法是有用的,并且对序列同一性高于50%的序列工作得很好,但它们对序列同一性的模糊区域(<30%)的序列就失效了。对于序列方法失败的情况,通常使用结构方法,其前提是结构比单独使用序列在更长的进化时间框架内保持功能。现在很清楚,没有一种方法可以成功地用于功能推理。鉴于对功能分配的需求日益增长,我们在这里描述了一种系统的新方法,指定配体中心,这主要基于对PDB中配体结合/非结合结构的分析。本文介绍了将我们的方法应用于s -腺苷- l-蛋氨酸(SAM)结合蛋白的结果。我们分析了蛋白质信息资源超家族系统中172个独特家族的1,224个结构。我们以配体为中心的方法分为四个层次:残基、蛋白质/结构域、配体和家族水平。残基水平包括基于结构导向序列比对的保守结合位点残基鉴定,以及保守结构基序的鉴定。蛋白质/结构域水平包括蛋白质的结构分类、Pfam结构域、结构域结构和蛋白质拓扑结构。配体水平包括配体构象、核糖糖皱缩和确定保守的配体-原子相互作用。家族水平包括系统发育分析。我们发现SAM共结合了18种不同的折叠类型(I-XVIII)。我们确定了4种新的折叠类型和11种额外的链的拓扑安排在充分研究的Rossmann折叠甲基转移酶(MTases)。这扩展了SAM结合蛋白的现有结构分类。在18种褶皱类型中发现了褶皱类型与束缚的SAM(分类为类型)的构象之间的显著相关性。为将功能残基分配给没有结合的SAM或解决结构的家族和蛋白质,创建了几个位点特异性规则。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
BMC Structural Biology
BMC Structural Biology 生物-生物物理
CiteScore
3.60
自引率
0.00%
发文量
0
期刊介绍: BMC Structural Biology is an open access, peer-reviewed journal that considers articles on investigations into the structure of biological macromolecules, including solving structures, structural and functional analyses, and computational modeling.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信