Expectations from structural genomics revisited: an analysis of structural genomics targets.

Mansoor A S Saqi, David L Wild
{"title":"Expectations from structural genomics revisited: an analysis of structural genomics targets.","authors":"Mansoor A S Saqi,&nbsp;David L Wild","doi":"10.2165/00129785-200505050-00006","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Current structural genomics projects are being driven by two main goals; to produce a representative set of protein folds that could be used as templates for comparative modeling purposes, and to provide insight into the function of the currently unannotated protein sequences. Such projects may reveal that a newly determined protein structure shares structural similarity with a previously observed structure or that it is a novel fold. The manner in which structure can be used to suggest the function of a protein will depend on the number and diversity of homologous sequences and the extent to which these sequences are functionally characterized.</p><p><strong>Method and results: </strong>Using sequence searching methods, we analyzed structural genomics target sequences to ascertain if they were members of functionally characterized protein families, protein families of unknown function, or orphan sequences. This analysis provided an indication of what could be expected to emerge from structural genomics projects. Matches were found to approximately 25% of the current functionally unannotated protein families in the PFAM database (protein families database of alignments and hidden Markov models). The 16% of strict orphan sequences will be the most problematic if their structures reveal novel folds. However, out of the remaining target sequences that match families whose members are largely of unknown function, 28% are particularly interesting in that they are part of protein families with considerable sequence diversity.</p><p><strong>Conclusion: </strong>The determination of a new structure of a member of these families is likely to offer considerable insight into possible functional roles of these proteins even if it is a new fold. Mapping the sequence conservation onto the structure may reveal functionally important residues for further study by experimental methods.</p>","PeriodicalId":72171,"journal":{"name":"American journal of pharmacogenomics : genomics-related research in drug development and clinical practice","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2005-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.2165/00129785-200505050-00006","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of pharmacogenomics : genomics-related research in drug development and clinical practice","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2165/00129785-200505050-00006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Background: Current structural genomics projects are being driven by two main goals; to produce a representative set of protein folds that could be used as templates for comparative modeling purposes, and to provide insight into the function of the currently unannotated protein sequences. Such projects may reveal that a newly determined protein structure shares structural similarity with a previously observed structure or that it is a novel fold. The manner in which structure can be used to suggest the function of a protein will depend on the number and diversity of homologous sequences and the extent to which these sequences are functionally characterized.

Method and results: Using sequence searching methods, we analyzed structural genomics target sequences to ascertain if they were members of functionally characterized protein families, protein families of unknown function, or orphan sequences. This analysis provided an indication of what could be expected to emerge from structural genomics projects. Matches were found to approximately 25% of the current functionally unannotated protein families in the PFAM database (protein families database of alignments and hidden Markov models). The 16% of strict orphan sequences will be the most problematic if their structures reveal novel folds. However, out of the remaining target sequences that match families whose members are largely of unknown function, 28% are particularly interesting in that they are part of protein families with considerable sequence diversity.

Conclusion: The determination of a new structure of a member of these families is likely to offer considerable insight into possible functional roles of these proteins even if it is a new fold. Mapping the sequence conservation onto the structure may reveal functionally important residues for further study by experimental methods.

重新审视结构基因组学的期望:结构基因组学目标的分析。
背景:当前的结构基因组学项目主要由两个目标驱动;为了产生一组具有代表性的蛋白质折叠,可以用作比较建模目的的模板,并提供对当前未注释的蛋白质序列的功能的见解。这些项目可能揭示新确定的蛋白质结构与先前观察到的结构具有结构相似性,或者它是一个新的折叠。结构可用于提示蛋白质功能的方式将取决于同源序列的数量和多样性以及这些序列在功能上表征的程度。方法与结果:采用序列搜索方法,分析结构基因组学目标序列,确定它们是功能特征蛋白家族成员,功能未知蛋白家族成员,还是孤儿序列。这一分析为结构基因组学项目提供了一个可能出现的迹象。在PFAM数据库(比对和隐马尔可夫模型的蛋白质家族数据库)中,发现了大约25%的当前功能未注释的蛋白质家族的匹配。如果16%的严格孤儿序列的结构显示出新的褶皱,那么它们将是最有问题的。然而,在剩余的目标序列中,匹配的家族成员大部分功能未知,其中28%特别有趣,因为它们是具有相当序列多样性的蛋白质家族的一部分。结论:这些家族成员的新结构的确定可能为这些蛋白质可能的功能作用提供相当大的见解,即使它是一个新的折叠。将序列保守性映射到结构上可以揭示重要的功能残基,为进一步的实验方法研究提供依据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信