Large-scale experimental studies show unexpected amino acid effects on protein expression and solubility in vivo in E. coli.

W Nicholson Price, Samuel K Handelman, John K Everett, Saichiu N Tong, Ana Bracic, Jon D Luff, Victor Naumov, Thomas Acton, Philip Manor, Rong Xiao, Burkhard Rost, Gaetano T Montelione, John F Hunt
Microbial Informatics and Experimentation, 1(1), article 6. Journal article, published 2011-06-27. DOI: 10.1186/2042-5783-1-6 (https://doi.org/10.1186/2042-5783-1-6)
Citations: 28

Abstract

The biochemical and physical factors controlling protein expression level and solubility in vivo remain incompletely characterized. To gain insight into the primary sequence features influencing these outcomes, we performed statistical analyses of results from the high-throughput protein-production pipeline of the Northeast Structural Genomics Consortium. Proteins expressed in E. coli and consistently purified were scored independently for expression and solubility levels. These parameters nonetheless show a very strong positive correlation. We used logistic regressions to determine whether they are systematically influenced by fractional amino acid composition or several bulk sequence parameters including hydrophobicity, sidechain entropy, electrostatic charge, and predicted backbone disorder. Decreasing hydrophobicity correlates with higher expression and solubility levels, but this correlation apparently derives solely from the beneficial effect of three charged amino acids, at least for bacterial proteins. In fact, the three most hydrophobic residues showed very different correlations with solubility level. Leu showed the strongest negative correlation among amino acids, while Ile showed a slightly positive correlation in most data segments. Several other amino acids also had unexpected effects. Notably, Arg correlated with decreased expression and, most surprisingly, solubility of bacterial proteins, an effect only partially attributable to rare codons. However, rare codons did significantly reduce expression despite use of a codon-enhanced strain. Additional analyses suggest that positively but not negatively charged amino acids may reduce translation efficiency in E. coli irrespective of codon usage. While some observed effects may reflect indirect evolutionary correlations, others may reflect basic physicochemical phenomena. 
We used these results to construct and validate predictors of expression and solubility levels and overall protein usability, and we propose new strategies to be explored for engineering improved protein expression and solubility.
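
As a rough illustration of the kind of sequence features the abstract names (fractional amino acid composition and electrostatic charge), the following minimal Python sketch computes them for a single sequence. It is not the authors' pipeline: the function names, the example sequence, and the simplified charge model (Lys and Arg counted as +1, Asp and Glu as -1, His ignored) are all assumptions made here for illustration.

```python
from collections import Counter

# The 20 standard amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def fractional_composition(sequence):
    """Return the fraction of each standard amino acid in a sequence.

    Non-standard characters are ignored (an assumption of this sketch).
    """
    counts = Counter(sequence.upper())
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


def net_charge_per_residue(sequence):
    """Simplified net charge per residue: (K + R) - (D + E) fractions.

    Hypothetical charge model; His and terminal groups are ignored.
    """
    comp = fractional_composition(sequence)
    return comp["K"] + comp["R"] - comp["D"] - comp["E"]


# Hypothetical 10-residue example sequence, not taken from the study.
example = "MKRLLDEEIA"
comp = fractional_composition(example)
print({aa: round(frac, 2) for aa, frac in comp.items() if frac > 0})
print(round(net_charge_per_residue(example), 2))
```

Features like these could then serve as predictors in a logistic regression against expression or solubility scores, which is the type of analysis the abstract describes; the regression itself is omitted here because the scoring data are not part of this record.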

