Insilico Codon Bias Correction for Transgenic Biological Protein Sequences for Vaccine Production

Mohamed Abuelanin, Mohamed Fares, M. El-Hadidi
{"title":"Insilico Codon Bias Correction for Transgenic Biological Protein Sequences for Vaccine Production","authors":"Mohamed Abuelanin, Mohamed Fares, M. El-Hadidi","doi":"10.1109/NILES50944.2020.9257881","DOIUrl":null,"url":null,"abstract":"Codon optimization is primarily used in enhancing the levels of protein expression in the host species. Each species has its own codon usage bias, which represents the codons abundance frequency in that species. Using the host usage profile contributes to personalize the synthesis of the DNA vaccines that can achieve highly active vectors the host cells. For optimizing protein expression levels in a particular host, the genetic code sequence needs correction of codon frequency bias to match the expression of host codon landscape rather than the donating organism profile. In this work, we have applied two approaches for optimizing codon usage in protein-coding sequences. The first approach adopts a substitution-based method to replace less frequent codons with differentially higher frequency codons at the specific codon usage bias tables. The second approach finds and replaces the maximal exact protein matches between the unoptimized sequence and the host proteome. We evaluated our work by optimizing the Avian Influenza H1N1 virus’s HA gene to maximize the protein expression before synthesizing the DNA vaccine. Our method produced optimized sequences with higher GC content by 17%, which is similar to eukaryotic sequence profiles than the viral usage profiles, allowing for better expression in avian host cells.","PeriodicalId":253090,"journal":{"name":"2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 2nd Novel Intelligent and Leading Emerging Sciences Conference (NILES)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NILES50944.2020.9257881","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Codon optimization is primarily used in enhancing the levels of protein expression in the host species. Each species has its own codon usage bias, which represents the codons abundance frequency in that species. Using the host usage profile contributes to personalize the synthesis of the DNA vaccines that can achieve highly active vectors the host cells. For optimizing protein expression levels in a particular host, the genetic code sequence needs correction of codon frequency bias to match the expression of host codon landscape rather than the donating organism profile. In this work, we have applied two approaches for optimizing codon usage in protein-coding sequences. The first approach adopts a substitution-based method to replace less frequent codons with differentially higher frequency codons at the specific codon usage bias tables. The second approach finds and replaces the maximal exact protein matches between the unoptimized sequence and the host proteome. We evaluated our work by optimizing the Avian Influenza H1N1 virus’s HA gene to maximize the protein expression before synthesizing the DNA vaccine. Our method produced optimized sequences with higher GC content by 17%, which is similar to eukaryotic sequence profiles than the viral usage profiles, allowing for better expression in avian host cells.
用于疫苗生产的转基因生物蛋白序列的Insilico密码子偏差校正
密码子优化主要用于提高宿主物种的蛋白表达水平。每个物种都有自己的密码子使用偏好,这代表了该物种的密码子丰度频率。利用宿主使用概况有助于个性化DNA疫苗的合成,从而实现宿主细胞的高活性载体。为了优化蛋白质在特定宿主中的表达水平,遗传密码序列需要纠正密码子频率偏差,以匹配宿主密码子景观的表达,而不是供体生物的特征。在这项工作中,我们应用了两种方法来优化密码子在蛋白质编码序列中的使用。第一种方法采用基于替换的方法,在特定密码子使用偏倚表中用频率差异较高的密码子替换频率较低的密码子。第二种方法是寻找和替换未优化序列与宿主蛋白质组之间的最大精确蛋白质匹配。在合成DNA疫苗之前,我们通过优化禽流感H1N1病毒HA基因来最大化蛋白表达来评估我们的工作。我们的方法产生的优化序列GC含量比病毒使用谱高17%,与真核生物序列谱相似,可以在鸟类宿主细胞中更好地表达。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信