Incorporation of transition to transversion ratio and nonsense mutations, improves the estimation of the number of synonymous and non-synonymous sites in codons.

Ruksana Aziz, Piyali Sen, Pratyush Kumar Beura, Saurav Das, Debapriya Tula, Madhusmita Dash, Nima Dondu Namsa, Ramesh Chandra Deka, Edward J Feil, Siddhartha Sankar Satapathy, Suvendra Kumar Ray
Journal: DNA Research: An International Journal for Rapid Publication of Reports on Genes and Genomes
Publication date: 2022-06-25 (Journal Article)
DOI: 10.1093/dnares/dsac023 (https://doi.org/10.1093/dnares/dsac023)
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9358017/pdf/
Citations: 3

Abstract

A common approach to estimating the strength and direction of selection acting on protein-coding sequences is to calculate the dN/dS ratio. Since Nei and Gojobori proposed it in 1986, the method for calculating dN/dS has been widely used, and its application has been critically reviewed many times. However, the method is still evolving to account for non-uniform substitution rates and pretermination codons. In our study of SNPs in 586 genes across 156 Escherichia coli strains, synonymous polymorphism was higher in 2-fold degenerate codons than in 4-fold degenerate codons. This could be attributed to the difference between transition (Ti) and transversion (Tv) substitution rates: in general, the average rate of a transition is four times more than that of a transversion. We considered both the Ti/Tv ratio and nonsense mutations in pretermination codons to improve estimates of the number of synonymous (S) and non-synonymous (NS) sites, which in turn improved the accuracy of the dN/dS estimates. Applying the modified approach resulted in higher dN/dS values in 29 common genes with equal reading frames between E. coli and Salmonella enterica. This study emphasizes the robustness of amino acid composition with varying codon degeneracy, as well as the role of pretermination codons, when calculating dN/dS values.
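The idea of weighting site counts by the Ti/Tv ratio and tracking nonsense changes separately can be illustrated with a minimal sketch. This is not the authors' exact formula: the function name `site_counts`, the parameter `kappa` (the assumed transition-to-transversion rate ratio), and the per-position normalisation are assumptions made here for illustration, using the standard genetic code. With `kappa=1.0` every single-nucleotide change is weighted equally.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Transitions are purine<->purine (A<->G) or pyrimidine<->pyrimidine (C<->T) changes.
TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "T"), ("T", "C")}


def site_counts(codon, kappa=1.0):
    """Return (synonymous, non-synonymous, nonsense) site counts for a sense codon.

    Hypothetical helper: each codon position contributes one site in total,
    split among the three categories in proportion to the weights of its
    three possible changes (kappa for a transition, 1 for a transversion).
    """
    if CODON_TABLE[codon] == "*":
        raise ValueError("expected a sense codon")
    counts = {"syn": 0.0, "nonsyn": 0.0, "nonsense": 0.0}
    for pos, ref in enumerate(codon):
        weights = {"syn": 0.0, "nonsyn": 0.0, "nonsense": 0.0}
        for alt in BASES:
            if alt == ref:
                continue
            mutant = codon[:pos] + alt + codon[pos + 1:]
            weight = kappa if (ref, alt) in TRANSITIONS else 1.0
            if CODON_TABLE[mutant] == "*":
                kind = "nonsense"  # change creates a stop codon
            elif CODON_TABLE[mutant] == CODON_TABLE[codon]:
                kind = "syn"
            else:
                kind = "nonsyn"
            weights[kind] += weight
        total = sum(weights.values())
        for kind in counts:
            counts[kind] += weights[kind] / total
    return counts["syn"], counts["nonsyn"], counts["nonsense"]


if __name__ == "__main__":
    # TTT: 2-fold degenerate, GGT: 4-fold degenerate, TAT: pretermination codon.
    for codon in ("TTT", "GGT", "TAT"):
        for kappa in (1.0, 4.0):
            print(codon, kappa, site_counts(codon, kappa))
```

Under these assumptions, the synonymous count of the 2-fold degenerate codon TTT rises from 1/3 to 2/3 when `kappa` goes from 1 to 4, because its only synonymous change is a transition, while the 4-fold degenerate codon GGT stays at 1. For the pretermination codon TAT, the changes to TAA and TAG are counted as nonsense rather than being folded into the non-synonymous sites.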
