Simple sequence repeats in different genome sequences of Shigella and comparison with high GC and AT-rich genomes.

Ashraf Hosseini, Suvidya H Ranade, Indira Ghosh, Pramod Khandekar
{"title":"Simple sequence repeats in different genome sequences of Shigella and comparison with high GC and AT-rich genomes.","authors":"Ashraf Hosseini,&nbsp;Suvidya H Ranade,&nbsp;Indira Ghosh,&nbsp;Pramod Khandekar","doi":"10.1080/10425170701461730","DOIUrl":null,"url":null,"abstract":"<p><p>Simple sequence repeats (SSRs) are omnipresent in prokaryotes and eukaryotes, and are found anywhere in the genome in both protein encoding and noncoding regions. In present study the whole genome sequences of seven chromosomes (Shigella flexneri 2a str301 and 2457T, Shigella sonnei, Escherichia coli k12, Mycobacterium tuberculosis, Mycobacterium leprae and Staphylococcus saprophyticus) have downloaded from the GenBank database for identifying abundance, distribution and composition of SSRs and also to determine difference between the tandem repeats in real genome and randomness genome (using sequence shuffling tool) of the organisms included in this study. The data obtained in the present study show that: (i) tandem repeats are widely distributed throughout the genomes; (ii) SSRs are differentially distributed among coding and noncoding regions in investigated Shigella genomes; (iii) total frequency of SSRs in noncoding regions are higher than coding regions; (iv) in all investigated chromosomes ratio of Trinucleotide SSRs in real genomes are much higher than randomness genomes and Di nucleotide SSRs are lower; (v) Ratio of total and mononucleotide SSRs in real genome is higher than randomness genomes in E. coli K12, S. flexneri str 301 and S. saprophyticus, while it is lower in S. flexneri str 2457T, S.sonnei and M. tuberculosis and it is approximately same in M. leprae; (vi) frequency of codon repetitions are vary considerably depending on the type of encoded amino acids.</p>","PeriodicalId":11197,"journal":{"name":"DNA sequence : the journal of DNA sequencing and mapping","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2008-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1080/10425170701461730","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DNA sequence : the journal of DNA sequencing and mapping","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/10425170701461730","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Simple sequence repeats (SSRs) are omnipresent in prokaryotes and eukaryotes, and are found anywhere in the genome in both protein encoding and noncoding regions. In present study the whole genome sequences of seven chromosomes (Shigella flexneri 2a str301 and 2457T, Shigella sonnei, Escherichia coli k12, Mycobacterium tuberculosis, Mycobacterium leprae and Staphylococcus saprophyticus) have downloaded from the GenBank database for identifying abundance, distribution and composition of SSRs and also to determine difference between the tandem repeats in real genome and randomness genome (using sequence shuffling tool) of the organisms included in this study. The data obtained in the present study show that: (i) tandem repeats are widely distributed throughout the genomes; (ii) SSRs are differentially distributed among coding and noncoding regions in investigated Shigella genomes; (iii) total frequency of SSRs in noncoding regions are higher than coding regions; (iv) in all investigated chromosomes ratio of Trinucleotide SSRs in real genomes are much higher than randomness genomes and Di nucleotide SSRs are lower; (v) Ratio of total and mononucleotide SSRs in real genome is higher than randomness genomes in E. coli K12, S. flexneri str 301 and S. saprophyticus, while it is lower in S. flexneri str 2457T, S.sonnei and M. tuberculosis and it is approximately same in M. leprae; (vi) frequency of codon repetitions are vary considerably depending on the type of encoded amino acids.

志贺氏菌不同基因组序列的简单序列重复及与高GC和at富集基因组的比较。
简单序列重复序列(SSRs)在原核生物和真核生物中普遍存在,在蛋白质编码区和非编码区都可以找到。本研究从GenBank数据库下载了7条染色体(福氏志贺氏菌2a str301和2457T、索内志贺氏菌、大肠杆菌k12、结核分枝杆菌、麻风分枝杆菌和腐生葡萄球菌)的全基因组序列进行丰度鉴定。SSRs的分布和组成,以及确定本研究生物真实基因组和随机基因组串联重复序列的差异(使用序列洗牌工具)。本研究的数据表明:(1)串联重复序列在整个基因组中广泛分布;(ii)在所研究的志贺氏菌基因组中,SSRs在编码区和非编码区分布存在差异;(iii)非编码区SSRs的总频率高于编码区;(iv)在所有被调查的染色体中,三核苷酸SSRs在真实基因组中的比例远高于随机基因组,二核苷酸SSRs的比例较低;(v)真实基因组中总ssr与单核苷酸ssr的比值在大肠杆菌K12、福氏沙门氏菌str 301和腐生沙门氏菌中高于随机基因组,在福氏沙门氏菌str 2457T、索内沙门氏菌和结核分枝杆菌中低于随机基因组,在麻风分枝杆菌中基本相同;(六)密码子重复的频率根据编码氨基酸的类型有很大的不同。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信