{"title":"Microsatellite diversity and complexity in the viral genomes of the family Caliciviridae.","authors":"Md Gulam Jilani, Mehboob Hoque, Safdar Ali","doi":"10.1186/s43141-023-00582-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Microsatellites or simple sequence repeats (SSR) consist of 1-6 nucleotide motifs of DNA or RNA which are ubiquitously present in tandem repeated sequences across genome in viruses: prokaryotes and eukaryotes. They may be localized to both the coding and non-coding regions. SSRs play an important role in replication, gene regulation, transcription, and protein function. The Caliciviridae (CLV) family of viruses have ss-RNA, non-enveloped, icosahedral symmetry 27-35 nm in diameter in size. The size of the genome lies between 6.4 and 8.6 kb.</p><p><strong>Results: </strong>The incidence, composition, diversity, complexity, and host range of different microsatellites in 62 representatives of the family of Caliciviridae were systematically analyzed. The full-length genome sequences were assessed from NCBI ( https://www.ncbi.nlm.nih.gov ), and microsatellites were extracted through MISA software. The average genome size is about 7538 bp ranging from 6273 (CLV61) to 8798 (CLV47) bp. The average GC content of the genomes was ~ 51%. There are a total of 1317 SSRs and 53 cSSRs in the studied genomes. CLV 41 and CLV 49 contain the highest and lowest value of SSRs with 32 and 10 respectively, while CLV16 had maximum cSSR incidence of 4. There were 29 species which do not contain any cSSR. The incidence of mono-, di-, and tri-nucleotide SSRs was 219, 884, and 206, respectively. The most prevalent mono-, di-, and tri-nucleotide repeat motifs were \"C\" (126 SSRs), AC/CA (240 SSRs), and TGA/ACT (23 SSRs), respectively. Most of the SSRs and cSSRs are biased toward the coding region with a minimum of ~ 90% incident SSRs in the genomes' coding region. Viruses with similar host are found close to each other on the phylogenetic tree suggesting virus host being one of the driving forces for their evolution.</p><p><strong>Conclusions: </strong>The Caliciviridae genomes does not conform to any pattern of SSR signature in terms of incidence, composition, and localization. This unique property of SSR plays an important role in viral evolution. Clustering of similar host in the phylogenetic tree is the evidence of the uniqueness of SSR signature.</p>","PeriodicalId":74026,"journal":{"name":"Journal, genetic engineering & biotechnology","volume":"21 1","pages":"140"},"PeriodicalIF":3.6000,"publicationDate":"2023-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10673786/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal, genetic engineering & biotechnology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s43141-023-00582-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Microsatellites or simple sequence repeats (SSR) consist of 1-6 nucleotide motifs of DNA or RNA which are ubiquitously present in tandem repeated sequences across genome in viruses: prokaryotes and eukaryotes. They may be localized to both the coding and non-coding regions. SSRs play an important role in replication, gene regulation, transcription, and protein function. The Caliciviridae (CLV) family of viruses have ss-RNA, non-enveloped, icosahedral symmetry 27-35 nm in diameter in size. The size of the genome lies between 6.4 and 8.6 kb.
Results: The incidence, composition, diversity, complexity, and host range of different microsatellites in 62 representatives of the family of Caliciviridae were systematically analyzed. The full-length genome sequences were assessed from NCBI ( https://www.ncbi.nlm.nih.gov ), and microsatellites were extracted through MISA software. The average genome size is about 7538 bp ranging from 6273 (CLV61) to 8798 (CLV47) bp. The average GC content of the genomes was ~ 51%. There are a total of 1317 SSRs and 53 cSSRs in the studied genomes. CLV 41 and CLV 49 contain the highest and lowest value of SSRs with 32 and 10 respectively, while CLV16 had maximum cSSR incidence of 4. There were 29 species which do not contain any cSSR. The incidence of mono-, di-, and tri-nucleotide SSRs was 219, 884, and 206, respectively. The most prevalent mono-, di-, and tri-nucleotide repeat motifs were "C" (126 SSRs), AC/CA (240 SSRs), and TGA/ACT (23 SSRs), respectively. Most of the SSRs and cSSRs are biased toward the coding region with a minimum of ~ 90% incident SSRs in the genomes' coding region. Viruses with similar host are found close to each other on the phylogenetic tree suggesting virus host being one of the driving forces for their evolution.
Conclusions: The Caliciviridae genomes does not conform to any pattern of SSR signature in terms of incidence, composition, and localization. This unique property of SSR plays an important role in viral evolution. Clustering of similar host in the phylogenetic tree is the evidence of the uniqueness of SSR signature.