Zachary Lozier, Lilyahna Hill, Elizabeth Semmann, W. Allen Miller
{"title":"Frontiers | A proposed new Tombusviridae genus featuring extremely long 5' untranslated regions and a luteo/polerovirus-like gene block","authors":"Zachary Lozier, Lilyahna Hill, Elizabeth Semmann, W. Allen Miller","doi":"10.3389/fviro.2024.1422934","DOIUrl":null,"url":null,"abstract":"Tombusviridae is a large family of single-stranded, positive-sense RNA plant viruses with uncapped, non-polyadenylated genomes encoding 4–7 open reading frames (ORFs). Previously, we discovered, by high-throughput sequencing of maize and teosinte RNA, a novel genome of a virus we call Maize-associated tombusvirus (MaTV). Here we determined the precise termini of the MaTV genome by using 5’ and 3’ rapid amplification of cDNA ends (RACE). In GenBank, we discovered eleven other nearly complete viral genomes with MaTV-like genome organizations and related RNA-dependent RNA polymerase (RdRp) sequences. These genomes came from diverse plant, fungal, invertebrate and vertebrate organisms, and some have been found in multiple organisms across the globe. The available 5’ untranslated regions (UTRs) of these genomes are remarkably long: at least 438 to 727 nucleotides (nt), in contrast to those of other tombusvirids, which are <150 nt. Moreover these UTRs contain 6 to 12 AUG triplets that are unlikely to be start codons, because - with the possible exception of MaTV - there are no large or conserved ORFs in the 5’ UTRs. Such features suggest an internal ribosome entry site (IRES), but the only conserved features we found were that the 50 nt upstream of and adjacent to the ORF1 start codon are cytosine-rich and guanosine-poor. ORF2 (RdRp gene) appears to be translated by in-frame ribosomal readthrough of the ORF1 stop codon. In all twelve genomes we identified RNA structures known in other tombusvirids to facilitate this readthrough. ORF4 overlaps with ORF3 (coat protein gene) and may initiate with a non-AUG start codon. ORF5 is predicted to be translated by readthrough of the ORF3 stop codon. The proteins encoded by ORFs 4 and 5 diverge highly from each other and from those of the similarly organized luteo- and poleroviruses. We also found no obvious 3’ cap-independent translation elements, which are present in other tombusvirids. The twelve genomes diverge sufficiently from other tombusvirids to warrant classification in a new genus. Because they contain two leaky stop codons and a potential leaky start codon, we propose to name this genus Rimosavirus (rimosa = leaky in Latin).","PeriodicalId":73114,"journal":{"name":"Frontiers in virology","volume":"70 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in virology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fviro.2024.1422934","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"VIROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Tombusviridae is a large family of single-stranded, positive-sense RNA plant viruses with uncapped, non-polyadenylated genomes encoding 4–7 open reading frames (ORFs). Previously, we discovered, by high-throughput sequencing of maize and teosinte RNA, a novel genome of a virus we call Maize-associated tombusvirus (MaTV). Here we determined the precise termini of the MaTV genome by using 5’ and 3’ rapid amplification of cDNA ends (RACE). In GenBank, we discovered eleven other nearly complete viral genomes with MaTV-like genome organizations and related RNA-dependent RNA polymerase (RdRp) sequences. These genomes came from diverse plant, fungal, invertebrate and vertebrate organisms, and some have been found in multiple organisms across the globe. The available 5’ untranslated regions (UTRs) of these genomes are remarkably long: at least 438 to 727 nucleotides (nt), in contrast to those of other tombusvirids, which are <150 nt. Moreover these UTRs contain 6 to 12 AUG triplets that are unlikely to be start codons, because - with the possible exception of MaTV - there are no large or conserved ORFs in the 5’ UTRs. Such features suggest an internal ribosome entry site (IRES), but the only conserved features we found were that the 50 nt upstream of and adjacent to the ORF1 start codon are cytosine-rich and guanosine-poor. ORF2 (RdRp gene) appears to be translated by in-frame ribosomal readthrough of the ORF1 stop codon. In all twelve genomes we identified RNA structures known in other tombusvirids to facilitate this readthrough. ORF4 overlaps with ORF3 (coat protein gene) and may initiate with a non-AUG start codon. ORF5 is predicted to be translated by readthrough of the ORF3 stop codon. The proteins encoded by ORFs 4 and 5 diverge highly from each other and from those of the similarly organized luteo- and poleroviruses. We also found no obvious 3’ cap-independent translation elements, which are present in other tombusvirids. The twelve genomes diverge sufficiently from other tombusvirids to warrant classification in a new genus. Because they contain two leaky stop codons and a potential leaky start codon, we propose to name this genus Rimosavirus (rimosa = leaky in Latin).