Analysis of the genetic diversity in RNA-directed RNA polymerase sequences: implications for an automated RNA virus classification system

IF 5.5 2区 医学 Q1 VIROLOGY
Virus Evolution Pub Date : 2024-07-25 DOI:10.1093/ve/veae059
Zhongshuai Tian, Tao Hu, Edward C Holmes, Jingkai Ji, Weifeng Shi
{"title":"Analysis of the genetic diversity in RNA-directed RNA polymerase sequences: implications for an automated RNA virus classification system","authors":"Zhongshuai Tian, Tao Hu, Edward C Holmes, Jingkai Ji, Weifeng Shi","doi":"10.1093/ve/veae059","DOIUrl":null,"url":null,"abstract":"RNA viruses are characterized by a broad host range and high levels of genetic diversity. Despite a recent expansion in the known virosphere following metagenomic sequencing, our knowledge of the species-rank genetic diversity of RNA viruses, and how often they are misassigned and misclassified, is limited. We performed a clustering analysis of 7,801 RNA-directed RNA polymerase (RdRp) sequences representing 1,897 established RNA virus species. From this, we identified substantial genetic divergence within some virus species and inconsistency in RNA virus assignment between the GenBank database and The International Committee on Taxonomy of Viruses (ICTV). In particular, 27.57% virus species were comprised of multiple virus operational taxonomic units (vOTUs), including Alphainfluenzavirus influenzae, Mammarenavirus lassaense, Apple stem pitting virus, and Rotavirus A, with each having over 100 vOTUs. In addition, the distribution of average amino acid identity between vOTUs within single assigned species showed a relatively low threshold: <90%, and sometimes <50%. However, when only exemplar sequences from virus species were analyzed, 1,889 of the ICTV-designated RNA virus species (99.58%) were clustered into a single vOTU. Clustering of RdRp sequences from different virus species also revealed that 17 vOTUs contained two distinct virus species. These potential misassignments were confirmed by phylogenetic analysis. A further analysis of ANI (Average Nucleotide Identity) values ranging from 70% to 97.5% revealed that at ANI of 82.5%, 1559 (82.18%) of the 1,897 virus species could be correctly clustered into one single vOTU. However, at ANI values greater than 82.5%, an increasing number of species were clustered into two or more vOTUs. In sum, we have identified some inconsistency and misassignment of RNA virus species based on the analysis of RdRp sequences alone which has important implications for the development of an automated RNA virus classification system.","PeriodicalId":56026,"journal":{"name":"Virus Evolution","volume":"101 1","pages":""},"PeriodicalIF":5.5000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virus Evolution","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/ve/veae059","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"VIROLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

RNA viruses are characterized by a broad host range and high levels of genetic diversity. Despite a recent expansion in the known virosphere following metagenomic sequencing, our knowledge of the species-rank genetic diversity of RNA viruses, and how often they are misassigned and misclassified, is limited. We performed a clustering analysis of 7,801 RNA-directed RNA polymerase (RdRp) sequences representing 1,897 established RNA virus species. From this, we identified substantial genetic divergence within some virus species and inconsistency in RNA virus assignment between the GenBank database and The International Committee on Taxonomy of Viruses (ICTV). In particular, 27.57% virus species were comprised of multiple virus operational taxonomic units (vOTUs), including Alphainfluenzavirus influenzae, Mammarenavirus lassaense, Apple stem pitting virus, and Rotavirus A, with each having over 100 vOTUs. In addition, the distribution of average amino acid identity between vOTUs within single assigned species showed a relatively low threshold: <90%, and sometimes <50%. However, when only exemplar sequences from virus species were analyzed, 1,889 of the ICTV-designated RNA virus species (99.58%) were clustered into a single vOTU. Clustering of RdRp sequences from different virus species also revealed that 17 vOTUs contained two distinct virus species. These potential misassignments were confirmed by phylogenetic analysis. A further analysis of ANI (Average Nucleotide Identity) values ranging from 70% to 97.5% revealed that at ANI of 82.5%, 1559 (82.18%) of the 1,897 virus species could be correctly clustered into one single vOTU. However, at ANI values greater than 82.5%, an increasing number of species were clustered into two or more vOTUs. In sum, we have identified some inconsistency and misassignment of RNA virus species based on the analysis of RdRp sequences alone which has important implications for the development of an automated RNA virus classification system.
RNA 定向 RNA 聚合酶序列的遗传多样性分析:对 RNA 病毒自动分类系统的影响
RNA 病毒具有宿主范围广、遗传多样性高的特点。尽管随着元基因组测序技术的发展,最近已知的病毒范围有所扩大,但我们对 RNA 病毒的种级遗传多样性以及它们被错误分类和归类的频率的了解仍然有限。我们对代表 1,897 个已确定 RNA 病毒物种的 7,801 个 RNA 引导 RNA 聚合酶(RdRp)序列进行了聚类分析。由此,我们发现了一些病毒物种内部存在着巨大的遗传差异,而且 GenBank 数据库与国际病毒分类委员会(ICTV)之间的 RNA 病毒分类也不一致。其中,27.57%的病毒种类由多个病毒操作分类单元(vOTUs)组成,包括流感病毒(Alphainfluenzavirus influenzae)、马马病毒(Mammarenavirus lassaense)、苹果茎点裂病毒(Apple stem pitting virus)和轮状病毒 A(Rotavirus A),每个病毒种类都有超过 100 个 vOTUs。此外,单个指定物种内 vOTU 之间的平均氨基酸同一性分布显示出相对较低的阈值:90%,有时为 50%。然而,当仅分析病毒物种的示例序列时,1 889 个 ICTV 指定的 RNA 病毒物种(99.58%)被聚类到一个 vOTU 中。对来自不同病毒种类的 RdRp 序列进行聚类还发现,有 17 个 vOTU 包含两个不同的病毒种类。系统进化分析证实了这些潜在的错误定位。对介于 70% 到 97.5% 之间的 ANI(平均核苷酸同一性)值的进一步分析表明,当 ANI 值为 82.5% 时,1,897 个病毒物种中有 1559 个(82.18%)可以正确地聚类成一个单一的 vOTU。然而,当 ANI 值大于 82.5%时,越来越多的物种被聚类到两个或更多的 vOTU 中。总之,我们发现了一些仅基于 RdRp 序列分析的 RNA 病毒物种的不一致性和错误分类,这对开发 RNA 病毒自动分类系统具有重要意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Virus Evolution
Virus Evolution Immunology and Microbiology-Microbiology
CiteScore
10.50
自引率
5.70%
发文量
108
审稿时长
14 weeks
期刊介绍: Virus Evolution is a new Open Access journal focusing on the long-term evolution of viruses, viruses as a model system for studying evolutionary processes, viral molecular epidemiology and environmental virology. The aim of the journal is to provide a forum for original research papers, reviews, commentaries and a venue for in-depth discussion on the topics relevant to virus evolution.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信