VITAP: a high precision tool for DNA and RNA viral classification based on meta-omic data
Kaiyang Zheng, Jianhua Sun, Yantao Liang, Liangliang Kong, David Paez-Espino, Andrew Mcminn, Min Wang
Nature Communications, published 2025-03-05. DOI: 10.1038/s41467-025-57500-7 (https://doi.org/10.1038/s41467-025-57500-7)
Abstract
The rapid growth in the number of newly identified DNA and RNA viral sequences underscores the need for an accurate and comprehensive classification system covering all viral realms at different taxonomic levels. Here, we establish the Viral Taxonomic Assignment Pipeline (VITAP), which addresses classification challenges by integrating alignment-based techniques with graphs, offering high precision in classifying both DNA and RNA viral sequences and providing a confidence level for each taxonomic unit. The tool automatically updates its database in sync with the latest references from the International Committee on Taxonomy of Viruses (ICTV) and efficiently classifies viral sequences as short as 1,000 base pairs to the genus level. VITAP generalizes well, maintaining accuracy comparable to other pipelines while achieving higher annotation rates across most DNA and RNA viral phyla. Its application to deep-sea viromes has led to significant taxonomic updates, providing comprehensive information on the diversity of deep-sea viruses. VITAP is available at https://github.com/DrKaiyangZheng/VITAP.
About the journal:
Nature Communications, an open-access journal, publishes high-quality research spanning all areas of the natural sciences. Papers featured in the journal showcase significant advances relevant to specialists in each respective field. With a 2-year impact factor of 16.6 (2022) and a median time of 8 days from submission to the first editorial decision, Nature Communications is committed to rapid dissemination of research findings. As a multidisciplinary journal, it welcomes contributions from biological, health, physical, chemical, Earth, social, mathematical, applied, and engineering sciences, aiming to highlight important breakthroughs within each domain.