Christina Kirschbaum , Kunaphas Kongkitimanon , Stefan Frank , Martin Hölzer , Sofia Paraskevopoulou , Hugues Richard
{"title":"VirusWarn: A mutation-based early warning system to prioritize concerning SARS-CoV-2 and influenza virus variants from sequencing data","authors":"Christina Kirschbaum , Kunaphas Kongkitimanon , Stefan Frank , Martin Hölzer , Sofia Paraskevopoulou , Hugues Richard","doi":"10.1016/j.csbj.2025.03.010","DOIUrl":null,"url":null,"abstract":"<div><div>The rapid evolution of respiratory viruses is characterized by the emergence of variants with concerning phenotypes that are efficient in antibody escape or show high transmissibility. This necessitates timely identification of such variants by surveillance networks to assist public health interventions. Here, we introduce <em>VirusWarn</em>, a comprehensive system designed for detecting, prioritizing, and warning of emerging virus variants from large genomic datasets. VirusWarn uses both manually-curated rules and machine-learning (ML) classifiers to generate and rank pathogen sequences based on mutations of concern and regions of interest. Validation results for SARS-CoV-2 showed that VirusWarn successfully identifies variants of concern in both assessments, with manual- and ML-derived criteria from positive selection analyses. Although initially developed for SARS-CoV-2, VirusWarn was adapted to Influenza viruses and their dynamics, and provides a robust performance, integrating a scheme that accounts for fixed mutations from past seasons. HTML reports provide detailed results with searchable tables and visualizations, including mutation plots and heatmaps. Because VirusWarn is written in Nextflow, it can be easily adapted to other pathogens, demonstrating its flexibility and scalability for genomic surveillance efforts.</div></div>","PeriodicalId":10715,"journal":{"name":"Computational and structural biotechnology journal","volume":"27 ","pages":"Pages 1081-1088"},"PeriodicalIF":4.4000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational and structural biotechnology journal","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2001037025000790","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
The rapid evolution of respiratory viruses is characterized by the emergence of variants with concerning phenotypes that are efficient in antibody escape or show high transmissibility. This necessitates timely identification of such variants by surveillance networks to assist public health interventions. Here, we introduce VirusWarn, a comprehensive system designed for detecting, prioritizing, and warning of emerging virus variants from large genomic datasets. VirusWarn uses both manually-curated rules and machine-learning (ML) classifiers to generate and rank pathogen sequences based on mutations of concern and regions of interest. Validation results for SARS-CoV-2 showed that VirusWarn successfully identifies variants of concern in both assessments, with manual- and ML-derived criteria from positive selection analyses. Although initially developed for SARS-CoV-2, VirusWarn was adapted to Influenza viruses and their dynamics, and provides a robust performance, integrating a scheme that accounts for fixed mutations from past seasons. HTML reports provide detailed results with searchable tables and visualizations, including mutation plots and heatmaps. Because VirusWarn is written in Nextflow, it can be easily adapted to other pathogens, demonstrating its flexibility and scalability for genomic surveillance efforts.
期刊介绍:
Computational and Structural Biotechnology Journal (CSBJ) is an online gold open access journal publishing research articles and reviews after full peer review. All articles are published, without barriers to access, immediately upon acceptance. The journal places a strong emphasis on functional and mechanistic understanding of how molecular components in a biological process work together through the application of computational methods. Structural data may provide such insights, but they are not a pre-requisite for publication in the journal. Specific areas of interest include, but are not limited to:
Structure and function of proteins, nucleic acids and other macromolecules
Structure and function of multi-component complexes
Protein folding, processing and degradation
Enzymology
Computational and structural studies of plant systems
Microbial Informatics
Genomics
Proteomics
Metabolomics
Algorithms and Hypothesis in Bioinformatics
Mathematical and Theoretical Biology
Computational Chemistry and Drug Discovery
Microscopy and Molecular Imaging
Nanotechnology
Systems and Synthetic Biology