Víctor Eduardo Rojas-Pérez , Eduardo Parra Villegas
{"title":"Dataset and analysis of genetic variability of Severe Acute Respiratory Syndrome Coronavirus 2 from different areas of the respiratory tract","authors":"Víctor Eduardo Rojas-Pérez , Eduardo Parra Villegas","doi":"10.1016/j.mex.2025.103369","DOIUrl":null,"url":null,"abstract":"<div><div>Shannon entropy data were obtained from whole genome sequences of SARS-CoV-2 virus from COVID-19 positive patients from nasopharyngeal and tracheal aspirate samples, these sequences were downloaded from the NCBI public database. The study included 57 genomic sequences of the SARS-CoV-2 virus. A total of 57 samples (<em>n</em> = 30 oronasopharyngeal and <em>n</em> = 27 tracheal aspirates) underwent bioinformatics-based analysis with the objective of determining the extent of viral variability. These sequences were used to perform a genetic variability analysis of the virus with the H(x) function of the BioEdit program and using the Wuhan-Hu-1 reference genome. The generated Shannon entropy data were filtered to comparatively determine the magnitudes of genetic variability between sequences from nasopharyngeal and tracheal aspirate samples using an R script.<ul><li><span>•</span><span><div>This paper proposes a workflow for the bionformatic analysis of SARS-CoV-2 genetic variation from different respiratory system samples.</div></span></li><li><span>•</span><span><div>This bioinformatics workflow could help to differentiate changes in the evolution of the viral population in different compartments of the respiratory system.</div></span></li></ul></div></div>","PeriodicalId":18446,"journal":{"name":"MethodsX","volume":"14 ","pages":"Article 103369"},"PeriodicalIF":1.6000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"MethodsX","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2215016125002158","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Shannon entropy data were obtained from whole genome sequences of SARS-CoV-2 virus from COVID-19 positive patients from nasopharyngeal and tracheal aspirate samples, these sequences were downloaded from the NCBI public database. The study included 57 genomic sequences of the SARS-CoV-2 virus. A total of 57 samples (n = 30 oronasopharyngeal and n = 27 tracheal aspirates) underwent bioinformatics-based analysis with the objective of determining the extent of viral variability. These sequences were used to perform a genetic variability analysis of the virus with the H(x) function of the BioEdit program and using the Wuhan-Hu-1 reference genome. The generated Shannon entropy data were filtered to comparatively determine the magnitudes of genetic variability between sequences from nasopharyngeal and tracheal aspirate samples using an R script.
•
This paper proposes a workflow for the bionformatic analysis of SARS-CoV-2 genetic variation from different respiratory system samples.
•
This bioinformatics workflow could help to differentiate changes in the evolution of the viral population in different compartments of the respiratory system.