利用新管道“TB- annotator”重建全球结核病历史

IF 2.8 3区 医学 Q3 IMMUNOLOGY
Gaetan Senelle , Muhammed Rabiu Sahal , Kevin La , Typhaine Billard-Pomares , Julie Marin , Faiza Mougari , Antoine Bridier-Nahmias , Etienne Carbonnelle , Emmanuelle Cambau , Guislaine Refrégier , Christophe Guyeux , Christophe Sola
{"title":"利用新管道“TB- annotator”重建全球结核病历史","authors":"Gaetan Senelle ,&nbsp;Muhammed Rabiu Sahal ,&nbsp;Kevin La ,&nbsp;Typhaine Billard-Pomares ,&nbsp;Julie Marin ,&nbsp;Faiza Mougari ,&nbsp;Antoine Bridier-Nahmias ,&nbsp;Etienne Carbonnelle ,&nbsp;Emmanuelle Cambau ,&nbsp;Guislaine Refrégier ,&nbsp;Christophe Guyeux ,&nbsp;Christophe Sola","doi":"10.1016/j.tube.2023.102376","DOIUrl":null,"url":null,"abstract":"<div><p><em>Mycobacterium tuberculosis</em><span> complex (MTBC) has a population structure consisting of 9 human and animal lineages<span>. The genomic diversity within these lineages is a pathogenesis factor that affects virulence, transmissibility, host response, and antibiotic resistance. Hence it is important to develop improved information systems for tracking and understanding the spreading and evolution of genomes. We present results obtained thanks to a new informatics platform for computational biology of MTBC, that uses a convenience sample from public/private SRAs, designated as </span></span><em>TB-Annotator</em><span><span>. Version 1 was a first interactive graphic-based web tool based on 15,901 representative genomes. Version 2, still interactive, is a more sophisticated database, developed using the Snakemake Workflow Management System (WMS) that allows an unsupervised global and scalable analysis of the content of the USA National Center for Biotechnology Information Short Read Archives database. This platform analyzes nucleotide variants, the presence/absence of genes, known regions of difference and detect new deletions, the insertion sites of mobile genetic elements, and allows </span>phylogenetic trees to be built, imported in a graphical interface and interactively analyzed between the data and the tree. The objective of </span><em>TB-Annotator</em> is triple: detect recent epidemiological links, reconstruct distant phylogeographical histories as well as perform more complex phenotypic/genotypic Genome-Wide Association Studies (GWAS). In this paper, we compare the various taxonomic SNPs-based labels and hierarchies previously described in recent reference papers for L1, and present a comparative analysis that allows identification of <em>alias</em> and thus provides the basis of a future unifying naming scheme for L1 sublineages. We present a global phylogenetic tree built with RAxML-NG, and one on L2; at the time of writing, we characterized about 200 sublineages, with many new ones; a detail tree for Modern L2 and a hierarchical scheme allowing to facilitate L2 lineage assignment are also presented.</p></div>","PeriodicalId":23383,"journal":{"name":"Tuberculosis","volume":"143 ","pages":"Article 102376"},"PeriodicalIF":2.8000,"publicationDate":"2023-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards the reconstruction of a global TB history using a new pipeline “TB-Annotator\\\"\",\"authors\":\"Gaetan Senelle ,&nbsp;Muhammed Rabiu Sahal ,&nbsp;Kevin La ,&nbsp;Typhaine Billard-Pomares ,&nbsp;Julie Marin ,&nbsp;Faiza Mougari ,&nbsp;Antoine Bridier-Nahmias ,&nbsp;Etienne Carbonnelle ,&nbsp;Emmanuelle Cambau ,&nbsp;Guislaine Refrégier ,&nbsp;Christophe Guyeux ,&nbsp;Christophe Sola\",\"doi\":\"10.1016/j.tube.2023.102376\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p><em>Mycobacterium tuberculosis</em><span> complex (MTBC) has a population structure consisting of 9 human and animal lineages<span>. The genomic diversity within these lineages is a pathogenesis factor that affects virulence, transmissibility, host response, and antibiotic resistance. Hence it is important to develop improved information systems for tracking and understanding the spreading and evolution of genomes. We present results obtained thanks to a new informatics platform for computational biology of MTBC, that uses a convenience sample from public/private SRAs, designated as </span></span><em>TB-Annotator</em><span><span>. Version 1 was a first interactive graphic-based web tool based on 15,901 representative genomes. Version 2, still interactive, is a more sophisticated database, developed using the Snakemake Workflow Management System (WMS) that allows an unsupervised global and scalable analysis of the content of the USA National Center for Biotechnology Information Short Read Archives database. This platform analyzes nucleotide variants, the presence/absence of genes, known regions of difference and detect new deletions, the insertion sites of mobile genetic elements, and allows </span>phylogenetic trees to be built, imported in a graphical interface and interactively analyzed between the data and the tree. The objective of </span><em>TB-Annotator</em> is triple: detect recent epidemiological links, reconstruct distant phylogeographical histories as well as perform more complex phenotypic/genotypic Genome-Wide Association Studies (GWAS). In this paper, we compare the various taxonomic SNPs-based labels and hierarchies previously described in recent reference papers for L1, and present a comparative analysis that allows identification of <em>alias</em> and thus provides the basis of a future unifying naming scheme for L1 sublineages. We present a global phylogenetic tree built with RAxML-NG, and one on L2; at the time of writing, we characterized about 200 sublineages, with many new ones; a detail tree for Modern L2 and a hierarchical scheme allowing to facilitate L2 lineage assignment are also presented.</p></div>\",\"PeriodicalId\":23383,\"journal\":{\"name\":\"Tuberculosis\",\"volume\":\"143 \",\"pages\":\"Article 102376\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2023-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tuberculosis\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1472979223000744\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"IMMUNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tuberculosis","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1472979223000744","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

结核分枝杆菌复合体(MTBC)的种群结构由9个人类和动物谱系组成。这些谱系中的基因组多样性是影响毒力、传播力、宿主反应和抗生素耐药性的致病因素。因此,开发改进的信息系统来跟踪和理解基因组的传播和进化是很重要的。我们展示的结果得益于一个新的MTBC计算生物学信息学平台,该平台使用了来自公共/私人sra的方便样本,称为TB-Annotator。版本1是第一个基于15901个代表性基因组的交互式图形web工具。版本2,仍然是交互式的,是一个更复杂的数据库,使用Snakemake工作流管理系统(WMS)开发,允许对美国国家生物技术信息中心短读档案数据库的内容进行无监督的全球和可扩展的分析。该平台分析核苷酸变异、基因的存在/缺失、已知的差异区域和检测新的缺失、移动遗传元件的插入位点,并允许建立系统发育树,在图形界面中导入,并在数据和树之间进行交互分析。TB-Annotator的目标有三个:检测最近的流行病学联系,重建遥远的系统地理历史,以及进行更复杂的表型/基因型全基因组关联研究(GWAS)。在本文中,我们比较了最近的参考文献中描述的各种基于snp的L1分类标签和层次结构,并提出了一种允许识别别名的比较分析,从而为L1子谱系的未来统一命名方案提供了基础。我们提出了一个用RAxML-NG构建的全局系统发育树和一个基于L2的系统发育树;在撰写本文时,我们描述了大约200个亚谱系,其中有许多是新的;还提出了现代L2的详细树和允许L2谱系分配的分层方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Towards the reconstruction of a global TB history using a new pipeline “TB-Annotator"

Mycobacterium tuberculosis complex (MTBC) has a population structure consisting of 9 human and animal lineages. The genomic diversity within these lineages is a pathogenesis factor that affects virulence, transmissibility, host response, and antibiotic resistance. Hence it is important to develop improved information systems for tracking and understanding the spreading and evolution of genomes. We present results obtained thanks to a new informatics platform for computational biology of MTBC, that uses a convenience sample from public/private SRAs, designated as TB-Annotator. Version 1 was a first interactive graphic-based web tool based on 15,901 representative genomes. Version 2, still interactive, is a more sophisticated database, developed using the Snakemake Workflow Management System (WMS) that allows an unsupervised global and scalable analysis of the content of the USA National Center for Biotechnology Information Short Read Archives database. This platform analyzes nucleotide variants, the presence/absence of genes, known regions of difference and detect new deletions, the insertion sites of mobile genetic elements, and allows phylogenetic trees to be built, imported in a graphical interface and interactively analyzed between the data and the tree. The objective of TB-Annotator is triple: detect recent epidemiological links, reconstruct distant phylogeographical histories as well as perform more complex phenotypic/genotypic Genome-Wide Association Studies (GWAS). In this paper, we compare the various taxonomic SNPs-based labels and hierarchies previously described in recent reference papers for L1, and present a comparative analysis that allows identification of alias and thus provides the basis of a future unifying naming scheme for L1 sublineages. We present a global phylogenetic tree built with RAxML-NG, and one on L2; at the time of writing, we characterized about 200 sublineages, with many new ones; a detail tree for Modern L2 and a hierarchical scheme allowing to facilitate L2 lineage assignment are also presented.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Tuberculosis
Tuberculosis 医学-呼吸系统
CiteScore
4.60
自引率
3.10%
发文量
87
审稿时长
49 days
期刊介绍: Tuberculosis is a speciality journal focusing on basic experimental research on tuberculosis, notably on bacteriological, immunological and pathogenesis aspects of the disease. The journal publishes original research and reviews on the host response and immunology of tuberculosis and the molecular biology, genetics and physiology of the organism, however discourages submissions with a meta-analytical focus (for example, articles based on searches of published articles in public electronic databases, especially where there is lack of evidence of the personal involvement of authors in the generation of such material). We do not publish Clinical Case-Studies. Areas on which submissions are welcomed include: -Clinical TrialsDiagnostics- Antimicrobial resistance- Immunology- Leprosy- Microbiology, including microbial physiology- Molecular epidemiology- Non-tuberculous Mycobacteria- Pathogenesis- Pathology- Vaccine development. This Journal does not accept case-reports. The resurgence of interest in tuberculosis has accelerated the pace of relevant research and Tuberculosis has grown with it, as the only journal dedicated to experimental biomedical research in tuberculosis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信