Gene conversion and duplication contribute to genetic variation in an outbreak of Mycobacterium tuberculosis.

Impact factor: 4.0 · Zone 2 (Biology) · JCR Q1 (Genetics & Heredity)
Christoph Stritt, Michelle Reitsma, Ana Maria Garcia Marin, Galo Goig, Anna Dötsch, Sonia Borrell, Christian Beisel, Iñaki Comas, Daniela Brites, Sebastien Gagneux
Journal: Microbial Genomics, volume 11, issue 5
Publication date: 2025-05-01
Publication type: Journal Article
DOI: 10.1099/mgen.0.001396 (https://doi.org/10.1099/mgen.0.001396)
Full-text PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12046097/pdf/
Citations: 0

Abstract

Repeats are the most diverse and dynamic but also the least well-understood component of microbial genomes. For all we know, repeat-associated mutations such as duplications, deletions, inversions and gene conversion might be as common as point mutations, but because of short-read myopia and methodological bias, they have received much less attention. Long-read DNA sequencing opens the perspective of resolving repeats and systematically investigating the mutations they induce. For this study, we assembled the genomes of 16 closely related strains of the bacterial pathogen Mycobacterium tuberculosis from Pacific Biosciences HiFi reads, with the aim of characterizing the full spectrum of DNA polymorphisms. We found that complete and accurate genomes can be assembled from HiFi reads, with read size being the main limitation in the presence of duplications. By combining a reference-free pangenome graph with extensive repeat annotation, we identified 110 variants, 58 of which could be assigned to repeat-associated mutational mechanisms such as strand slippage and homologous recombination. Whilst recombination events were less frequent than point mutations, they affected large regions and introduced multiple variants at once, as shown by three gene conversion events and a duplication of 7.3 kb that involved ppe18 and ppe57, two genes possibly involved in immune subversion. The vast majority of variants were present in single isolates, such that phylogenetic resolution was only marginally increased when estimating a tree from complete genomes. Our study shows that the contribution of repeat-associated mechanisms of mutation can be similar to that of point mutations at the microevolutionary scale of an outbreak. A large reservoir of unstudied genetic variation in this 'monomorphic' bacterial pathogen awaits investigation.

Source journal: Microbial Genomics (category: Medicine-Epidemiology)
CiteScore: 6.60
Self-citation rate: 2.60%
Articles published: 153
Review time: 12 weeks
Journal description: Microbial Genomics (MGen) is a fully open access, mandatory open data and peer-reviewed journal publishing high-profile original research on archaea, bacteria, microbial eukaryotes and viruses.