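Redefinition of transmission clusters by accessing to additional diversity in Mycobacterium tuberculosis through long-read sequencing.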

IF 2.5 | Zone 4, Medicine | JCR Q1, Parasitology
Sergio Buenestado-Serrano, Silvia Vallejo-Godoy, Francisca Escabias Machuca, Pilar Barroso, Miguel Martínez-Lirola, Teresa Cabezas, Patricia Muñoz, Laura Pérez-Lago, Darío García de Viedma
Pathogens and Global Health, pp. 1-11. Journal Article. Published 2025-09-23. DOI: 10.1080/20477724.2025.2555926 (https://doi.org/10.1080/20477724.2025.2555926)
Citation count: 0

Abstract


Whole-genome sequencing based on short-read sequencing has revolutionized the precision with which Mycobacterium tuberculosis (MTB) transmission can be tracked. However, the complexity of the MTB genome (10% repetitive regions and 65% GC content) challenges short-read mapping and assembly, so certain genomic regions are excluded from the analysis. Long-read sequencing can overcome these limitations and give access to these regions, which generally go uninterrogated. Our study aims to evaluate the potential of long-read sequencing to redefine long-term MTB transmission clusters previously characterized by short-read sequencing. We selected 78 cases from eight long-term clusters (spanning 5-17 years; 7 to 16 cases each) from a population-based genomic epidemiology program in Almería, Spain. The clusters were selected to include cases (i) infected by identical strains (0 SNPs), (ii) exhibiting pairwise SNP distances from 1 to 16 SNPs, and (iii) distributed along different branches in the genomic networks. Long-read analysis increased the distance of each cluster from the reference by an average of 258 SNPs and intercluster distances by 113 SNPs. Within-cluster diversity also increased, with pairwise distances rising from 1 to 22 SNPs across 1-7 network branches. In one cluster, the newly acquired diversity exceeded the 12-SNP threshold used to define a transmission cluster. Additionally, in four clusters, 1-2 cases previously classified as infected by identical strains were reclassified after additional SNP differences were identified. The newly identified diversity between cases allowed us to identify index cases, reconstruct transmission chronologies, refine patient-to-patient relationships, and propose new epidemiological interpretations for the clustered cases.
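The abstract relies on two quantities: pairwise SNP distances between cases and a 12-SNP threshold for calling a transmission cluster. The following is a minimal sketch of how such distances and a threshold-based grouping could be computed. It is not the authors' pipeline. The variant representation (a dict from genome position to alternate base), the single-linkage rule, the function names, and the toy isolates `short_read` and `long_read` are all assumptions made for illustration.

```python
from itertools import combinations

# 12 SNPs is the threshold named in the abstract; how it is applied
# below (single linkage on pairwise distances) is an assumption.
SNP_THRESHOLD = 12


def snp_distance(a, b):
    """Count positions where two isolates differ.

    Each isolate is a dict {position: alternate_base} relative to a
    shared reference; a missing position means the reference base.
    """
    return sum(1 for pos in a.keys() | b.keys() if a.get(pos) != b.get(pos))


def pairwise_distances(isolates):
    """Return {(name_x, name_y): distance} for every unordered pair."""
    return {
        (x, y): snp_distance(isolates[x], isolates[y])
        for x, y in combinations(sorted(isolates), 2)
    }


def clusters(isolates, threshold=SNP_THRESHOLD):
    """Group isolates by single linkage: join any pair within threshold."""
    parent = {name: name for name in isolates}

    def find(n):
        # Union-find root lookup with path halving.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    for (x, y), d in pairwise_distances(isolates).items():
        if d <= threshold:
            parent[find(x)] = find(y)

    groups = {}
    for name in isolates:
        groups.setdefault(find(name), set()).add(name)
    return sorted(sorted(g) for g in groups.values())


# Hypothetical toy data: calls restricted to short-read-accessible regions.
short_read = {
    "case1": {100: "A", 200: "T"},
    "case2": {100: "A", 200: "T"},
    "case3": {100: "A", 300: "G"},
}

# Same cases with extra (invented) calls in regions only long reads resolve.
long_read = {
    "case1": {100: "A", 200: "T"},
    "case2": {100: "A", 200: "T", 5000: "C"},
    "case3": {100: "A", 300: "G", 5001: "G"},
}
```

In this toy data, case1 and case2 are indistinguishable with the short-read calls and separate by one SNP once the extra long-read position is included, which mirrors the reclassification of "identical" cases described above. A real analysis would also need to handle masked or uncalled positions, which this sketch ignores.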

Source journal
Pathogens and Global Health (Public, Environmental & Occupational Health; Parasitology)
CiteScore: 6.00
Self-citation rate: 0.00%
Articles published: 60
Review time: 6-12 weeks
About the journal: Pathogens and Global Health is a journal of infectious disease and public health that focuses on the translation of molecular, immunological, genomic and epidemiological knowledge into control measures for global health threats. The journal publishes original innovative research papers, review articles, and interviews with policy makers and opinion leaders on health subjects of international relevance. It provides a forum for scientific, ethical and political discussion of innovative solutions for controlling and eradicating infectious diseases, with particular emphasis on those diseases affecting the poorest regions of the world.