Pan-lineage Mycobacterium tuberculosis reference genome for enhanced molecular diagnosis.

IF 3.9 2区 生物学 Q1 GENETICS & HEREDITY
Kunhyung Bahk, Joohon Sung, Mitsuko Seki, Kyungjong Kim, Jina Kim, Hongjo Choi, Jake Whang, Satoshi Mitarai
{"title":"Pan-lineage Mycobacterium tuberculosis reference genome for enhanced molecular diagnosis.","authors":"Kunhyung Bahk, Joohon Sung, Mitsuko Seki, Kyungjong Kim, Jina Kim, Hongjo Choi, Jake Whang, Satoshi Mitarai","doi":"10.1093/dnares/dsae023","DOIUrl":null,"url":null,"abstract":"<p><p>In Mycobacterium tuberculosis (MTB) control, whole genome sequencing-based molecular drug susceptibility testing (molDST-WGS) has emerged as a pivotal tool. However, the current reliance on a single-strain reference limits molDST-WGS's true potential. To address this, we introduce a new pan-lineage reference genome, 'MtbRf'. We assembled 'unmapped' reads from 3,614 MTB genomes (751 L1; 881 L2; 1,700 L3; and 282 L4) into 35 shared, annotated contigs (54 coding sequences [CDSs]). We constructed MtbRf through: (1) searching for contig homologues among genome database that precipitate results uniquely within Mycobacteria genus; (2) comparing genomes with H37Rv ('lift-over') to define 18 insertions; and (3) filling gaps in H37Rv with insertions. MtbRf adds 1.18% sequences to H37rv, salvaging >60% of previously unmapped reads. Transcriptomics confirmed gene expression of new CDSs. The new variants provided a moderate DST predictive value (AUROC 0.60-0.75). MtbRf thus unveils previously hidden genomic information and lays the foundation for lineage-specific molDST-WGS.</p>","PeriodicalId":51014,"journal":{"name":"DNA Research","volume":" ","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11339604/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DNA Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/dnares/dsae023","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

In Mycobacterium tuberculosis (MTB) control, whole genome sequencing-based molecular drug susceptibility testing (molDST-WGS) has emerged as a pivotal tool. However, the current reliance on a single-strain reference limits molDST-WGS's true potential. To address this, we introduce a new pan-lineage reference genome, 'MtbRf'. We assembled 'unmapped' reads from 3,614 MTB genomes (751 L1; 881 L2; 1,700 L3; and 282 L4) into 35 shared, annotated contigs (54 coding sequences [CDSs]). We constructed MtbRf through: (1) searching for contig homologues among genome database that precipitate results uniquely within Mycobacteria genus; (2) comparing genomes with H37Rv ('lift-over') to define 18 insertions; and (3) filling gaps in H37Rv with insertions. MtbRf adds 1.18% sequences to H37rv, salvaging >60% of previously unmapped reads. Transcriptomics confirmed gene expression of new CDSs. The new variants provided a moderate DST predictive value (AUROC 0.60-0.75). MtbRf thus unveils previously hidden genomic information and lays the foundation for lineage-specific molDST-WGS.

用于强化分子诊断的全系结核分枝杆菌参考基因组。
在结核分枝杆菌(MTB)控制方面,基于全基因组测序的分子药敏试验(molDST-WGS)已成为一种关键工具。然而,目前对单一菌株参考文献的依赖限制了 molDST-WGS 的真正潜力。为了解决这个问题,我们引入了一个新的泛品系参考基因组 "MtbRf"。我们将来自 3,614 个 MTB 基因组(751 个 L1、881 个 L2、1,700 个 L3 和 282 个 L4)的 "未映射 "读数组装成 35 个共享的注释等位组(54 个 CDS)。我们通过以下方法构建了 MtbRf:1)在基因组数据库中搜索等位基因同源物,这些等位基因在分枝杆菌属中具有独特的沉淀结果;2)将基因组与 H37Rv 进行比较("lift-over"),以确定 18 个插入片段;3)用插入片段填补 H37Rv 中的空白。MtbRf 为 H37rv 增加了 1.18% 的序列,挽救了 60% 以上以前未映射的读数。转录组学证实了新 CDS 的基因表达。新变异具有中等的 DST 预测价值(AUROC 0.60-0.75)。因此,MtbRf揭示了以前隐藏的基因组信息,为针对不同品系的molDST-WGS奠定了基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
DNA Research
DNA Research 生物-遗传学
CiteScore
6.00
自引率
4.90%
发文量
39
审稿时长
4.5 months
期刊介绍: DNA Research is an internationally peer-reviewed journal which aims at publishing papers of highest quality in broad aspects of DNA and genome-related research. Emphasis will be made on the following subjects: 1) Sequencing and characterization of genomes/important genomic regions, 2) Comprehensive analysis of the functions of genes, gene families and genomes, 3) Techniques and equipments useful for structural and functional analysis of genes, gene families and genomes, 4) Computer algorithms and/or their applications relevant to structural and functional analysis of genes and genomes. The journal also welcomes novel findings in other scientific disciplines related to genomes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信