An improved catalogue for whole-genome sequencing prediction of bedaquiline resistance in Mycobacterium tuberculosis using a reproducible algorithmic approach.
Dylan Adlard, Lavania Joseph, Hermione Webster, Ailva O'Reilly, Jeffrey Knaggs, Tim E A Peto, Derrick W Crook, Shaheed V Omar, Philip W Fowler
{"title":"An improved catalogue for whole-genome sequencing prediction of bedaquiline resistance in <i>Mycobacterium tuberculosis</i> using a reproducible algorithmic approach.","authors":"Dylan Adlard, Lavania Joseph, Hermione Webster, Ailva O'Reilly, Jeffrey Knaggs, Tim E A Peto, Derrick W Crook, Shaheed V Omar, Philip W Fowler","doi":"10.1099/mgen.0.001429","DOIUrl":null,"url":null,"abstract":"<p><p>Bedaquiline (BDQ) has only been approved for use for just over a decade and is a key drug for treating multidrug-resistant tuberculosis; however, rising levels of resistance threaten to reduce its effectiveness. Catalogues of mutations associated with resistance to BDQ are key to detecting resistance genetically for either diagnosis or surveillance. At present, building catalogues requires considerable expert knowledge, often requires the use of complex grading rules and is an irreproducible process. We developed an automated method, catomatic, that associates genetic variants with resistance (or susceptibility) using a two-tailed binomial test with a stated background rate and applied it to a dataset of 11,867 <i>Mycobacterium tuberculosis</i> samples with whole-genome and BDQ susceptibility testing data. Using this framework, we investigated how to best classify variants and the phenotypic significance of minor alleles. The genes <i>mmpS5</i> and <i>mmpL5</i> are not directly associated with BDQ resistance, and our catalogue of <i>Rv0678</i>, <i>atpE</i> and <i>pepQ</i> variants attains a cross-validated sensitivity and specificity of 79.4±1.8% and 98.5±0.3%, respectively, for 94±0.4% of samples. Identifying samples with subpopulations containing <i>Rv0678</i> variants improves sensitivity, and detection thresholds in bioinformatic pipelines should therefore be lowered. By using a more permissive and deterministic algorithm trained on a sufficient number of resistant samples, we have reproducibly constructed a catalogue of BDQ resistance-associated variants that is comprehensive and accurate.</p>","PeriodicalId":18487,"journal":{"name":"Microbial Genomics","volume":"11 6","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Microbial Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1099/mgen.0.001429","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Bedaquiline (BDQ) has only been approved for use for just over a decade and is a key drug for treating multidrug-resistant tuberculosis; however, rising levels of resistance threaten to reduce its effectiveness. Catalogues of mutations associated with resistance to BDQ are key to detecting resistance genetically for either diagnosis or surveillance. At present, building catalogues requires considerable expert knowledge, often requires the use of complex grading rules and is an irreproducible process. We developed an automated method, catomatic, that associates genetic variants with resistance (or susceptibility) using a two-tailed binomial test with a stated background rate and applied it to a dataset of 11,867 Mycobacterium tuberculosis samples with whole-genome and BDQ susceptibility testing data. Using this framework, we investigated how to best classify variants and the phenotypic significance of minor alleles. The genes mmpS5 and mmpL5 are not directly associated with BDQ resistance, and our catalogue of Rv0678, atpE and pepQ variants attains a cross-validated sensitivity and specificity of 79.4±1.8% and 98.5±0.3%, respectively, for 94±0.4% of samples. Identifying samples with subpopulations containing Rv0678 variants improves sensitivity, and detection thresholds in bioinformatic pipelines should therefore be lowered. By using a more permissive and deterministic algorithm trained on a sufficient number of resistant samples, we have reproducibly constructed a catalogue of BDQ resistance-associated variants that is comprehensive and accurate.
期刊介绍:
Microbial Genomics (MGen) is a fully open access, mandatory open data and peer-reviewed journal publishing high-profile original research on archaea, bacteria, microbial eukaryotes and viruses.