Minimal gene signatures enable high-accuracy prediction of antibiotic resistance in Pseudomonas aeruginosa.

IF 3.5 2区生物学 Q1 MATHEMATICAL & COMPUTATIONAL BIOLOGY

NPJ Systems Biology and Applications Pub Date : 2025-10-02 DOI:10.1038/s41540-025-00584-0

Nabia Shahreen, Syed Ahsan Shahid, Mahfuze Subhani, Adil Al-Siyabi, Rajib Saha

{"title":"Minimal gene signatures enable high-accuracy prediction of antibiotic resistance in Pseudomonas aeruginosa.","authors":"Nabia Shahreen, Syed Ahsan Shahid, Mahfuze Subhani, Adil Al-Siyabi, Rajib Saha","doi":"10.1038/s41540-025-00584-0","DOIUrl":null,"url":null,"abstract":"<p><p>Antimicrobial resistance (AMR) in Pseudomonas aeruginosa poses a critical global health challenge, with current diagnostics relying on slow, culture-based methods. Here, we present a ML framework leveraging transcriptomic data to predict antibiotic resistance with high accuracy. We applied a genetic algorithm to 414 clinical isolates to identify minimal, highly predictive gene sets (~35-40 genes) distinguishing resistant from susceptible strains for meropenem, ciprofloxacin, tobramycin, and ceftazidime. Automated ML classifiers trained on these sets achieved accuracies of 96-99% on test data (F1 scores: 0.93-0.99), surpassing clinical deployment thresholds. Multiple distinct, non-overlapping gene subsets exhibited comparable performance, suggesting that resistance acquisition is associated with changes in the expression of diverse regulatory and metabolic genes. Comparison with known resistance markers from CARD and operon annotations revealed a substantial number of previously unannotated clusters, highlighting significant knowledge gaps in current AMR understanding. Mapping these genes onto independently modulated gene sets (iModulons) revealed transcriptional adaptations across diverse genetic regions. Overall, this study presents a streamlined machine-learning workflow for transcriptomic data and offers a pathway toward rapid diagnostics and personalized treatment strategies against AMR.</p>","PeriodicalId":19345,"journal":{"name":"NPJ Systems Biology and Applications","volume":"11 1","pages":"108"},"PeriodicalIF":3.5000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12491527/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Systems Biology and Applications","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41540-025-00584-0","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Antimicrobial resistance (AMR) in Pseudomonas aeruginosa poses a critical global health challenge, with current diagnostics relying on slow, culture-based methods. Here, we present a ML framework leveraging transcriptomic data to predict antibiotic resistance with high accuracy. We applied a genetic algorithm to 414 clinical isolates to identify minimal, highly predictive gene sets (~35-40 genes) distinguishing resistant from susceptible strains for meropenem, ciprofloxacin, tobramycin, and ceftazidime. Automated ML classifiers trained on these sets achieved accuracies of 96-99% on test data (F1 scores: 0.93-0.99), surpassing clinical deployment thresholds. Multiple distinct, non-overlapping gene subsets exhibited comparable performance, suggesting that resistance acquisition is associated with changes in the expression of diverse regulatory and metabolic genes. Comparison with known resistance markers from CARD and operon annotations revealed a substantial number of previously unannotated clusters, highlighting significant knowledge gaps in current AMR understanding. Mapping these genes onto independently modulated gene sets (iModulons) revealed transcriptional adaptations across diverse genetic regions. Overall, this study presents a streamlined machine-learning workflow for transcriptomic data and offers a pathway toward rapid diagnostics and personalized treatment strategies against AMR.

查看原文本刊更多论文

最小的基因标记使铜绿假单胞菌的抗生素耐药性的高精度预测。

铜绿假单胞菌的抗微生物药物耐药性（AMR）构成了一个重大的全球卫生挑战，目前的诊断依赖于缓慢的、基于培养的方法。在这里，我们提出了一个ML框架，利用转录组学数据来高精度地预测抗生素耐药性。我们对414株临床分离株应用遗传算法，以确定最小的、高度预测的基因集（~35-40个基因），以区分美罗培南、环丙沙星、妥布霉素和头孢他啶的耐药菌株和敏感菌株。在这些集合上训练的自动ML分类器在测试数据上达到了96-99%的准确率（F1分数：0.93-0.99），超过了临床部署阈值。多个不同的、不重叠的基因亚群表现出类似的表现，这表明耐药性的获得与多种调控和代谢基因的表达变化有关。与来自CARD和操纵子注释的已知抗性标记进行比较，发现了大量以前未注释的聚类，突出了当前AMR理解中的重大知识空白。将这些基因定位到独立调节的基因集（iModulons）上，揭示了不同遗传区域的转录适应性。总体而言，本研究提出了一种简化的转录组学数据机器学习工作流程，并为针对AMR的快速诊断和个性化治疗策略提供了途径。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

NPJ Systems Biology and Applications Mathematics-Applied Mathematics

CiteScore

5.80

自引率

0.00%

发文量

审稿时长

8 weeks

期刊介绍： npj Systems Biology and Applications is an online Open Access journal dedicated to publishing the premier research that takes a systems-oriented approach. The journal aims to provide a forum for the presentation of articles that help define this nascent field, as well as those that apply the advances to wider fields. We encourage studies that integrate, or aid the integration of, data, analyses and insight from molecules to organisms and broader systems. Important areas of interest include not only fundamental biological systems and drug discovery, but also applications to health, medical practice and implementation, big data, biotechnology, food science, human behaviour, broader biological systems and industrial applications of systems biology. We encourage all approaches, including network biology, application of control theory to biological systems, computational modelling and analysis, comprehensive and/or high-content measurements, theoretical, analytical and computational studies of system-level properties of biological systems and computational/software/data platforms enabling such studies.