{"title":"An explainable machine learning pipeline for prediction of antimicrobial resistance in <i>Pseudomonas aeruginosa</i>.","authors":"Aakriti Jain, Govinda Rao Dabburu, Bishal Samanta, Neelja Singhal, Manish Kumar","doi":"10.1093/bioadv/vbaf190","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Prediction of antimicrobial resistance in <i>Pseudomonas aeruginosa</i> using machine learning and genomic sequences holds the potential to serve as comparable alternatives to laboratory based detection if not better. Additionally, model interpretability can further enhance the potential of these models paving way for their reproducibility.</p><p><strong>Results: </strong>We have developed a machine-learning based 2-tier pipeline to predict resistance phenotype in <i>P. aeruginosa</i> using only genomic sequences as input in the form of k-mers. Our Decision Tree Model yields an accuracy of 79% and area under the receiver operating characteristic curve of 0.77 with a 70% specificity and 84% sensitivity. We have interpreted the model's predictions using explainable AI as an attempt to bridge the gap between computational prediction and biological insight. Through these interpretations we have gathered antibiotic specific k-mer signatures pushing phenotype towards resistance.</p><p><strong>Availability and implementation: </strong>The curated dataset and related codes are available on request.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf190"},"PeriodicalIF":2.8000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12380447/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf190","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: Prediction of antimicrobial resistance in Pseudomonas aeruginosa using machine learning and genomic sequences holds the potential to serve as comparable alternatives to laboratory based detection if not better. Additionally, model interpretability can further enhance the potential of these models paving way for their reproducibility.
Results: We have developed a machine-learning based 2-tier pipeline to predict resistance phenotype in P. aeruginosa using only genomic sequences as input in the form of k-mers. Our Decision Tree Model yields an accuracy of 79% and area under the receiver operating characteristic curve of 0.77 with a 70% specificity and 84% sensitivity. We have interpreted the model's predictions using explainable AI as an attempt to bridge the gap between computational prediction and biological insight. Through these interpretations we have gathered antibiotic specific k-mer signatures pushing phenotype towards resistance.
Availability and implementation: The curated dataset and related codes are available on request.