{"title":"幽门螺杆菌抗生素耐药性评估:基于全基因组遗传变异的实用且可解释的机器学习模型。","authors":"Yingying Wang, Shuwen Zheng, Rui Guo, Yanke Li, Honghao Yin, Xunan Qiu, Jijun Chen, Chuxuan Ni, Yuan Yuan, Yuehua Gong","doi":"10.1080/21505594.2025.2481503","DOIUrl":null,"url":null,"abstract":"<p><p><i>Helicobacter pylori</i> (<i>H. pylori</i>) antibiotic resistance poses a global health threat. Accurate identification of antibiotic resistant strains is essential for the control of infection. In the present study, our goal is to leverage the whole-genome data of <i>H. pylori</i> to develop practical and interpretable machine learning (ML) models for comprehensive antibiotic resistance assessment. A total of 296 <i>H. pylori</i> isolates with genome-wide data were downloaded from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) and the National Center for Biotechnology Information (NCBI) databases. By training ML models on feature sets of single nucleotide polymorphisms from SNP calling (SNPs-1), antibiotic-resistance SNP annotated by the Comprehensive Antibiotic Resistance Database (SNPs-2), gene presence or absence (GPA), we generated predictive models for four antibiotics and multidrug-resistance (MDR). Among them, the models that combined SNPs-1, SNPs-2, and GPA data demonstrated the best performance, with the eXtreme Gradient Boosting (XGBoost) consistently outperforming others. And then we utilized the SHapley Additive exPlanations (SHAP) method to interpret the ML models. Furthermore, a free web application for the MDR model was deployed to the GitHub repository (https://H.pylori/MDR/App/). Our study demonstrated the promise of employing whole-genome data in conjunction with ML algorithms to forecast <i>H. pylori</i> antibiotic resistance. In the future, the application of this approach for predicting <i>H. pylori</i> antibiotic resistance would hold the potential to mitigate the empiric administration.</p>","PeriodicalId":23747,"journal":{"name":"Virulence","volume":"16 1","pages":"2481503"},"PeriodicalIF":5.5000,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11934168/pdf/","citationCount":"0","resultStr":"{\"title\":\"Assessment for antibiotic resistance in <i>Helicobacter pylori</i>: A practical and interpretable machine learning model based on genome-wide genetic variation.\",\"authors\":\"Yingying Wang, Shuwen Zheng, Rui Guo, Yanke Li, Honghao Yin, Xunan Qiu, Jijun Chen, Chuxuan Ni, Yuan Yuan, Yuehua Gong\",\"doi\":\"10.1080/21505594.2025.2481503\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p><i>Helicobacter pylori</i> (<i>H. pylori</i>) antibiotic resistance poses a global health threat. Accurate identification of antibiotic resistant strains is essential for the control of infection. In the present study, our goal is to leverage the whole-genome data of <i>H. pylori</i> to develop practical and interpretable machine learning (ML) models for comprehensive antibiotic resistance assessment. A total of 296 <i>H. pylori</i> isolates with genome-wide data were downloaded from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) and the National Center for Biotechnology Information (NCBI) databases. By training ML models on feature sets of single nucleotide polymorphisms from SNP calling (SNPs-1), antibiotic-resistance SNP annotated by the Comprehensive Antibiotic Resistance Database (SNPs-2), gene presence or absence (GPA), we generated predictive models for four antibiotics and multidrug-resistance (MDR). Among them, the models that combined SNPs-1, SNPs-2, and GPA data demonstrated the best performance, with the eXtreme Gradient Boosting (XGBoost) consistently outperforming others. And then we utilized the SHapley Additive exPlanations (SHAP) method to interpret the ML models. Furthermore, a free web application for the MDR model was deployed to the GitHub repository (https://H.pylori/MDR/App/). Our study demonstrated the promise of employing whole-genome data in conjunction with ML algorithms to forecast <i>H. pylori</i> antibiotic resistance. In the future, the application of this approach for predicting <i>H. pylori</i> antibiotic resistance would hold the potential to mitigate the empiric administration.</p>\",\"PeriodicalId\":23747,\"journal\":{\"name\":\"Virulence\",\"volume\":\"16 1\",\"pages\":\"2481503\"},\"PeriodicalIF\":5.5000,\"publicationDate\":\"2025-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11934168/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Virulence\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1080/21505594.2025.2481503\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/21 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"IMMUNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Virulence","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1080/21505594.2025.2481503","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/21 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
Assessment for antibiotic resistance in Helicobacter pylori: A practical and interpretable machine learning model based on genome-wide genetic variation.
Helicobacter pylori (H. pylori) antibiotic resistance poses a global health threat. Accurate identification of antibiotic resistant strains is essential for the control of infection. In the present study, our goal is to leverage the whole-genome data of H. pylori to develop practical and interpretable machine learning (ML) models for comprehensive antibiotic resistance assessment. A total of 296 H. pylori isolates with genome-wide data were downloaded from the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) and the National Center for Biotechnology Information (NCBI) databases. By training ML models on feature sets of single nucleotide polymorphisms from SNP calling (SNPs-1), antibiotic-resistance SNP annotated by the Comprehensive Antibiotic Resistance Database (SNPs-2), gene presence or absence (GPA), we generated predictive models for four antibiotics and multidrug-resistance (MDR). Among them, the models that combined SNPs-1, SNPs-2, and GPA data demonstrated the best performance, with the eXtreme Gradient Boosting (XGBoost) consistently outperforming others. And then we utilized the SHapley Additive exPlanations (SHAP) method to interpret the ML models. Furthermore, a free web application for the MDR model was deployed to the GitHub repository (https://H.pylori/MDR/App/). Our study demonstrated the promise of employing whole-genome data in conjunction with ML algorithms to forecast H. pylori antibiotic resistance. In the future, the application of this approach for predicting H. pylori antibiotic resistance would hold the potential to mitigate the empiric administration.
期刊介绍:
Virulence is a fully open access peer-reviewed journal. All articles will (if accepted) be available for anyone to read anywhere, at any time immediately on publication.
Virulence is the first international peer-reviewed journal of its kind to focus exclusively on microbial pathogenicity, the infection process and host-pathogen interactions. To address the new infectious challenges, emerging infectious agents and antimicrobial resistance, there is a clear need for interdisciplinary research.