Prediction of Antibiotic Resistance Phenotypes and Minimum Inhibitory Concentrations in Salmonella Using Machine Learning Analysis of Its Pan-Genome and Pan-Resistome Features.

IF 1.9 2区 农林科学 Q3 FOOD SCIENCE & TECHNOLOGY
Yichen He, Xiujuan Zhou, Lida Zhang, Yan Cui, Yiping He, Andrew Gehring, Xiangyu Deng, Xianming Shi
{"title":"Prediction of Antibiotic Resistance Phenotypes and Minimum Inhibitory Concentrations in <i>Salmonella</i> Using Machine Learning Analysis of Its Pan-Genome and Pan-Resistome Features.","authors":"Yichen He, Xiujuan Zhou, Lida Zhang, Yan Cui, Yiping He, Andrew Gehring, Xiangyu Deng, Xianming Shi","doi":"10.1089/fpd.2024.0170","DOIUrl":null,"url":null,"abstract":"<p><p>Traditional experimental methods for determining antibiotic resistance phenotypes (ARPs) and minimum inhibitory concentrations (MICs) in bacteria are laborious and time consuming. This study aims to explore the potential of whole-genome sequencing data combined with machine learning models for robustly predicting ARPs and MICs in <i>Salmonella</i>. Using a training set of 6394 <i>Salmonella</i> genomes alongside antimicrobial susceptibility testing results, we built two machine learning (ML) predictive models based on the pan-genome and pan-resistome. Each model was implemented using three algorithms: random forest, extreme gradient boosting (XGB), and convolutional neural network. Among them, XGB achieved the highest overall accuracy, with the pan-genome and pan-resistome models accurately predicting ARPs (98.51% and 97.77%) and MICs (81.42% and 78.99%) for 15 commonly used antibiotics. Feature extraction from pan-genome and pan-resistome data effectively reduced computational complexity and significantly decreased computation time. Notably, fewer than 10 key genomic features, often linked to known resistance or mobile genes, were sufficient for robust predictions for each antibiotic. This study also identified challenges, including imbalanced resistance classes and imprecise MIC measurements, which impacted prediction accuracy. These findings highlight the importance of using multiple evaluation metrics to assess model performance comprehensively. Overall, our findings demonstrated that ML, utilizing pan-genome or pan-resistome features, was highly effective in predicting antibiotic resistance and identifying correlated genetic features in <i>Salmonella</i>. This approach holds great potential to supplement conventional culture-based methods for routine surveillance of antibiotic-resistant bacteria.</p>","PeriodicalId":12333,"journal":{"name":"Foodborne pathogens and disease","volume":" ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Foodborne pathogens and disease","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1089/fpd.2024.0170","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Traditional experimental methods for determining antibiotic resistance phenotypes (ARPs) and minimum inhibitory concentrations (MICs) in bacteria are laborious and time consuming. This study aims to explore the potential of whole-genome sequencing data combined with machine learning models for robustly predicting ARPs and MICs in Salmonella. Using a training set of 6394 Salmonella genomes alongside antimicrobial susceptibility testing results, we built two machine learning (ML) predictive models based on the pan-genome and pan-resistome. Each model was implemented using three algorithms: random forest, extreme gradient boosting (XGB), and convolutional neural network. Among them, XGB achieved the highest overall accuracy, with the pan-genome and pan-resistome models accurately predicting ARPs (98.51% and 97.77%) and MICs (81.42% and 78.99%) for 15 commonly used antibiotics. Feature extraction from pan-genome and pan-resistome data effectively reduced computational complexity and significantly decreased computation time. Notably, fewer than 10 key genomic features, often linked to known resistance or mobile genes, were sufficient for robust predictions for each antibiotic. This study also identified challenges, including imbalanced resistance classes and imprecise MIC measurements, which impacted prediction accuracy. These findings highlight the importance of using multiple evaluation metrics to assess model performance comprehensively. Overall, our findings demonstrated that ML, utilizing pan-genome or pan-resistome features, was highly effective in predicting antibiotic resistance and identifying correlated genetic features in Salmonella. This approach holds great potential to supplement conventional culture-based methods for routine surveillance of antibiotic-resistant bacteria.

利用机器学习分析沙门氏菌泛基因组和泛抵抗组特征预测抗生素耐药表型和最低抑菌浓度。
传统的测定细菌抗生素耐药表型(ARPs)和最低抑菌浓度(mic)的实验方法既费力又耗时。本研究旨在探索全基因组测序数据与机器学习模型相结合的潜力,以可靠地预测沙门氏菌的ARPs和mic。利用6394个沙门氏菌基因组训练集和抗菌药敏试验结果,建立了基于泛基因组和泛抵抗组的机器学习(ML)预测模型。每个模型都使用三种算法实现:随机森林、极端梯度增强(XGB)和卷积神经网络。其中,XGB总体准确率最高,泛基因组和泛抵抗组模型预测15种常用抗生素的ARPs(98.51%)和mic(81.42%)的准确率分别为97.77%和78.99%。对泛基因组和泛抵抗组数据进行特征提取,有效降低了计算复杂度,显著缩短了计算时间。值得注意的是,不到10个关键的基因组特征(通常与已知的耐药性或可移动基因有关)足以对每种抗生素进行可靠的预测。该研究还发现了一些挑战,包括不平衡的阻力等级和不精确的MIC测量,这些都会影响预测的准确性。这些发现强调了使用多个评估指标来全面评估模型性能的重要性。总的来说,我们的研究结果表明,利用泛基因组或泛抵抗组特征的ML在预测沙门氏菌的抗生素耐药性和识别相关遗传特征方面非常有效。这种方法具有很大的潜力,可以补充传统的基于培养的抗生素耐药细菌常规监测方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Foodborne pathogens and disease
Foodborne pathogens and disease 医学-食品科技
CiteScore
5.30
自引率
3.60%
发文量
80
审稿时长
1 months
期刊介绍: Foodborne Pathogens and Disease is one of the most inclusive scientific publications on the many disciplines that contribute to food safety. Spanning an array of issues from "farm-to-fork," the Journal bridges the gap between science and policy to reduce the burden of foodborne illness worldwide. Foodborne Pathogens and Disease coverage includes: Agroterrorism Safety of organically grown and genetically modified foods Emerging pathogens Emergence of drug resistance Methods and technology for rapid and accurate detection Strategies to destroy or control foodborne pathogens Novel strategies for the prevention and control of plant and animal diseases that impact food safety Biosecurity issues and the implications of new regulatory guidelines Impact of changing lifestyles and consumer demands on food safety.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信