Bioinformatics combined with machine learning unravels differences among environmental, seafood, and clinical isolates of Vibrio parahaemolyticus.

IF 4 2区 生物学 Q2 MICROBIOLOGY
Frontiers in Microbiology Pub Date : 2025-03-19 eCollection Date: 2025-01-01 DOI:10.3389/fmicb.2025.1549260
Shuyi Feng, Padmini Ramachandran, Ryan A Blaustein, Abani K Pradhan
{"title":"Bioinformatics combined with machine learning unravels differences among environmental, seafood, and clinical isolates of <i>Vibrio parahaemolyticus</i>.","authors":"Shuyi Feng, Padmini Ramachandran, Ryan A Blaustein, Abani K Pradhan","doi":"10.3389/fmicb.2025.1549260","DOIUrl":null,"url":null,"abstract":"<p><p><i>Vibrio parahaemolyticus</i> is the leading cause of illnesses and outbreaks linked to seafood consumption across the globe. Understanding how this pathogen may be adapted to persist along the farm-to-table supply chain has applications for addressing food safety. This study utilized machine learning to develop robust models classifying genomic diversity of <i>V. parahaemolyticus</i> that was isolated from environmental (<i>n</i> = 176), seafood (<i>n</i> = 975), and clinical (<i>n</i> = 865) sample origins. We constructed a pangenome of the respective genome assemblies and employed random forest algorithm to develop predictive models to identify gene clusters encoding metabolism, virulence, and antibiotic resistance that were associated with isolate source type. Comparison of genomes of all seafood-clinical isolates showed high balanced accuracy (≥0.80) and Area Under the Receiver Operating Characteristics curve (≥0.87) for all of these functional features. Major virulence factors including <i>tdh</i>, <i>trh</i>, type III secretion system-related genes, and four alpha-hemolysin genes (<i>hlyA</i>, <i>hlyB</i>, <i>hlyC</i>, and <i>hlyD</i>) were identified as important differentiating factors in our seafood-clinical virulence model, underscoring the need for further investigation. Significant patterns for AMR genes differing among seafood and clinical samples were revealed from our model and genes conferring to tetracycline, elfamycin, and multidrug (phenicol antibiotic, diaminopyrimidine antibiotic, and fluoroquinolone antibiotic) resistance were identified as the top three key variables. These findings provide crucial insights into the development of effective surveillance and management strategies to address the public health threats associated with <i>V. parahaemolyticus</i>.</p>","PeriodicalId":12466,"journal":{"name":"Frontiers in Microbiology","volume":"16 ","pages":"1549260"},"PeriodicalIF":4.0000,"publicationDate":"2025-03-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11961994/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Microbiology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fmicb.2025.1549260","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Vibrio parahaemolyticus is the leading cause of illnesses and outbreaks linked to seafood consumption across the globe. Understanding how this pathogen may be adapted to persist along the farm-to-table supply chain has applications for addressing food safety. This study utilized machine learning to develop robust models classifying genomic diversity of V. parahaemolyticus that was isolated from environmental (n = 176), seafood (n = 975), and clinical (n = 865) sample origins. We constructed a pangenome of the respective genome assemblies and employed random forest algorithm to develop predictive models to identify gene clusters encoding metabolism, virulence, and antibiotic resistance that were associated with isolate source type. Comparison of genomes of all seafood-clinical isolates showed high balanced accuracy (≥0.80) and Area Under the Receiver Operating Characteristics curve (≥0.87) for all of these functional features. Major virulence factors including tdh, trh, type III secretion system-related genes, and four alpha-hemolysin genes (hlyA, hlyB, hlyC, and hlyD) were identified as important differentiating factors in our seafood-clinical virulence model, underscoring the need for further investigation. Significant patterns for AMR genes differing among seafood and clinical samples were revealed from our model and genes conferring to tetracycline, elfamycin, and multidrug (phenicol antibiotic, diaminopyrimidine antibiotic, and fluoroquinolone antibiotic) resistance were identified as the top three key variables. These findings provide crucial insights into the development of effective surveillance and management strategies to address the public health threats associated with V. parahaemolyticus.

生物信息学结合机器学习揭示了环境、海鲜和临床分离副溶血性弧菌之间的差异。
副溶血性弧菌是全球与海鲜消费有关的疾病和疫情的主要原因。了解这种病原体如何适应并在从农场到餐桌的供应链中持续存在,对于解决食品安全问题具有应用价值。本研究利用机器学习建立了强大的模型,对从环境(n = 176)、海鲜(n = 975)和临床(n = 865)样本中分离出来的副溶血性弧菌基因组多样性进行分类。我们构建了各自基因组组合的泛基因组,并采用随机森林算法建立预测模型,以识别与分离源类型相关的编码代谢、毒力和抗生素耐药性的基因簇。所有海产品临床分离株的基因组比较显示,所有这些功能特征的平衡精度(≥0.80)和接受者工作特征曲线下面积(≥0.87)均较高。主要毒力因子包括tdh、trh、III型分泌系统相关基因和4个α -溶血素基因(hlyA、hlyB、hlyC和hlyD)是我们的海鲜-临床毒力模型的重要区分因素,需要进一步研究。该模型揭示了海产品和临床样品之间AMR基因差异的显著模式,并确定了四环素、埃尔法霉素和多药(苯酚类抗生素、二氨基嘧啶类抗生素和氟喹诺酮类抗生素)耐药基因为前三个关键变量。这些发现为制定有效的监测和管理战略以应对与副溶血性弧菌相关的公共卫生威胁提供了重要见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.70
自引率
9.60%
发文量
4837
审稿时长
14 weeks
期刊介绍: Frontiers in Microbiology is a leading journal in its field, publishing rigorously peer-reviewed research across the entire spectrum of microbiology. Field Chief Editor Martin G. Klotz at Washington State University is supported by an outstanding Editorial Board of international researchers. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信