利用全基因组测序数据分析肠炎沙门氏菌的食物来源

IF 6.6 2区 医学 Q1 IMMUNOLOGY
Erica Billig Rose, Molly K. Steele, Beth Tolar, James Pettengill, Michael Batz, Michael Bazaco, Berhanu Tameru, Zhaohui Cui, Rebecca L. Lindsey, Mustafa Simmons, Jess Chen, Drew Posny, Heather Carleton, Beau B. Bruce
{"title":"利用全基因组测序数据分析肠炎沙门氏菌的食物来源","authors":"Erica Billig Rose, Molly K. Steele, Beth Tolar, James Pettengill, Michael Batz, Michael Bazaco, Berhanu Tameru, Zhaohui Cui, Rebecca L. Lindsey, Mustafa Simmons, Jess Chen, Drew Posny, Heather Carleton, Beau B. Bruce","doi":"10.3201/eid3104.241172","DOIUrl":null,"url":null,"abstract":"<p><em>Salmonella enterica</em> bacteria are a leading cause of foodborne illness in the United States; however, most <em>Salmonella</em> illnesses are not associated with known outbreaks, and predicting the source of sporadic illnesses remains a challenge. We used a supervised random forest model to determine the most likely sources responsible for human salmonellosis cases in the United States. We trained the model by using whole-genome multilocus sequence typing data from 18,661 <em>Salmonella</em> isolates from collected single food sources and used feature selection to determine the subset of loci most influential for prediction. The overall out-of-bag accuracy of the trained model was 91%; the highest prediction accuracy was for chicken (97%). We applied the trained model to 6,470 isolates from humans with unknown exposure to predict the source of infection. Our model predicted that &gt;33% of the human-derived <em>Salmonella</em> isolates originated from chicken and 27% were from vegetables.</p>","PeriodicalId":11595,"journal":{"name":"Emerging Infectious Diseases","volume":"63 1","pages":""},"PeriodicalIF":6.6000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Attribution of Salmonella enterica to Food Sources by Using Whole-Genome Sequencing Data\",\"authors\":\"Erica Billig Rose, Molly K. Steele, Beth Tolar, James Pettengill, Michael Batz, Michael Bazaco, Berhanu Tameru, Zhaohui Cui, Rebecca L. Lindsey, Mustafa Simmons, Jess Chen, Drew Posny, Heather Carleton, Beau B. Bruce\",\"doi\":\"10.3201/eid3104.241172\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><em>Salmonella enterica</em> bacteria are a leading cause of foodborne illness in the United States; however, most <em>Salmonella</em> illnesses are not associated with known outbreaks, and predicting the source of sporadic illnesses remains a challenge. We used a supervised random forest model to determine the most likely sources responsible for human salmonellosis cases in the United States. We trained the model by using whole-genome multilocus sequence typing data from 18,661 <em>Salmonella</em> isolates from collected single food sources and used feature selection to determine the subset of loci most influential for prediction. The overall out-of-bag accuracy of the trained model was 91%; the highest prediction accuracy was for chicken (97%). We applied the trained model to 6,470 isolates from humans with unknown exposure to predict the source of infection. Our model predicted that &gt;33% of the human-derived <em>Salmonella</em> isolates originated from chicken and 27% were from vegetables.</p>\",\"PeriodicalId\":11595,\"journal\":{\"name\":\"Emerging Infectious Diseases\",\"volume\":\"63 1\",\"pages\":\"\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Emerging Infectious Diseases\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3201/eid3104.241172\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"IMMUNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Emerging Infectious Diseases","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3201/eid3104.241172","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"IMMUNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

在美国,肠道沙门氏菌是食源性疾病的主要原因;然而,大多数沙门氏菌疾病与已知的暴发无关,并且预测散发疾病的来源仍然是一项挑战。我们使用监督随机森林模型来确定美国人类沙门氏菌病病例最可能的来源。我们使用从单一食物来源收集的18661株沙门氏菌的全基因组多位点序列分型数据来训练模型,并使用特征选择来确定对预测最有影响的位点子集。训练模型的整体脱袋准确率为91%;鸡的预测准确率最高(97%)。我们将训练好的模型应用于6470株来自未知暴露的人类分离株,以预测感染源。我们的模型预测33%的人源性沙门氏菌分离株来自鸡肉,27%来自蔬菜。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Attribution of Salmonella enterica to Food Sources by Using Whole-Genome Sequencing Data

Salmonella enterica bacteria are a leading cause of foodborne illness in the United States; however, most Salmonella illnesses are not associated with known outbreaks, and predicting the source of sporadic illnesses remains a challenge. We used a supervised random forest model to determine the most likely sources responsible for human salmonellosis cases in the United States. We trained the model by using whole-genome multilocus sequence typing data from 18,661 Salmonella isolates from collected single food sources and used feature selection to determine the subset of loci most influential for prediction. The overall out-of-bag accuracy of the trained model was 91%; the highest prediction accuracy was for chicken (97%). We applied the trained model to 6,470 isolates from humans with unknown exposure to predict the source of infection. Our model predicted that >33% of the human-derived Salmonella isolates originated from chicken and 27% were from vegetables.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Emerging Infectious Diseases
Emerging Infectious Diseases 医学-传染病学
CiteScore
17.30
自引率
1.70%
发文量
505
审稿时长
1 months
期刊介绍: Emerging Infectious Diseases is a monthly open access journal published by the Centers for Disease Control and Prevention. The primary goal of this peer-reviewed journal is to advance the global recognition of both new and reemerging infectious diseases, while also enhancing our understanding of the underlying factors that contribute to disease emergence, prevention, and elimination. Targeted towards professionals in the field of infectious diseases and related sciences, the journal encourages diverse contributions from experts in academic research, industry, clinical practice, public health, as well as specialists in economics, social sciences, and other relevant disciplines. By fostering a collaborative approach, Emerging Infectious Diseases aims to facilitate interdisciplinary dialogue and address the multifaceted challenges posed by infectious diseases.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信