DeepSANet:一种用于沙门氏菌分层地理来源归属的深度学习方法

IF 8 1区 农林科学 Q1 FOOD SCIENCE & TECHNOLOGY
Shan Liang , Shaohui Mei , Jiahao Ji , Bingkui Li , Mingyuan Xu , Miaomiao Chen , Yawen Huang , Yuxin Zheng , Di Chen , Xiangyu Deng , Shaoting Li , Hongmei Zhang
{"title":"DeepSANet:一种用于沙门氏菌分层地理来源归属的深度学习方法","authors":"Shan Liang ,&nbsp;Shaohui Mei ,&nbsp;Jiahao Ji ,&nbsp;Bingkui Li ,&nbsp;Mingyuan Xu ,&nbsp;Miaomiao Chen ,&nbsp;Yawen Huang ,&nbsp;Yuxin Zheng ,&nbsp;Di Chen ,&nbsp;Xiangyu Deng ,&nbsp;Shaoting Li ,&nbsp;Hongmei Zhang","doi":"10.1016/j.foodres.2025.117554","DOIUrl":null,"url":null,"abstract":"<div><div>Microbial foodborne pathogens pose serious threats to public health and the food industry, with <em>Salmonella</em> being particularly widespread and hazardous. Tracing its geographical origins is crucial for outbreak control and prevention. However, existing genomic approaches lack required precision. To address this, we propose a deep source attribution network (DeepSANet) for hierarchical geographical source attribution of <em>Salmonella</em>, as the first to introduce deep learning to this task. In DeepSANet, a Swin Transformer is used to generate deep representations for genomic data. Then, we introduce the parallel hierarchical predictor (PHP) module to achieve simultaneous predictions of multi-level geographical origins. Finally, an adaptive hierarchical transfer (AHT) loss is designed to leverage the label hierarchy, enhancing prediction accuracy and consistency across multiple granularities. We conducted experiments on a public <em>Salmonella enterica</em> serovar Enteritidis genome dataset. DeepSANet outperforms existing methods, achieving source attribution accuracies of 91.88 %, 87.05 %, and 80.83 % at the region, subregion, and country levels, respectively. To comprehensively evaluate the generalizability of the proposed method, we constructed a large-scale dataset for <em>Salmonella</em> hierarchical geographical source attribution based on the EnteroBase, comprising globally distributed isolates across diverse serovars. Results show that DeepSANet achieves over 90 % source prediction accuracy across all geographical levels, demonstrating generalization capability to diverse serovars. Notably, above performance was achieved using only 3,002 core genome multilocus sequence typing (cgMLST) loci as features, highlighting the model's efficiency and practicality. These findings suggest that deep learning holds promise for supporting foodborne pathogen surveillance.</div></div>","PeriodicalId":323,"journal":{"name":"Food Research International","volume":"221 ","pages":"Article 117554"},"PeriodicalIF":8.0000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DeepSANet: A deep learning approach for hierarchical geographical source attribution of Salmonella\",\"authors\":\"Shan Liang ,&nbsp;Shaohui Mei ,&nbsp;Jiahao Ji ,&nbsp;Bingkui Li ,&nbsp;Mingyuan Xu ,&nbsp;Miaomiao Chen ,&nbsp;Yawen Huang ,&nbsp;Yuxin Zheng ,&nbsp;Di Chen ,&nbsp;Xiangyu Deng ,&nbsp;Shaoting Li ,&nbsp;Hongmei Zhang\",\"doi\":\"10.1016/j.foodres.2025.117554\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Microbial foodborne pathogens pose serious threats to public health and the food industry, with <em>Salmonella</em> being particularly widespread and hazardous. Tracing its geographical origins is crucial for outbreak control and prevention. However, existing genomic approaches lack required precision. To address this, we propose a deep source attribution network (DeepSANet) for hierarchical geographical source attribution of <em>Salmonella</em>, as the first to introduce deep learning to this task. In DeepSANet, a Swin Transformer is used to generate deep representations for genomic data. Then, we introduce the parallel hierarchical predictor (PHP) module to achieve simultaneous predictions of multi-level geographical origins. Finally, an adaptive hierarchical transfer (AHT) loss is designed to leverage the label hierarchy, enhancing prediction accuracy and consistency across multiple granularities. We conducted experiments on a public <em>Salmonella enterica</em> serovar Enteritidis genome dataset. DeepSANet outperforms existing methods, achieving source attribution accuracies of 91.88 %, 87.05 %, and 80.83 % at the region, subregion, and country levels, respectively. To comprehensively evaluate the generalizability of the proposed method, we constructed a large-scale dataset for <em>Salmonella</em> hierarchical geographical source attribution based on the EnteroBase, comprising globally distributed isolates across diverse serovars. Results show that DeepSANet achieves over 90 % source prediction accuracy across all geographical levels, demonstrating generalization capability to diverse serovars. Notably, above performance was achieved using only 3,002 core genome multilocus sequence typing (cgMLST) loci as features, highlighting the model's efficiency and practicality. These findings suggest that deep learning holds promise for supporting foodborne pathogen surveillance.</div></div>\",\"PeriodicalId\":323,\"journal\":{\"name\":\"Food Research International\",\"volume\":\"221 \",\"pages\":\"Article 117554\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Food Research International\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0963996925018927\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"FOOD SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Food Research International","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0963996925018927","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

微生物食源性病原体对公众健康和食品工业构成严重威胁,沙门氏菌尤其广泛和危险。追踪其地理来源对于控制和预防疫情至关重要。然而,现有的基因组方法缺乏所需的精确度。为了解决这个问题,我们提出了一个深度来源归因网络(DeepSANet),用于沙门氏菌的分层地理来源归因,这是第一个将深度学习引入该任务的网络。在DeepSANet中,Swin Transformer用于生成基因组数据的深度表示。然后,我们引入了并行分层预测器(PHP)模块来实现多层次地理起源的同时预测。最后,设计了一种自适应分层转移(AHT)损失来利用标签层次结构,提高预测的准确性和跨多粒度的一致性。我们在公共肠炎沙门氏菌血清型基因组数据集上进行了实验。DeepSANet优于现有方法,在区域、子区域和国家层面分别实现了91.88%、87.05%和80.83%的源归属准确率。为了全面评估所提出方法的普遍性,我们基于EnteroBase构建了沙门氏菌分层地理来源归属的大规模数据集,包括全球分布的不同血清型的分离株。结果表明,DeepSANet在所有地理级别上实现了超过90%的源预测精度,展示了对不同服务器的泛化能力。值得注意的是,上述性能仅以3,002个核心基因组多位点序列分型(cgMLST)位点为特征,突出了模型的效率和实用性。这些发现表明,深度学习有望支持食源性病原体监测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

DeepSANet: A deep learning approach for hierarchical geographical source attribution of Salmonella

DeepSANet: A deep learning approach for hierarchical geographical source attribution of Salmonella
Microbial foodborne pathogens pose serious threats to public health and the food industry, with Salmonella being particularly widespread and hazardous. Tracing its geographical origins is crucial for outbreak control and prevention. However, existing genomic approaches lack required precision. To address this, we propose a deep source attribution network (DeepSANet) for hierarchical geographical source attribution of Salmonella, as the first to introduce deep learning to this task. In DeepSANet, a Swin Transformer is used to generate deep representations for genomic data. Then, we introduce the parallel hierarchical predictor (PHP) module to achieve simultaneous predictions of multi-level geographical origins. Finally, an adaptive hierarchical transfer (AHT) loss is designed to leverage the label hierarchy, enhancing prediction accuracy and consistency across multiple granularities. We conducted experiments on a public Salmonella enterica serovar Enteritidis genome dataset. DeepSANet outperforms existing methods, achieving source attribution accuracies of 91.88 %, 87.05 %, and 80.83 % at the region, subregion, and country levels, respectively. To comprehensively evaluate the generalizability of the proposed method, we constructed a large-scale dataset for Salmonella hierarchical geographical source attribution based on the EnteroBase, comprising globally distributed isolates across diverse serovars. Results show that DeepSANet achieves over 90 % source prediction accuracy across all geographical levels, demonstrating generalization capability to diverse serovars. Notably, above performance was achieved using only 3,002 core genome multilocus sequence typing (cgMLST) loci as features, highlighting the model's efficiency and practicality. These findings suggest that deep learning holds promise for supporting foodborne pathogen surveillance.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Food Research International
Food Research International 工程技术-食品科技
CiteScore
12.50
自引率
7.40%
发文量
1183
审稿时长
79 days
期刊介绍: Food Research International serves as a rapid dissemination platform for significant and impactful research in food science, technology, engineering, and nutrition. The journal focuses on publishing novel, high-quality, and high-impact review papers, original research papers, and letters to the editors across various disciplines in the science and technology of food. Additionally, it follows a policy of publishing special issues on topical and emergent subjects in food research or related areas. Selected, peer-reviewed papers from scientific meetings, workshops, and conferences on the science, technology, and engineering of foods are also featured in special issues.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信