Shan Liang , Shaohui Mei , Jiahao Ji , Bingkui Li , Mingyuan Xu , Miaomiao Chen , Yawen Huang , Yuxin Zheng , Di Chen , Xiangyu Deng , Shaoting Li , Hongmei Zhang
{"title":"DeepSANet:一种用于沙门氏菌分层地理来源归属的深度学习方法","authors":"Shan Liang , Shaohui Mei , Jiahao Ji , Bingkui Li , Mingyuan Xu , Miaomiao Chen , Yawen Huang , Yuxin Zheng , Di Chen , Xiangyu Deng , Shaoting Li , Hongmei Zhang","doi":"10.1016/j.foodres.2025.117554","DOIUrl":null,"url":null,"abstract":"<div><div>Microbial foodborne pathogens pose serious threats to public health and the food industry, with <em>Salmonella</em> being particularly widespread and hazardous. Tracing its geographical origins is crucial for outbreak control and prevention. However, existing genomic approaches lack required precision. To address this, we propose a deep source attribution network (DeepSANet) for hierarchical geographical source attribution of <em>Salmonella</em>, as the first to introduce deep learning to this task. In DeepSANet, a Swin Transformer is used to generate deep representations for genomic data. Then, we introduce the parallel hierarchical predictor (PHP) module to achieve simultaneous predictions of multi-level geographical origins. Finally, an adaptive hierarchical transfer (AHT) loss is designed to leverage the label hierarchy, enhancing prediction accuracy and consistency across multiple granularities. We conducted experiments on a public <em>Salmonella enterica</em> serovar Enteritidis genome dataset. DeepSANet outperforms existing methods, achieving source attribution accuracies of 91.88 %, 87.05 %, and 80.83 % at the region, subregion, and country levels, respectively. To comprehensively evaluate the generalizability of the proposed method, we constructed a large-scale dataset for <em>Salmonella</em> hierarchical geographical source attribution based on the EnteroBase, comprising globally distributed isolates across diverse serovars. Results show that DeepSANet achieves over 90 % source prediction accuracy across all geographical levels, demonstrating generalization capability to diverse serovars. Notably, above performance was achieved using only 3,002 core genome multilocus sequence typing (cgMLST) loci as features, highlighting the model's efficiency and practicality. These findings suggest that deep learning holds promise for supporting foodborne pathogen surveillance.</div></div>","PeriodicalId":323,"journal":{"name":"Food Research International","volume":"221 ","pages":"Article 117554"},"PeriodicalIF":8.0000,"publicationDate":"2025-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DeepSANet: A deep learning approach for hierarchical geographical source attribution of Salmonella\",\"authors\":\"Shan Liang , Shaohui Mei , Jiahao Ji , Bingkui Li , Mingyuan Xu , Miaomiao Chen , Yawen Huang , Yuxin Zheng , Di Chen , Xiangyu Deng , Shaoting Li , Hongmei Zhang\",\"doi\":\"10.1016/j.foodres.2025.117554\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Microbial foodborne pathogens pose serious threats to public health and the food industry, with <em>Salmonella</em> being particularly widespread and hazardous. Tracing its geographical origins is crucial for outbreak control and prevention. However, existing genomic approaches lack required precision. To address this, we propose a deep source attribution network (DeepSANet) for hierarchical geographical source attribution of <em>Salmonella</em>, as the first to introduce deep learning to this task. In DeepSANet, a Swin Transformer is used to generate deep representations for genomic data. Then, we introduce the parallel hierarchical predictor (PHP) module to achieve simultaneous predictions of multi-level geographical origins. Finally, an adaptive hierarchical transfer (AHT) loss is designed to leverage the label hierarchy, enhancing prediction accuracy and consistency across multiple granularities. We conducted experiments on a public <em>Salmonella enterica</em> serovar Enteritidis genome dataset. DeepSANet outperforms existing methods, achieving source attribution accuracies of 91.88 %, 87.05 %, and 80.83 % at the region, subregion, and country levels, respectively. To comprehensively evaluate the generalizability of the proposed method, we constructed a large-scale dataset for <em>Salmonella</em> hierarchical geographical source attribution based on the EnteroBase, comprising globally distributed isolates across diverse serovars. Results show that DeepSANet achieves over 90 % source prediction accuracy across all geographical levels, demonstrating generalization capability to diverse serovars. Notably, above performance was achieved using only 3,002 core genome multilocus sequence typing (cgMLST) loci as features, highlighting the model's efficiency and practicality. These findings suggest that deep learning holds promise for supporting foodborne pathogen surveillance.</div></div>\",\"PeriodicalId\":323,\"journal\":{\"name\":\"Food Research International\",\"volume\":\"221 \",\"pages\":\"Article 117554\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-09-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Food Research International\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0963996925018927\",\"RegionNum\":1,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"FOOD SCIENCE & TECHNOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Food Research International","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0963996925018927","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
DeepSANet: A deep learning approach for hierarchical geographical source attribution of Salmonella
Microbial foodborne pathogens pose serious threats to public health and the food industry, with Salmonella being particularly widespread and hazardous. Tracing its geographical origins is crucial for outbreak control and prevention. However, existing genomic approaches lack required precision. To address this, we propose a deep source attribution network (DeepSANet) for hierarchical geographical source attribution of Salmonella, as the first to introduce deep learning to this task. In DeepSANet, a Swin Transformer is used to generate deep representations for genomic data. Then, we introduce the parallel hierarchical predictor (PHP) module to achieve simultaneous predictions of multi-level geographical origins. Finally, an adaptive hierarchical transfer (AHT) loss is designed to leverage the label hierarchy, enhancing prediction accuracy and consistency across multiple granularities. We conducted experiments on a public Salmonella enterica serovar Enteritidis genome dataset. DeepSANet outperforms existing methods, achieving source attribution accuracies of 91.88 %, 87.05 %, and 80.83 % at the region, subregion, and country levels, respectively. To comprehensively evaluate the generalizability of the proposed method, we constructed a large-scale dataset for Salmonella hierarchical geographical source attribution based on the EnteroBase, comprising globally distributed isolates across diverse serovars. Results show that DeepSANet achieves over 90 % source prediction accuracy across all geographical levels, demonstrating generalization capability to diverse serovars. Notably, above performance was achieved using only 3,002 core genome multilocus sequence typing (cgMLST) loci as features, highlighting the model's efficiency and practicality. These findings suggest that deep learning holds promise for supporting foodborne pathogen surveillance.
期刊介绍:
Food Research International serves as a rapid dissemination platform for significant and impactful research in food science, technology, engineering, and nutrition. The journal focuses on publishing novel, high-quality, and high-impact review papers, original research papers, and letters to the editors across various disciplines in the science and technology of food. Additionally, it follows a policy of publishing special issues on topical and emergent subjects in food research or related areas. Selected, peer-reviewed papers from scientific meetings, workshops, and conferences on the science, technology, and engineering of foods are also featured in special issues.