T4Seeker:用于识别 IV 型分泌效应物的混合模型。

IF 4.4 1区 生物学 Q1 BIOLOGY
Jing Li, Shida He, Jian Zhang, Feng Zhang, Quan Zou, Fengming Ni
{"title":"T4Seeker:用于识别 IV 型分泌效应物的混合模型。","authors":"Jing Li, Shida He, Jian Zhang, Feng Zhang, Quan Zou, Fengming Ni","doi":"10.1186/s12915-024-02064-z","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The type IV secretion system is widely present in various bacteria, such as Salmonella, Escherichia coli, and Helicobacter pylori. These bacteria use the type IV secretion system to secrete type IV secretion effectors, infect host cells, and disrupt or modulate the communication pathways. In this study, type III and type VI secretion effectors were used as negative samples to train a robust model.</p><p><strong>Results: </strong>The area under the curve of T4Seeker on the validation and independent test sets were 0.947 and 0.970, respectively, demonstrating the strong predictive capacity and robustness of T4Seeker. After comparing with the classic and state-of-the-art T4SE identification models, we found that T4Seeker, which is based on traditional features and large language model features, had a higher predictive ability.</p><p><strong>Conclusion: </strong>The T4Seeker proposed in this study demonstrates superior performance in the field of T4SEs prediction. By integrating features at multiple levels, it achieves higher predictive accuracy and strong generalization capability, providing an effective tool for future T4SE research.</p>","PeriodicalId":9339,"journal":{"name":"BMC Biology","volume":"22 1","pages":"259"},"PeriodicalIF":4.4000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11566746/pdf/","citationCount":"0","resultStr":"{\"title\":\"T4Seeker: a hybrid model for type IV secretion effectors identification.\",\"authors\":\"Jing Li, Shida He, Jian Zhang, Feng Zhang, Quan Zou, Fengming Ni\",\"doi\":\"10.1186/s12915-024-02064-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The type IV secretion system is widely present in various bacteria, such as Salmonella, Escherichia coli, and Helicobacter pylori. These bacteria use the type IV secretion system to secrete type IV secretion effectors, infect host cells, and disrupt or modulate the communication pathways. In this study, type III and type VI secretion effectors were used as negative samples to train a robust model.</p><p><strong>Results: </strong>The area under the curve of T4Seeker on the validation and independent test sets were 0.947 and 0.970, respectively, demonstrating the strong predictive capacity and robustness of T4Seeker. After comparing with the classic and state-of-the-art T4SE identification models, we found that T4Seeker, which is based on traditional features and large language model features, had a higher predictive ability.</p><p><strong>Conclusion: </strong>The T4Seeker proposed in this study demonstrates superior performance in the field of T4SEs prediction. By integrating features at multiple levels, it achieves higher predictive accuracy and strong generalization capability, providing an effective tool for future T4SE research.</p>\",\"PeriodicalId\":9339,\"journal\":{\"name\":\"BMC Biology\",\"volume\":\"22 1\",\"pages\":\"259\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11566746/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12915-024-02064-z\",\"RegionNum\":1,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12915-024-02064-z","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:IV 型分泌系统广泛存在于各种细菌中,如沙门氏菌、大肠杆菌和幽门螺旋杆菌。这些细菌利用 IV 型分泌系统分泌 IV 型分泌效应物,感染宿主细胞,破坏或调节通讯途径。在这项研究中,III型和VI型分泌效应物被用作训练稳健模型的阴性样本:结果:T4Seeker 在验证集和独立测试集上的曲线下面积分别为 0.947 和 0.970,表明 T4Seeker 具有很强的预测能力和鲁棒性。在与经典的和最先进的 T4SE 识别模型比较后,我们发现基于传统特征和大语言模型特征的 T4Seeker 具有更高的预测能力:结论:本研究提出的 T4Seeker 在 T4SE 预测领域表现出色。结论:本研究提出的 T4Seeker 在 T4SE 预测领域表现出了卓越的性能,通过整合多个层面的特征,它实现了更高的预测准确率和更强的泛化能力,为未来的 T4SE 研究提供了有效的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
T4Seeker: a hybrid model for type IV secretion effectors identification.

Background: The type IV secretion system is widely present in various bacteria, such as Salmonella, Escherichia coli, and Helicobacter pylori. These bacteria use the type IV secretion system to secrete type IV secretion effectors, infect host cells, and disrupt or modulate the communication pathways. In this study, type III and type VI secretion effectors were used as negative samples to train a robust model.

Results: The area under the curve of T4Seeker on the validation and independent test sets were 0.947 and 0.970, respectively, demonstrating the strong predictive capacity and robustness of T4Seeker. After comparing with the classic and state-of-the-art T4SE identification models, we found that T4Seeker, which is based on traditional features and large language model features, had a higher predictive ability.

Conclusion: The T4Seeker proposed in this study demonstrates superior performance in the field of T4SEs prediction. By integrating features at multiple levels, it achieves higher predictive accuracy and strong generalization capability, providing an effective tool for future T4SE research.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Biology
BMC Biology 生物-生物学
CiteScore
7.80
自引率
1.90%
发文量
260
审稿时长
3 months
期刊介绍: BMC Biology is a broad scope journal covering all areas of biology. Our content includes research articles, new methods and tools. BMC Biology also publishes reviews, Q&A, and commentaries.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信