DeepMineLys: Deep mining of phage lysins from human microbiome.

IF 7.5 1区 生物学 Q1 CELL BIOLOGY
Cell reports Pub Date : 2024-08-27 Epub Date: 2024-08-06 DOI:10.1016/j.celrep.2024.114583
Yiran Fu, Shuting Yu, Jianfeng Li, Zisha Lao, Xiaofeng Yang, Zhanglin Lin
{"title":"DeepMineLys: Deep mining of phage lysins from human microbiome.","authors":"Yiran Fu, Shuting Yu, Jianfeng Li, Zisha Lao, Xiaofeng Yang, Zhanglin Lin","doi":"10.1016/j.celrep.2024.114583","DOIUrl":null,"url":null,"abstract":"<p><p>Vast shotgun metagenomics data remain an underutilized resource for novel enzymes. Artificial intelligence (AI) has increasingly been applied to protein mining, but its conventional performance evaluation is interpolative in nature, and these trained models often struggle to extrapolate effectively when challenged with unknown data. In this study, we present a framework (DeepMineLys [deep mining of phage lysins from human microbiome]) based on the convolutional neural network (CNN) to identify phage lysins from three human microbiome datasets. When validated with an independent dataset, our method achieved an F1-score of 84.00%, surpassing existing methods by 20.84%. We expressed 16 lysin candidates from the top 100 sequences in E. coli, confirming 11 as active. The best one displayed an activity 6.2-fold that of lysozyme derived from hen egg white, establishing it as the most potent lysin from the human microbiome. Our study also underscores several important issues when applying AI to biology questions. This framework should be applicable for mining other proteins.</p>","PeriodicalId":9798,"journal":{"name":"Cell reports","volume":null,"pages":null},"PeriodicalIF":7.5000,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell reports","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.celrep.2024.114583","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/8/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Vast shotgun metagenomics data remain an underutilized resource for novel enzymes. Artificial intelligence (AI) has increasingly been applied to protein mining, but its conventional performance evaluation is interpolative in nature, and these trained models often struggle to extrapolate effectively when challenged with unknown data. In this study, we present a framework (DeepMineLys [deep mining of phage lysins from human microbiome]) based on the convolutional neural network (CNN) to identify phage lysins from three human microbiome datasets. When validated with an independent dataset, our method achieved an F1-score of 84.00%, surpassing existing methods by 20.84%. We expressed 16 lysin candidates from the top 100 sequences in E. coli, confirming 11 as active. The best one displayed an activity 6.2-fold that of lysozyme derived from hen egg white, establishing it as the most potent lysin from the human microbiome. Our study also underscores several important issues when applying AI to biology questions. This framework should be applicable for mining other proteins.

Abstract Image

DeepMineLys:从人类微生物组中深度挖掘噬菌体溶菌酶。
大量的射枪元基因组学数据仍然是一种未得到充分利用的新型酶资源。人工智能(AI)已越来越多地应用于蛋白质挖掘,但其传统的性能评估本质上是插值式的,这些训练有素的模型在面对未知数据时往往难以有效推断。在本研究中,我们提出了一个基于卷积神经网络(CNN)的框架(DeepMineLys[从人类微生物组中深度挖掘噬菌体溶酶体]),用于从三个人类微生物组数据集中识别噬菌体溶酶体。在使用独立数据集进行验证时,我们的方法取得了 84.00% 的 F1 分数,比现有方法高出 20.84%。我们在大肠杆菌中表达了前 100 个序列中的 16 个候选溶菌酶,确认其中 11 个具有活性。其中最好的一个序列的活性是提取自母鸡蛋白的溶菌酶的 6.2 倍,从而使其成为人类微生物组中最有效的溶菌酶。我们的研究还强调了将人工智能应用于生物学问题时的几个重要问题。这一框架应适用于挖掘其他蛋白质。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Cell reports
Cell reports CELL BIOLOGY-
CiteScore
13.80
自引率
1.10%
发文量
1305
审稿时长
77 days
期刊介绍: Cell Reports publishes high-quality research across the life sciences and focuses on new biological insight as its primary criterion for publication. The journal offers three primary article types: Reports, which are shorter single-point articles, research articles, which are longer and provide deeper mechanistic insights, and resources, which highlight significant technical advances or major informational datasets that contribute to biological advances. Reviews covering recent literature in emerging and active fields are also accepted. The Cell Reports Portfolio includes gold open-access journals that cover life, medical, and physical sciences, and its mission is to make cutting-edge research and methodologies available to a wide readership. The journal's professional in-house editors work closely with authors, reviewers, and the scientific advisory board, which consists of current and future leaders in their respective fields. The advisory board guides the scope, content, and quality of the journal, but editorial decisions are independently made by the in-house scientific editors of Cell Reports.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信