预测蛋白质羰基化位点的可解释深度多层次注意学习。

IF 14.1 1区 材料科学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Jian Zhang, Jingjing Qian, Pei Wang, Xuan Liu, Fuhao Zhang, Haiting Chai, Quan Zou
{"title":"预测蛋白质羰基化位点的可解释深度多层次注意学习。","authors":"Jian Zhang,&nbsp;Jingjing Qian,&nbsp;Pei Wang,&nbsp;Xuan Liu,&nbsp;Fuhao Zhang,&nbsp;Haiting Chai,&nbsp;Quan Zou","doi":"10.1002/advs.202500581","DOIUrl":null,"url":null,"abstract":"<p>Protein carbonylation refers to the covalent modification of proteins through the attachment of carbonyl groups, which arise from oxidative stress. This modification is biologically significant, as it can elicit modifications in protein functionality, signaling cascades, and cellular homeostasis. Accurate prediction of carbonylation sites offers valuable insights into the mechanisms underlying protein carbonylation and the pathogenesis of related diseases. Notably, carbonylation sites and ligand interaction sites, both functional sites, exhibit numerous similarities. The survey reveals that current computation-based approaches tend to make excessive cross-predictions for ligand interaction sites. To tackle this unresolved challenge, selective carbonylation sites (SCANS) is introduced, a novel deep learning-based framework. SCANS employs a multilevel attention strategy to capture both local (segment-level) and global (protein-level) features, utilizes a tailored loss function to penalize cross-predictions (residue-level), and applies transfer learning to augment the specificity of the overall network by leveraging knowledge from pretrained model. These innovative designs have been shown to successfully boost predictive performance and statistically outperforms current methods. Particularly, results on benchmark testing dataset demonstrate that SCANS consistently achieves low false positive rates, including low rates of cross-predictions. Furthermore, motif analyses and interpretations are conducted to provide novel insights into the protein carbonylation sites from various perspectives.</p>","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":"12 23","pages":""},"PeriodicalIF":14.1000,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/advs.202500581","citationCount":"0","resultStr":"{\"title\":\"Explainable Deep Multilevel Attention Learning for Predicting Protein Carbonylation Sites\",\"authors\":\"Jian Zhang,&nbsp;Jingjing Qian,&nbsp;Pei Wang,&nbsp;Xuan Liu,&nbsp;Fuhao Zhang,&nbsp;Haiting Chai,&nbsp;Quan Zou\",\"doi\":\"10.1002/advs.202500581\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Protein carbonylation refers to the covalent modification of proteins through the attachment of carbonyl groups, which arise from oxidative stress. This modification is biologically significant, as it can elicit modifications in protein functionality, signaling cascades, and cellular homeostasis. Accurate prediction of carbonylation sites offers valuable insights into the mechanisms underlying protein carbonylation and the pathogenesis of related diseases. Notably, carbonylation sites and ligand interaction sites, both functional sites, exhibit numerous similarities. The survey reveals that current computation-based approaches tend to make excessive cross-predictions for ligand interaction sites. To tackle this unresolved challenge, selective carbonylation sites (SCANS) is introduced, a novel deep learning-based framework. SCANS employs a multilevel attention strategy to capture both local (segment-level) and global (protein-level) features, utilizes a tailored loss function to penalize cross-predictions (residue-level), and applies transfer learning to augment the specificity of the overall network by leveraging knowledge from pretrained model. These innovative designs have been shown to successfully boost predictive performance and statistically outperforms current methods. Particularly, results on benchmark testing dataset demonstrate that SCANS consistently achieves low false positive rates, including low rates of cross-predictions. Furthermore, motif analyses and interpretations are conducted to provide novel insights into the protein carbonylation sites from various perspectives.</p>\",\"PeriodicalId\":117,\"journal\":{\"name\":\"Advanced Science\",\"volume\":\"12 23\",\"pages\":\"\"},\"PeriodicalIF\":14.1000,\"publicationDate\":\"2025-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/advs.202500581\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Science\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/advs.202500581\",\"RegionNum\":1,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/advs.202500581","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

蛋白质羰基化是指氧化应激引起的蛋白质通过羰基的附着而发生共价修饰。这种修饰具有重要的生物学意义,因为它可以引起蛋白质功能、信号级联和细胞稳态的改变。准确预测羰基化位点为蛋白质羰基化的机制和相关疾病的发病机制提供了有价值的见解。值得注意的是,羰基化位点和配体相互作用位点,都是功能位点,表现出许多相似之处。调查显示,目前基于计算的方法倾向于对配体相互作用位点进行过多的交叉预测。为了解决这个尚未解决的挑战,引入了选择性羰基化位点(SCANS),这是一种新的基于深度学习的框架。SCANS采用多层次的注意力策略来捕捉局部(片段级)和全局(蛋白质级)的特征,利用量身定制的损失函数来惩罚交叉预测(残差级),并应用迁移学习,以增加从预训练模型的知识,整个网络的特异性。这些创新的设计已经被证明可以成功地提高预测性能,并且在统计上优于当前的方法。特别是,在基准测试数据集上的结果表明,SCANS始终实现低假阳性率,包括低交叉预测率。此外,进行基序分析和解释,从不同的角度对蛋白质羰基化位点提供新的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Explainable Deep Multilevel Attention Learning for Predicting Protein Carbonylation Sites

Explainable Deep Multilevel Attention Learning for Predicting Protein Carbonylation Sites

Protein carbonylation refers to the covalent modification of proteins through the attachment of carbonyl groups, which arise from oxidative stress. This modification is biologically significant, as it can elicit modifications in protein functionality, signaling cascades, and cellular homeostasis. Accurate prediction of carbonylation sites offers valuable insights into the mechanisms underlying protein carbonylation and the pathogenesis of related diseases. Notably, carbonylation sites and ligand interaction sites, both functional sites, exhibit numerous similarities. The survey reveals that current computation-based approaches tend to make excessive cross-predictions for ligand interaction sites. To tackle this unresolved challenge, selective carbonylation sites (SCANS) is introduced, a novel deep learning-based framework. SCANS employs a multilevel attention strategy to capture both local (segment-level) and global (protein-level) features, utilizes a tailored loss function to penalize cross-predictions (residue-level), and applies transfer learning to augment the specificity of the overall network by leveraging knowledge from pretrained model. These innovative designs have been shown to successfully boost predictive performance and statistically outperforms current methods. Particularly, results on benchmark testing dataset demonstrate that SCANS consistently achieves low false positive rates, including low rates of cross-predictions. Furthermore, motif analyses and interpretations are conducted to provide novel insights into the protein carbonylation sites from various perspectives.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Advanced Science
Advanced Science CHEMISTRY, MULTIDISCIPLINARYNANOSCIENCE &-NANOSCIENCE & NANOTECHNOLOGY
CiteScore
18.90
自引率
2.60%
发文量
1602
审稿时长
1.9 months
期刊介绍: Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信