Jian Zhang, Jingjing Qian, Pei Wang, Xuan Liu, Fuhao Zhang, Haiting Chai, Quan Zou
{"title":"预测蛋白质羰基化位点的可解释深度多层次注意学习。","authors":"Jian Zhang, Jingjing Qian, Pei Wang, Xuan Liu, Fuhao Zhang, Haiting Chai, Quan Zou","doi":"10.1002/advs.202500581","DOIUrl":null,"url":null,"abstract":"<p>Protein carbonylation refers to the covalent modification of proteins through the attachment of carbonyl groups, which arise from oxidative stress. This modification is biologically significant, as it can elicit modifications in protein functionality, signaling cascades, and cellular homeostasis. Accurate prediction of carbonylation sites offers valuable insights into the mechanisms underlying protein carbonylation and the pathogenesis of related diseases. Notably, carbonylation sites and ligand interaction sites, both functional sites, exhibit numerous similarities. The survey reveals that current computation-based approaches tend to make excessive cross-predictions for ligand interaction sites. To tackle this unresolved challenge, selective carbonylation sites (SCANS) is introduced, a novel deep learning-based framework. SCANS employs a multilevel attention strategy to capture both local (segment-level) and global (protein-level) features, utilizes a tailored loss function to penalize cross-predictions (residue-level), and applies transfer learning to augment the specificity of the overall network by leveraging knowledge from pretrained model. These innovative designs have been shown to successfully boost predictive performance and statistically outperforms current methods. Particularly, results on benchmark testing dataset demonstrate that SCANS consistently achieves low false positive rates, including low rates of cross-predictions. Furthermore, motif analyses and interpretations are conducted to provide novel insights into the protein carbonylation sites from various perspectives.</p>","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":"12 23","pages":""},"PeriodicalIF":14.1000,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/advs.202500581","citationCount":"0","resultStr":"{\"title\":\"Explainable Deep Multilevel Attention Learning for Predicting Protein Carbonylation Sites\",\"authors\":\"Jian Zhang, Jingjing Qian, Pei Wang, Xuan Liu, Fuhao Zhang, Haiting Chai, Quan Zou\",\"doi\":\"10.1002/advs.202500581\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Protein carbonylation refers to the covalent modification of proteins through the attachment of carbonyl groups, which arise from oxidative stress. This modification is biologically significant, as it can elicit modifications in protein functionality, signaling cascades, and cellular homeostasis. Accurate prediction of carbonylation sites offers valuable insights into the mechanisms underlying protein carbonylation and the pathogenesis of related diseases. Notably, carbonylation sites and ligand interaction sites, both functional sites, exhibit numerous similarities. The survey reveals that current computation-based approaches tend to make excessive cross-predictions for ligand interaction sites. To tackle this unresolved challenge, selective carbonylation sites (SCANS) is introduced, a novel deep learning-based framework. SCANS employs a multilevel attention strategy to capture both local (segment-level) and global (protein-level) features, utilizes a tailored loss function to penalize cross-predictions (residue-level), and applies transfer learning to augment the specificity of the overall network by leveraging knowledge from pretrained model. These innovative designs have been shown to successfully boost predictive performance and statistically outperforms current methods. Particularly, results on benchmark testing dataset demonstrate that SCANS consistently achieves low false positive rates, including low rates of cross-predictions. Furthermore, motif analyses and interpretations are conducted to provide novel insights into the protein carbonylation sites from various perspectives.</p>\",\"PeriodicalId\":117,\"journal\":{\"name\":\"Advanced Science\",\"volume\":\"12 23\",\"pages\":\"\"},\"PeriodicalIF\":14.1000,\"publicationDate\":\"2025-03-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/advs.202500581\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Science\",\"FirstCategoryId\":\"88\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/advs.202500581\",\"RegionNum\":1,\"RegionCategory\":\"材料科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/advs.202500581","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Explainable Deep Multilevel Attention Learning for Predicting Protein Carbonylation Sites
Protein carbonylation refers to the covalent modification of proteins through the attachment of carbonyl groups, which arise from oxidative stress. This modification is biologically significant, as it can elicit modifications in protein functionality, signaling cascades, and cellular homeostasis. Accurate prediction of carbonylation sites offers valuable insights into the mechanisms underlying protein carbonylation and the pathogenesis of related diseases. Notably, carbonylation sites and ligand interaction sites, both functional sites, exhibit numerous similarities. The survey reveals that current computation-based approaches tend to make excessive cross-predictions for ligand interaction sites. To tackle this unresolved challenge, selective carbonylation sites (SCANS) is introduced, a novel deep learning-based framework. SCANS employs a multilevel attention strategy to capture both local (segment-level) and global (protein-level) features, utilizes a tailored loss function to penalize cross-predictions (residue-level), and applies transfer learning to augment the specificity of the overall network by leveraging knowledge from pretrained model. These innovative designs have been shown to successfully boost predictive performance and statistically outperforms current methods. Particularly, results on benchmark testing dataset demonstrate that SCANS consistently achieves low false positive rates, including low rates of cross-predictions. Furthermore, motif analyses and interpretations are conducted to provide novel insights into the protein carbonylation sites from various perspectives.
期刊介绍:
Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.