{"title":"BioSeq_Ksite: Multi-perspective feature-driven prediction of protein succinylation based on an adaptive attention module with SSBCE loss strategy","authors":"Lun Zhu , Ziqi Zhang , Sen Yang","doi":"10.1016/j.ijbiomac.2025.143601","DOIUrl":null,"url":null,"abstract":"<div><div>Succinylation is a post-translational modification in which a succinyl group is transferred to the lysine residue of a protein, playing a crucial role in regulating both protein structure and cellular function. This paper introduces a novel sequential model, BioSeq_Ksite, designed to enhance succinylation prediction accuracy by integrating an adaptive attention mechanism and a joint loss function. This study first presents a new hybrid feature, ProtFusion, which combines the physicochemical properties of amino acids with pretrained models. Next, this paper introduces an adaptive attention module that enables the model to autonomously identify important features during training. Additionally, a gated network architecture is adopted to create a dual-branch sequential model. Finally, by combining sensitivity, specificity, and cross-entropy loss, a new joint loss function is proposed, which is used for succinylation prediction for the first time and significantly enhances the model's ability to handle class-imbalanced data. Evaluation on the test dataset shows that BioSeq_Ksite outperforms other models in MCC, Sn, AUC, and F1-Score, with a 7.68 % improvement in MCC over the second-best model. It provides an efficient and reliable tool for succinylation research and application. BioSeq_Ksite can be accessed at <span><span>https://github.com/zzq1124ZHZ/BioSeq_Ksite</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":333,"journal":{"name":"International Journal of Biological Macromolecules","volume":"310 ","pages":"Article 143601"},"PeriodicalIF":7.7000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biological Macromolecules","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141813025041534","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Succinylation is a post-translational modification in which a succinyl group is transferred to the lysine residue of a protein, playing a crucial role in regulating both protein structure and cellular function. This paper introduces a novel sequential model, BioSeq_Ksite, designed to enhance succinylation prediction accuracy by integrating an adaptive attention mechanism and a joint loss function. This study first presents a new hybrid feature, ProtFusion, which combines the physicochemical properties of amino acids with pretrained models. Next, this paper introduces an adaptive attention module that enables the model to autonomously identify important features during training. Additionally, a gated network architecture is adopted to create a dual-branch sequential model. Finally, by combining sensitivity, specificity, and cross-entropy loss, a new joint loss function is proposed, which is used for succinylation prediction for the first time and significantly enhances the model's ability to handle class-imbalanced data. Evaluation on the test dataset shows that BioSeq_Ksite outperforms other models in MCC, Sn, AUC, and F1-Score, with a 7.68 % improvement in MCC over the second-best model. It provides an efficient and reliable tool for succinylation research and application. BioSeq_Ksite can be accessed at https://github.com/zzq1124ZHZ/BioSeq_Ksite.
期刊介绍:
The International Journal of Biological Macromolecules is a well-established international journal dedicated to research on the chemical and biological aspects of natural macromolecules. Focusing on proteins, macromolecular carbohydrates, glycoproteins, proteoglycans, lignins, biological poly-acids, and nucleic acids, the journal presents the latest findings in molecular structure, properties, biological activities, interactions, modifications, and functional properties. Papers must offer new and novel insights, encompassing related model systems, structural conformational studies, theoretical developments, and analytical techniques. Each paper is required to primarily focus on at least one named biological macromolecule, reflected in the title, abstract, and text.