BioSeq_Ksite: Multi-perspective feature-driven prediction of protein succinylation based on an adaptive attention module with SSBCE loss strategy

IF 7.7 1区 化学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Lun Zhu , Ziqi Zhang , Sen Yang
{"title":"BioSeq_Ksite: Multi-perspective feature-driven prediction of protein succinylation based on an adaptive attention module with SSBCE loss strategy","authors":"Lun Zhu ,&nbsp;Ziqi Zhang ,&nbsp;Sen Yang","doi":"10.1016/j.ijbiomac.2025.143601","DOIUrl":null,"url":null,"abstract":"<div><div>Succinylation is a post-translational modification in which a succinyl group is transferred to the lysine residue of a protein, playing a crucial role in regulating both protein structure and cellular function. This paper introduces a novel sequential model, BioSeq_Ksite, designed to enhance succinylation prediction accuracy by integrating an adaptive attention mechanism and a joint loss function. This study first presents a new hybrid feature, ProtFusion, which combines the physicochemical properties of amino acids with pretrained models. Next, this paper introduces an adaptive attention module that enables the model to autonomously identify important features during training. Additionally, a gated network architecture is adopted to create a dual-branch sequential model. Finally, by combining sensitivity, specificity, and cross-entropy loss, a new joint loss function is proposed, which is used for succinylation prediction for the first time and significantly enhances the model's ability to handle class-imbalanced data. Evaluation on the test dataset shows that BioSeq_Ksite outperforms other models in MCC, Sn, AUC, and F1-Score, with a 7.68 % improvement in MCC over the second-best model. It provides an efficient and reliable tool for succinylation research and application. BioSeq_Ksite can be accessed at <span><span>https://github.com/zzq1124ZHZ/BioSeq_Ksite</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":333,"journal":{"name":"International Journal of Biological Macromolecules","volume":"310 ","pages":"Article 143601"},"PeriodicalIF":7.7000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biological Macromolecules","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0141813025041534","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Succinylation is a post-translational modification in which a succinyl group is transferred to the lysine residue of a protein, playing a crucial role in regulating both protein structure and cellular function. This paper introduces a novel sequential model, BioSeq_Ksite, designed to enhance succinylation prediction accuracy by integrating an adaptive attention mechanism and a joint loss function. This study first presents a new hybrid feature, ProtFusion, which combines the physicochemical properties of amino acids with pretrained models. Next, this paper introduces an adaptive attention module that enables the model to autonomously identify important features during training. Additionally, a gated network architecture is adopted to create a dual-branch sequential model. Finally, by combining sensitivity, specificity, and cross-entropy loss, a new joint loss function is proposed, which is used for succinylation prediction for the first time and significantly enhances the model's ability to handle class-imbalanced data. Evaluation on the test dataset shows that BioSeq_Ksite outperforms other models in MCC, Sn, AUC, and F1-Score, with a 7.68 % improvement in MCC over the second-best model. It provides an efficient and reliable tool for succinylation research and application. BioSeq_Ksite can be accessed at https://github.com/zzq1124ZHZ/BioSeq_Ksite.
BioSeq_Ksite:基于SSBCE丢失策略的自适应注意力模块的蛋白质琥珀酰化多视角特征驱动预测
琥珀酰化是一种翻译后修饰,将琥珀酰基转移到蛋白质的赖氨酸残基上,在调节蛋白质结构和细胞功能方面起着至关重要的作用。本文介绍了一个新的序列模型BioSeq_Ksite,该模型通过集成自适应注意机制和联合损失函数来提高琥珀酰化预测的准确性。这项研究首先提出了一种新的杂交特征,ProtFusion,它将氨基酸的物理化学性质与预训练模型相结合。其次,本文引入了自适应注意模块,使模型能够在训练过程中自主识别重要特征。此外,采用门控网络架构创建双分支顺序模型。最后,结合灵敏度、特异性和交叉熵损失,提出了一种新的联合损失函数,首次将其用于琥珀酰化预测,显著提高了模型处理类不平衡数据的能力。对测试数据集的评估表明,BioSeq_Ksite在MCC、Sn、AUC和F1-Score方面优于其他模型,其中MCC比第二好的模型提高了7.68%。为琥珀酰化的研究和应用提供了一种高效可靠的工具。BioSeq_Ksite可以通过https://github.com/zzq1124ZHZ/BioSeq_Ksite访问。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Biological Macromolecules
International Journal of Biological Macromolecules 生物-生化与分子生物学
CiteScore
13.70
自引率
9.80%
发文量
2728
审稿时长
64 days
期刊介绍: The International Journal of Biological Macromolecules is a well-established international journal dedicated to research on the chemical and biological aspects of natural macromolecules. Focusing on proteins, macromolecular carbohydrates, glycoproteins, proteoglycans, lignins, biological poly-acids, and nucleic acids, the journal presents the latest findings in molecular structure, properties, biological activities, interactions, modifications, and functional properties. Papers must offer new and novel insights, encompassing related model systems, structural conformational studies, theoretical developments, and analytical techniques. Each paper is required to primarily focus on at least one named biological macromolecule, reflected in the title, abstract, and text.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信