S-DCNN: prediction of ATP binding residues by deep convolutional neural network based on SMOTE.

IF 2.8 3区 生物学 Q2 GENETICS & HEREDITY
Frontiers in Genetics Pub Date : 2025-01-06 eCollection Date: 2024-01-01 DOI:10.3389/fgene.2024.1513201
Sixi Hao, Cai-Yan Li, Xiuzhen Hu, Zhenxing Feng, Gaimei Zhang, Caiyun Yang, Huimin Hu
{"title":"S-DCNN: prediction of ATP binding residues by deep convolutional neural network based on SMOTE.","authors":"Sixi Hao, Cai-Yan Li, Xiuzhen Hu, Zhenxing Feng, Gaimei Zhang, Caiyun Yang, Huimin Hu","doi":"10.3389/fgene.2024.1513201","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The realization of many protein functions requires binding with ligands. As a significant protein-binding ligand, ATP plays a crucial role in various biological processes. Currently, the precise prediction of ATP binding residues remains challenging.</p><p><strong>Methods: </strong>Based on the sequence information, this paper introduces a method called S-DCNN for predicting ATP binding residues, utilizing a deep convolutional neural network (DCNN) enhanced with the synthetic minority over-sampling technique (SMOTE).</p><p><strong>Results: </strong>The incorporation of additional feature parameters such as dihedral angles, energy, and propensity factors into the standard parameter set resulted in a significant enhancement in prediction accuracy on the ATP-289 dataset. The S-DCNN achieved the highest Matthews correlation coefficient value of 0.5031 and an accuracy rate of 97.06% on an independent test set. Furthermore, when applied to the ATP-221 and ATP-388 datasets for validation, the S-DCNN outperformed existing methods on ATP-221 and performed comparably to other methods on ATP-388 during independent testing.</p><p><strong>Conclusion: </strong>Our experimental results underscore the efficacy of the S-DCNN in accurately predicting ATP binding residues, establishing it as a potent tool in the prediction of ATP binding residues.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"15 ","pages":"1513201"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11744016/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fgene.2024.1513201","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The realization of many protein functions requires binding with ligands. As a significant protein-binding ligand, ATP plays a crucial role in various biological processes. Currently, the precise prediction of ATP binding residues remains challenging.

Methods: Based on the sequence information, this paper introduces a method called S-DCNN for predicting ATP binding residues, utilizing a deep convolutional neural network (DCNN) enhanced with the synthetic minority over-sampling technique (SMOTE).

Results: The incorporation of additional feature parameters such as dihedral angles, energy, and propensity factors into the standard parameter set resulted in a significant enhancement in prediction accuracy on the ATP-289 dataset. The S-DCNN achieved the highest Matthews correlation coefficient value of 0.5031 and an accuracy rate of 97.06% on an independent test set. Furthermore, when applied to the ATP-221 and ATP-388 datasets for validation, the S-DCNN outperformed existing methods on ATP-221 and performed comparably to other methods on ATP-388 during independent testing.

Conclusion: Our experimental results underscore the efficacy of the S-DCNN in accurately predicting ATP binding residues, establishing it as a potent tool in the prediction of ATP binding residues.

S-DCNN:基于SMOTE的深度卷积神经网络预测ATP结合残基。
背景:许多蛋白质功能的实现需要与配体结合。ATP作为一种重要的蛋白质结合配体,在多种生物过程中起着至关重要的作用。目前,ATP结合残基的精确预测仍然具有挑战性。方法:基于序列信息,利用合成少数派过采样技术(SMOTE)增强的深度卷积神经网络(DCNN),提出了一种预测ATP结合残基的S-DCNN方法。结果:在标准参数集中加入额外的特征参数,如二面角、能量和倾向因素,显著提高了ATP-289数据集的预测精度。S-DCNN在独立测试集上的马修斯相关系数最高,为0.5031,准确率为97.06%。此外,当应用于ATP-221和ATP-388数据集进行验证时,S-DCNN优于现有的ATP-221方法,并且在独立测试中与其他方法在ATP-388上的表现相当。结论:我们的实验结果强调了S-DCNN在准确预测ATP结合残基方面的有效性,确立了它是预测ATP结合残基的有效工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Genetics
Frontiers in Genetics Biochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
5.50
自引率
8.10%
发文量
3491
审稿时长
14 weeks
期刊介绍: Frontiers in Genetics publishes rigorously peer-reviewed research on genes and genomes relating to all the domains of life, from humans to plants to livestock and other model organisms. Led by an outstanding Editorial Board of the world’s leading experts, this multidisciplinary, open-access journal is at the forefront of communicating cutting-edge research to researchers, academics, clinicians, policy makers and the public. The study of inheritance and the impact of the genome on various biological processes is well documented. However, the majority of discoveries are still to come. A new era is seeing major developments in the function and variability of the genome, the use of genetic and genomic tools and the analysis of the genetic basis of various biological phenomena.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信