Analysis and prediction of protein stability based on interaction network, gene ontology, and KEGG pathway enrichment scores

IF 2.5 4区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Feiming Huang , Minfei Fu , JiaRui Li , Lei Chen , KaiYan Feng , Tao Huang , Yu-Dong Cai
{"title":"Analysis and prediction of protein stability based on interaction network, gene ontology, and KEGG pathway enrichment scores","authors":"Feiming Huang ,&nbsp;Minfei Fu ,&nbsp;JiaRui Li ,&nbsp;Lei Chen ,&nbsp;KaiYan Feng ,&nbsp;Tao Huang ,&nbsp;Yu-Dong Cai","doi":"10.1016/j.bbapap.2023.140889","DOIUrl":null,"url":null,"abstract":"<div><p>Metabolic stability of proteins plays a vital role in various dedicated cellular processes. Traditional methods of measuring the metabolic stability are time-consuming and expensive. Therefore, we developed a more efficient computational approach to understand the protein dynamic action mechanisms in biological process<span> networks. In this study, we collected 341 short-lived proteins and 824 non-short-lived proteins from U2OS; 342 short-lived proteins and 821 non-short-lived proteins from HEK293T; 424 short-lived proteins and 1153 non-short-lived proteins from HCT116; and 384 short-lived proteins and 992 non-short-lived proteins from RPE1. The proteins were encoded by GO<span> and KEGG enrichment scores based on the genes and their neighbors in STRING, resulting in 20,681 GO term features and 297 KEGG pathway features. We also incorporated the protein interaction information from STRING into the features and obtained 19,247 node features. Boruta and mRMR methods were used for feature filtering, and IFS method was used to obtain the best feature subsets and create the models with the highest performance. The present study identified 42 features that did not appear in previous studies and classified them into eight groups according to their functional annotation. By reviewing the literature, we found that the following three functional groups were critical in determining the stability of proteins: synaptic transmission, post-translational modifications, and cell fate determination. These findings may serve as a valuable reference for developing drugs that target protein stability.</span></span></p></div>","PeriodicalId":8760,"journal":{"name":"Biochimica et biophysica acta. Proteins and proteomics","volume":"1871 3","pages":"Article 140889"},"PeriodicalIF":2.5000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biochimica et biophysica acta. Proteins and proteomics","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S157096392300002X","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 16

Abstract

Metabolic stability of proteins plays a vital role in various dedicated cellular processes. Traditional methods of measuring the metabolic stability are time-consuming and expensive. Therefore, we developed a more efficient computational approach to understand the protein dynamic action mechanisms in biological process networks. In this study, we collected 341 short-lived proteins and 824 non-short-lived proteins from U2OS; 342 short-lived proteins and 821 non-short-lived proteins from HEK293T; 424 short-lived proteins and 1153 non-short-lived proteins from HCT116; and 384 short-lived proteins and 992 non-short-lived proteins from RPE1. The proteins were encoded by GO and KEGG enrichment scores based on the genes and their neighbors in STRING, resulting in 20,681 GO term features and 297 KEGG pathway features. We also incorporated the protein interaction information from STRING into the features and obtained 19,247 node features. Boruta and mRMR methods were used for feature filtering, and IFS method was used to obtain the best feature subsets and create the models with the highest performance. The present study identified 42 features that did not appear in previous studies and classified them into eight groups according to their functional annotation. By reviewing the literature, we found that the following three functional groups were critical in determining the stability of proteins: synaptic transmission, post-translational modifications, and cell fate determination. These findings may serve as a valuable reference for developing drugs that target protein stability.

基于互作网络、基因本体和KEGG通路富集评分的蛋白质稳定性分析与预测
蛋白质的代谢稳定性在各种专门的细胞过程中起着至关重要的作用。测量代谢稳定性的传统方法耗时且昂贵。因此,我们开发了一种更有效的计算方法来理解生物过程网络中的蛋白质动态作用机制。在本研究中,我们从U2OS中收集了341个短命蛋白和824个非短命蛋白;来自HEK293T的342个短命蛋白和821个非短命蛋白;来自HCT116的424个短命蛋白和1153个非短命蛋白;以及来自RPE1的384个短命蛋白和992个非短命蛋白。基于STRING中的基因及其邻居,通过GO和KEGG富集评分对蛋白质进行编码,产生20681个GO术语特征和297个KEGG途径特征。我们还将来自STRING的蛋白质相互作用信息纳入特征中,并获得19247个节点特征。Boruta和mRMR方法用于特征滤波,IFS方法用于获得最佳特征子集并创建具有最高性能的模型。本研究确定了42个以前研究中没有出现的特征,并根据其功能注释将其分为八组。通过回顾文献,我们发现以下三个官能团在决定蛋白质的稳定性方面至关重要:突触传递、翻译后修饰和细胞命运决定。这些发现可能为开发靶向蛋白质稳定性的药物提供有价值的参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
8.00
自引率
0.00%
发文量
55
审稿时长
33 days
期刊介绍: BBA Proteins and Proteomics covers protein structure conformation and dynamics; protein folding; protein-ligand interactions; enzyme mechanisms, models and kinetics; protein physical properties and spectroscopy; and proteomics and bioinformatics analyses of protein structure, protein function, or protein regulation.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信