CarSite: identifying carbonylated sites of human proteins based on a one-sided selection resampling method†

IF 3.743 Q2 Biochemistry, Genetics and Molecular Biology
Yun Zuo and Cang-Zhi Jia
{"title":"CarSite: identifying carbonylated sites of human proteins based on a one-sided selection resampling method†","authors":"Yun Zuo and Cang-Zhi Jia","doi":"10.1039/C7MB00363C","DOIUrl":null,"url":null,"abstract":"<p >Protein carbonylation is one of the most important biomarkers of oxidative protein damage and such protein damage is linked to various diseases and aging. It is thus vital that carbonylation sites are identified accurately. In this study, CarSite, a novel bioinformatics tool, was established to identify carbonylation sites in human proteins. The one-sided selection (OSS) resampling method was used to establish balanced training datasets and this resampling method is demonstrated to perform better than a Monte Carlo resampling method <em>via</em> 10-fold cross-validation tests on the Jia dataset. Moreover, the hybrid combination of position-specific amino acid propensity (PSAAP), composition of <em>k</em>-spaced amino acid pairs (CKSAAP), amino acid composition (AAC), and composition of hydrophobic and hydrophilic amino acids (CHHAA) was selected to optimize the performance of the predictor. On 10-fold cross-validation of the Jia dataset, CarSite obtained rates of sensitivity corresponding to K/P/R/T-type peptides of ~21%, 22%, 19%, or 18% higher than those obtained by iCar-PseCp, respectively, which was previously considered as the best predictor for identifying carbonylation sites in human proteins. Furthermore, compared with other existing predictors, CarSite obtained much higher sensitivity and accuracy when tested on the same dataset.</p>","PeriodicalId":90,"journal":{"name":"Molecular BioSystems","volume":" 11","pages":" 2362-2369"},"PeriodicalIF":3.7430,"publicationDate":"2017-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1039/C7MB00363C","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular BioSystems","FirstCategoryId":"1085","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2017/mb/c7mb00363c","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 10

Abstract

Protein carbonylation is one of the most important biomarkers of oxidative protein damage and such protein damage is linked to various diseases and aging. It is thus vital that carbonylation sites are identified accurately. In this study, CarSite, a novel bioinformatics tool, was established to identify carbonylation sites in human proteins. The one-sided selection (OSS) resampling method was used to establish balanced training datasets and this resampling method is demonstrated to perform better than a Monte Carlo resampling method via 10-fold cross-validation tests on the Jia dataset. Moreover, the hybrid combination of position-specific amino acid propensity (PSAAP), composition of k-spaced amino acid pairs (CKSAAP), amino acid composition (AAC), and composition of hydrophobic and hydrophilic amino acids (CHHAA) was selected to optimize the performance of the predictor. On 10-fold cross-validation of the Jia dataset, CarSite obtained rates of sensitivity corresponding to K/P/R/T-type peptides of ~21%, 22%, 19%, or 18% higher than those obtained by iCar-PseCp, respectively, which was previously considered as the best predictor for identifying carbonylation sites in human proteins. Furthermore, compared with other existing predictors, CarSite obtained much higher sensitivity and accuracy when tested on the same dataset.

Abstract Image

CarSite:基于单侧选择重采样方法鉴定人类蛋白质的羰基化位点
蛋白质羰基化是氧化蛋白损伤最重要的生物标志物之一,这种蛋白质损伤与各种疾病和衰老有关。因此,准确地识别羰基化位点是至关重要的。在这项研究中,建立了一种新的生物信息学工具CarSite,用于识别人类蛋白质中的羰基化位点。采用单侧选择(OSS)重采样方法建立平衡训练数据集,并在Jia数据集上进行10次交叉验证,证明该重采样方法优于蒙特卡罗重采样方法。此外,选择位置特异性氨基酸倾向(PSAAP)、k间距氨基酸对组成(CKSAAP)、氨基酸组成(AAC)和亲疏水氨基酸组成(CHHAA)的杂交组合来优化预测器的性能。在Jia数据集的10倍交叉验证中,CarSite获得的K/P/R/ t型肽对应的敏感性分别比iCar-PseCp高21%,22%,19%或18%,这在以前被认为是识别人类蛋白质羰基化位点的最佳预测因子。此外,与其他现有的预测器相比,CarSite在同一数据集上测试时获得了更高的灵敏度和准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular BioSystems
Molecular BioSystems 生物-生化与分子生物学
CiteScore
2.94
自引率
0.00%
发文量
0
审稿时长
2.6 months
期刊介绍: Molecular Omics publishes molecular level experimental and bioinformatics research in the -omics sciences, including genomics, proteomics, transcriptomics and metabolomics. We will also welcome multidisciplinary papers presenting studies combining different types of omics, or the interface of omics and other fields such as systems biology or chemical biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信