Comparison Study of Peptide Retention Time Prediction Model Based on Five Kinds of Amino Acid Descriptors in HPLC by Support Vector Machine

Jiajian Yin
{"title":"Comparison Study of Peptide Retention Time Prediction Model Based on Five Kinds of Amino Acid Descriptors in HPLC by Support Vector Machine","authors":"Jiajian Yin","doi":"10.1109/ICBBE.2010.5516374","DOIUrl":null,"url":null,"abstract":"Based on amino acid descriptors(z-scales, c-scales, ISA-ECI,MS-WHIM and PRIN) and additive method, evaluation of predict performance of five amino acid descriptors in peptide QSRR(Quantitative structure-retention relationships) with 101 promiscuous peptides in High-Performance Liquid Chromato- graphy by support vector regression(SVR) is made in the article, and RBF(radical basis function) is selected as kernel function. Using leave-one-out cross-validation (LOO-CV), we suppose that predicting accuracy of ISA-ECI is better than the other descriptors in SVR with RBF. The prediction correlation coefficient of the SVR model (ε = 0.001,σ= 5 and C= 100) is 0.8445 by leave-one-out cross validation. The standard error of prediction (SEP) error of the dataset is 1.03 by fitting calculation, and the prediction correlation coefficient is 0.9642.The prediction results are in agreement with the experimental values. This paper provided a simple and effective method for predicting the retention behavior of peptide and some insight into what structural features are related to the retention time of peptides. Moreover, it also offered an idea about nonlinear relation between retention time of peptides and their structural descriptors (ISA-ECI).Therefore, SVR is assumed to be a feasible method in peptide QSAR (Quantitative structure-activity relationships) model.","PeriodicalId":6396,"journal":{"name":"2010 4th International Conference on Bioinformatics and Biomedical Engineering","volume":"87 1","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2010-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 4th International Conference on Bioinformatics and Biomedical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBBE.2010.5516374","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Based on amino acid descriptors(z-scales, c-scales, ISA-ECI,MS-WHIM and PRIN) and additive method, evaluation of predict performance of five amino acid descriptors in peptide QSRR(Quantitative structure-retention relationships) with 101 promiscuous peptides in High-Performance Liquid Chromato- graphy by support vector regression(SVR) is made in the article, and RBF(radical basis function) is selected as kernel function. Using leave-one-out cross-validation (LOO-CV), we suppose that predicting accuracy of ISA-ECI is better than the other descriptors in SVR with RBF. The prediction correlation coefficient of the SVR model (ε = 0.001,σ= 5 and C= 100) is 0.8445 by leave-one-out cross validation. The standard error of prediction (SEP) error of the dataset is 1.03 by fitting calculation, and the prediction correlation coefficient is 0.9642.The prediction results are in agreement with the experimental values. This paper provided a simple and effective method for predicting the retention behavior of peptide and some insight into what structural features are related to the retention time of peptides. Moreover, it also offered an idea about nonlinear relation between retention time of peptides and their structural descriptors (ISA-ECI).Therefore, SVR is assumed to be a feasible method in peptide QSAR (Quantitative structure-activity relationships) model.
基于五种氨基酸描述符的高效液相色谱多肽保留时间预测模型的支持向量机比较研究
基于氨基酸描述符(z-scale、c-scale、ISA-ECI、MS-WHIM和PRIN)和加性法,采用支持向量回归(SVR)评价了5个氨基酸描述符在高效液相色谱中对101个混杂肽的QSRR(定量结构-保留关系)预测性能,并选择RBF(radical basis function)作为核函数。利用留一交叉验证(LOO-CV),我们假设在RBF的SVR中,ISA-ECI的预测精度优于其他描述符。SVR模型的预测相关系数(ε= 0.001,σ= 5, C= 100)通过留一交叉验证为0.8445。经拟合计算,该数据集的预测标准差(SEP)误差为1.03,预测相关系数为0.9642。预测结果与实验值吻合较好。本文提供了一种简单有效的预测多肽保留行为的方法,并对多肽的结构特征与保留时间的关系有了一些认识。此外,还提出了多肽保留时间与其结构描述符(ISA-ECI)之间的非线性关系。因此,假设SVR是一种可行的多肽QSAR(定量构效关系)模型方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信