Xiuquan Du, Shiwei Sun, Changlin Hu, Xinrui Li, Junfeng Xia
{"title":"基于集成学习和加权特征描述子的蛋白质相互作用位点预测。","authors":"Xiuquan Du, Shiwei Sun, Changlin Hu, Xinrui Li, Junfeng Xia","doi":"10.1186/s40709-016-0046-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Reliable prediction of protein-protein interaction sites is an important goal in the field of bioinformatics. Many computational methods have been explored for the large-scale prediction of protein-protein interaction sites based on various data types, including protein sequence, structural and genomic data. Although much progress has been achieved in recent years, the problem has not yet been satisfactorily solved.</p><p><strong>Results: </strong>In this work, we presented an efficient approach that uses ensemble learning algorithm with weighted feature descriptor (EL-WFD) to predict protein-protein interaction sites. Moreover, weighted feature descriptor was designed to describe the distance influence of neighboring residues on interaction sites. The results on two dataset (Hetero and Homo), show that the proposed method yields a satisfactory accuracy with 83.8 % recall and 96.3 % precision on the Hetero dataset and 84.2 % recall and 96.3 % precision on the Homo dataset, respectively. In both datasets, our method tend to obtain high Mathews correlation coefficient compared with state-of-the-art technique random forest method.</p><p><strong>Conclusions: </strong>The experimental results show that the EL-WFD method is quite effective in predicting protein-protein interaction sites. The novel weighted feature descriptor was proved to be promising in discovering interaction sites. Overall, the proposed method can be considered as a new powerful tool for predicting protein-protein interaction sites with excellence performance.</p>","PeriodicalId":50251,"journal":{"name":"Journal of Biological Research-Thessaloniki","volume":null,"pages":null},"PeriodicalIF":1.9000,"publicationDate":"2016-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s40709-016-0046-7","citationCount":"7","resultStr":"{\"title\":\"Prediction of protein-protein interaction sites by means of ensemble learning and weighted feature descriptor.\",\"authors\":\"Xiuquan Du, Shiwei Sun, Changlin Hu, Xinrui Li, Junfeng Xia\",\"doi\":\"10.1186/s40709-016-0046-7\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Reliable prediction of protein-protein interaction sites is an important goal in the field of bioinformatics. Many computational methods have been explored for the large-scale prediction of protein-protein interaction sites based on various data types, including protein sequence, structural and genomic data. Although much progress has been achieved in recent years, the problem has not yet been satisfactorily solved.</p><p><strong>Results: </strong>In this work, we presented an efficient approach that uses ensemble learning algorithm with weighted feature descriptor (EL-WFD) to predict protein-protein interaction sites. Moreover, weighted feature descriptor was designed to describe the distance influence of neighboring residues on interaction sites. The results on two dataset (Hetero and Homo), show that the proposed method yields a satisfactory accuracy with 83.8 % recall and 96.3 % precision on the Hetero dataset and 84.2 % recall and 96.3 % precision on the Homo dataset, respectively. In both datasets, our method tend to obtain high Mathews correlation coefficient compared with state-of-the-art technique random forest method.</p><p><strong>Conclusions: </strong>The experimental results show that the EL-WFD method is quite effective in predicting protein-protein interaction sites. The novel weighted feature descriptor was proved to be promising in discovering interaction sites. Overall, the proposed method can be considered as a new powerful tool for predicting protein-protein interaction sites with excellence performance.</p>\",\"PeriodicalId\":50251,\"journal\":{\"name\":\"Journal of Biological Research-Thessaloniki\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2016-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1186/s40709-016-0046-7\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Biological Research-Thessaloniki\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s40709-016-0046-7\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2016/5/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biological Research-Thessaloniki","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s40709-016-0046-7","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2016/5/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
Prediction of protein-protein interaction sites by means of ensemble learning and weighted feature descriptor.
Background: Reliable prediction of protein-protein interaction sites is an important goal in the field of bioinformatics. Many computational methods have been explored for the large-scale prediction of protein-protein interaction sites based on various data types, including protein sequence, structural and genomic data. Although much progress has been achieved in recent years, the problem has not yet been satisfactorily solved.
Results: In this work, we presented an efficient approach that uses ensemble learning algorithm with weighted feature descriptor (EL-WFD) to predict protein-protein interaction sites. Moreover, weighted feature descriptor was designed to describe the distance influence of neighboring residues on interaction sites. The results on two dataset (Hetero and Homo), show that the proposed method yields a satisfactory accuracy with 83.8 % recall and 96.3 % precision on the Hetero dataset and 84.2 % recall and 96.3 % precision on the Homo dataset, respectively. In both datasets, our method tend to obtain high Mathews correlation coefficient compared with state-of-the-art technique random forest method.
Conclusions: The experimental results show that the EL-WFD method is quite effective in predicting protein-protein interaction sites. The novel weighted feature descriptor was proved to be promising in discovering interaction sites. Overall, the proposed method can be considered as a new powerful tool for predicting protein-protein interaction sites with excellence performance.
期刊介绍:
Journal of Biological Research-Thessaloniki is a peer-reviewed, open access, international journal that publishes articles providing novel insights into the major fields of biology.
Topics covered in Journal of Biological Research-Thessaloniki include, but are not limited to: molecular biology, cytology, genetics, evolutionary biology, morphology, development and differentiation, taxonomy, bioinformatics, physiology, marine biology, behaviour, ecology and conservation.