Zhe Liu, Weihao Pan, Xu Zhen, Ji Liang, Wenxiang Cai, Kai Yuan, G. Lin
{"title":"Will AlphaFold2 Be Helpful in Improving the Accuracy of Single-sequence PPI Site Prediction?","authors":"Zhe Liu, Weihao Pan, Xu Zhen, Ji Liang, Wenxiang Cai, Kai Yuan, G. Lin","doi":"10.1109/icbcb55259.2022.9802490","DOIUrl":null,"url":null,"abstract":"AlphaFold2 has achieved relatively high structure prediction accuracy on proteins. However, it is reported that directly feeding coordinates into deep learning models cannot achieve ideal results on downstream tasks. Therefore, how to process the predicted results into an effective form that deep learning networks can understand to improve the performance of downstream tasks is worth exploring. In this study, taking single-sequence PPI site prediction as an example, we verified the effects of three processing strategies of coordinates, namely spatial Altering, SVD20, and the rASA feature calculation. The experiment results showed that spatial filtering and the rASA feature were two effective and suitable ways to encode structural information for deep learning models. Besides, we also performed a case study of a mutated protein. The results proved that spatial filtering might potentially introduce structural changes into HHblits profiles and deep learning networks when protein mutations occur. This work provides new insight into the downstream tasks, such as predicting the binding sites of proteins or predicting the effects of mutations.","PeriodicalId":429633,"journal":{"name":"2022 10th International Conference on Bioinformatics and Computational Biology (ICBCB)","volume":" 36","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 10th International Conference on Bioinformatics and Computational Biology (ICBCB)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icbcb55259.2022.9802490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
AlphaFold2 has achieved relatively high structure prediction accuracy on proteins. However, it is reported that directly feeding coordinates into deep learning models cannot achieve ideal results on downstream tasks. Therefore, how to process the predicted results into an effective form that deep learning networks can understand to improve the performance of downstream tasks is worth exploring. In this study, taking single-sequence PPI site prediction as an example, we verified the effects of three processing strategies of coordinates, namely spatial Altering, SVD20, and the rASA feature calculation. The experiment results showed that spatial filtering and the rASA feature were two effective and suitable ways to encode structural information for deep learning models. Besides, we also performed a case study of a mutated protein. The results proved that spatial filtering might potentially introduce structural changes into HHblits profiles and deep learning networks when protein mutations occur. This work provides new insight into the downstream tasks, such as predicting the binding sites of proteins or predicting the effects of mutations.