{"title":"基于转导支持向量机的跨膜蛋白螺旋残基接触预测","authors":"Bander Almalki, Aman Sawhney, Li Liao","doi":"10.29007/3ztg","DOIUrl":null,"url":null,"abstract":"Protein functions are strongly related to their 3D structure. Therefore, it is crucial to identify their structure to understand how they behave. Studies have shown that numerous numbers of proteins cross a biological membrane, called Transmembrane (TM) proteins, and many of them adopt alpha helices shape. Unlike the current contact prediction methods that use inductive learning to predict transmembrane protein inter-helical residues contact, we adopt a transductive learning approach. The idea of transductive learning can be very useful when the test set is much bigger than the training set, which is usually the case in amino acids residues contacts prediction. We test this approach on a set of transmembrane protein sequences to identify helix-helix residues contacts, compare transductive and inductive approaches, and identify conditions and limitations where TSVM outperforms inductive SVM. In addition, we investigate the performance degradation of the traditional TSVM and explore the proposed solutions in the literature. Moreover, we propose an early stop technique that can outperform the state of art TSVM and produce a more accurate prediction.","PeriodicalId":93549,"journal":{"name":"EPiC series in computing","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Transmembrane Protein Inter-Helical Residue Contacts Prediction Using Transductive Support Vector Machines\",\"authors\":\"Bander Almalki, Aman Sawhney, Li Liao\",\"doi\":\"10.29007/3ztg\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Protein functions are strongly related to their 3D structure. Therefore, it is crucial to identify their structure to understand how they behave. Studies have shown that numerous numbers of proteins cross a biological membrane, called Transmembrane (TM) proteins, and many of them adopt alpha helices shape. Unlike the current contact prediction methods that use inductive learning to predict transmembrane protein inter-helical residues contact, we adopt a transductive learning approach. The idea of transductive learning can be very useful when the test set is much bigger than the training set, which is usually the case in amino acids residues contacts prediction. We test this approach on a set of transmembrane protein sequences to identify helix-helix residues contacts, compare transductive and inductive approaches, and identify conditions and limitations where TSVM outperforms inductive SVM. In addition, we investigate the performance degradation of the traditional TSVM and explore the proposed solutions in the literature. Moreover, we propose an early stop technique that can outperform the state of art TSVM and produce a more accurate prediction.\",\"PeriodicalId\":93549,\"journal\":{\"name\":\"EPiC series in computing\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1900-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EPiC series in computing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.29007/3ztg\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EPiC series in computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.29007/3ztg","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Transmembrane Protein Inter-Helical Residue Contacts Prediction Using Transductive Support Vector Machines
Protein functions are strongly related to their 3D structure. Therefore, it is crucial to identify their structure to understand how they behave. Studies have shown that numerous numbers of proteins cross a biological membrane, called Transmembrane (TM) proteins, and many of them adopt alpha helices shape. Unlike the current contact prediction methods that use inductive learning to predict transmembrane protein inter-helical residues contact, we adopt a transductive learning approach. The idea of transductive learning can be very useful when the test set is much bigger than the training set, which is usually the case in amino acids residues contacts prediction. We test this approach on a set of transmembrane protein sequences to identify helix-helix residues contacts, compare transductive and inductive approaches, and identify conditions and limitations where TSVM outperforms inductive SVM. In addition, we investigate the performance degradation of the traditional TSVM and explore the proposed solutions in the literature. Moreover, we propose an early stop technique that can outperform the state of art TSVM and produce a more accurate prediction.