{"title":"支持向量机与逻辑回归的混合β转弯预测方法","authors":"M. Elbashir, Jianxin Wang, Fang Wu","doi":"10.1109/BIBMW.2012.6470205","DOIUrl":null,"url":null,"abstract":"A β-turn is a secondary protein structure type that plays a significant role in protein folding, stability, and molecular recognition. It is the most common type of non-repetitive structures. On average 25% of amino acids in protein structures are located in β-turns. In this paper, we propose a hybrid approach of support vector machines (SVMs) with logistic regression (LR) for β-turn prediction. In this hybrid approach, the non β-turn class in a training set is under-sampled several times and combined with the β-turn class to create a number of balanced sets. Each balanced set is used for training one SVM at a time. The results of the SVMs are aggregated by using a logistic regression model. By adopting this hybrid approach, we cannot only avoid the difficulty of imbalanced data, but also have outputs with probability, and less ambiguous than combining SVM with other methods such as voting. Our simulation studies on BT426, and other datasets show that this hybrid approach achieves favorable performance in predicting β-turns as measured by the Matthew correlation coefficient (MCC) when compared with other competing methods.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A hybrid approach of support vector machines with logistic regression for β-turn prediction\",\"authors\":\"M. Elbashir, Jianxin Wang, Fang Wu\",\"doi\":\"10.1109/BIBMW.2012.6470205\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A β-turn is a secondary protein structure type that plays a significant role in protein folding, stability, and molecular recognition. It is the most common type of non-repetitive structures. On average 25% of amino acids in protein structures are located in β-turns. In this paper, we propose a hybrid approach of support vector machines (SVMs) with logistic regression (LR) for β-turn prediction. In this hybrid approach, the non β-turn class in a training set is under-sampled several times and combined with the β-turn class to create a number of balanced sets. Each balanced set is used for training one SVM at a time. The results of the SVMs are aggregated by using a logistic regression model. By adopting this hybrid approach, we cannot only avoid the difficulty of imbalanced data, but also have outputs with probability, and less ambiguous than combining SVM with other methods such as voting. Our simulation studies on BT426, and other datasets show that this hybrid approach achieves favorable performance in predicting β-turns as measured by the Matthew correlation coefficient (MCC) when compared with other competing methods.\",\"PeriodicalId\":6392,\"journal\":{\"name\":\"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-10-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBMW.2012.6470205\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBMW.2012.6470205","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A hybrid approach of support vector machines with logistic regression for β-turn prediction
A β-turn is a secondary protein structure type that plays a significant role in protein folding, stability, and molecular recognition. It is the most common type of non-repetitive structures. On average 25% of amino acids in protein structures are located in β-turns. In this paper, we propose a hybrid approach of support vector machines (SVMs) with logistic regression (LR) for β-turn prediction. In this hybrid approach, the non β-turn class in a training set is under-sampled several times and combined with the β-turn class to create a number of balanced sets. Each balanced set is used for training one SVM at a time. The results of the SVMs are aggregated by using a logistic regression model. By adopting this hybrid approach, we cannot only avoid the difficulty of imbalanced data, but also have outputs with probability, and less ambiguous than combining SVM with other methods such as voting. Our simulation studies on BT426, and other datasets show that this hybrid approach achieves favorable performance in predicting β-turns as measured by the Matthew correlation coefficient (MCC) when compared with other competing methods.