{"title":"Prediction of Lysine Succinylation Sites by SVR and Weighted Down-sampling","authors":"Kai Wang, P. Liang, Junda Hu","doi":"10.1145/3366715.3366735","DOIUrl":null,"url":null,"abstract":"Succinylation is a post-translational modification (PTM), which changes the chemical structure of lysine and results in significant changes in the structure and function of proteins. Lysine succinylation plays an important role in coordinating various biological processes, and it isalso associated with some diseases. Accurately identifying the lysine succinylation sites in proteins is of significant importance for basic research and drug development. Lysine succinylation sites prediction is a typical imbalanced and fragmentary learning problem. Directly applyingthe traditional machine learning approach for this task is not suitable. To circumvent this problem, based on extracting the features of protein sequences by sliding window and mirror-effect, weighted under-sampling is developed to make samples complete and balanced. Finally based on SVR prediction model and the corresponding suitable threshold, comparing with several state-of-art related methods, the effectiveness of the proposed method was validated by the experimental results.","PeriodicalId":425980,"journal":{"name":"Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 International Conference on Robotics Systems and Vehicle Technology - RSVT '19","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3366715.3366735","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Succinylation is a post-translational modification (PTM), which changes the chemical structure of lysine and results in significant changes in the structure and function of proteins. Lysine succinylation plays an important role in coordinating various biological processes, and it isalso associated with some diseases. Accurately identifying the lysine succinylation sites in proteins is of significant importance for basic research and drug development. Lysine succinylation sites prediction is a typical imbalanced and fragmentary learning problem. Directly applyingthe traditional machine learning approach for this task is not suitable. To circumvent this problem, based on extracting the features of protein sequences by sliding window and mirror-effect, weighted under-sampling is developed to make samples complete and balanced. Finally based on SVR prediction model and the corresponding suitable threshold, comparing with several state-of-art related methods, the effectiveness of the proposed method was validated by the experimental results.