{"title":"Analyzing the Impact of Protein Representation on Mining Structural Patterns from Protein Data","authors":"S. Albert, G. Czibula, Mihai Teletin","doi":"10.1109/SACI.2018.8440984","DOIUrl":null,"url":null,"abstract":"Proteins have essential roles in the biological processes of living organisms by contributing to maintain cellular environments. Understanding the conformational transitions of proteins may help identifying situations when incorrect folding or mutations can occur and thus, it may contribute to inhibit possible uncontrolled behaviour. In this paper we are performing a study on how different protein representations impact the process of mining relevant patterns from protein related data. Two representations are used for the proteins, one using the structural alphabet and the second using the relative solvent accessibility values of the amino acids from the proteins' primary structure. Using these representations, two case studies are performed to emphasize the effectiveness of using the proposed protein representations to unsupervisedly learn structural patterns from on a protein data set.","PeriodicalId":126087,"journal":{"name":"2018 IEEE 12th International Symposium on Applied Computational Intelligence and Informatics (SACI)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 12th International Symposium on Applied Computational Intelligence and Informatics (SACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SACI.2018.8440984","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Proteins have essential roles in the biological processes of living organisms by contributing to maintain cellular environments. Understanding the conformational transitions of proteins may help identifying situations when incorrect folding or mutations can occur and thus, it may contribute to inhibit possible uncontrolled behaviour. In this paper we are performing a study on how different protein representations impact the process of mining relevant patterns from protein related data. Two representations are used for the proteins, one using the structural alphabet and the second using the relative solvent accessibility values of the amino acids from the proteins' primary structure. Using these representations, two case studies are performed to emphasize the effectiveness of using the proposed protein representations to unsupervisedly learn structural patterns from on a protein data set.