{"title":"基于多尺度卷积神经网络的蛋白质二级结构预测","authors":"Yu Xiao, Xiaozhou Chen","doi":"10.54097/ijbls.v2i3.8642","DOIUrl":null,"url":null,"abstract":"In the field of bioinformatics, the prediction of secondary structure of proteins is very important. It can be obtained from the prediction of primary structure (amino acid sequence) and can provide reference for the prediction of tertiary structure of proteins. Amino acid sequences of proteins are encoded with several features and then combined into the prediction network. Convolutional neural network has excellent performance in text and sequence information extraction. The amino acid sequence of protein is also a special sequence, so the convolutional neural network can be used to extract the information in the sequence. Moreover, the influence of amino acids on the formation of secondary structure varies with different distances, so in the experiment, convolutional neural networks with convolution nuclei of different sizes were used to form multi-scale convolution blocks to extract amino acid sequence information. At the same time, the sliding window technique is also used to show the interaction between the sequences, and a long amino acid sequence is divided into some amino acid fragments and input into the model. Finally, the accuracy of Q8 on the dataset CB6133_filtered reaches 71%.","PeriodicalId":182292,"journal":{"name":"International Journal of Biology and Life Sciences","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of Protein Secondary Structure based on Multi-scale Convolutional Neural Network\",\"authors\":\"Yu Xiao, Xiaozhou Chen\",\"doi\":\"10.54097/ijbls.v2i3.8642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the field of bioinformatics, the prediction of secondary structure of proteins is very important. It can be obtained from the prediction of primary structure (amino acid sequence) and can provide reference for the prediction of tertiary structure of proteins. Amino acid sequences of proteins are encoded with several features and then combined into the prediction network. Convolutional neural network has excellent performance in text and sequence information extraction. The amino acid sequence of protein is also a special sequence, so the convolutional neural network can be used to extract the information in the sequence. Moreover, the influence of amino acids on the formation of secondary structure varies with different distances, so in the experiment, convolutional neural networks with convolution nuclei of different sizes were used to form multi-scale convolution blocks to extract amino acid sequence information. At the same time, the sliding window technique is also used to show the interaction between the sequences, and a long amino acid sequence is divided into some amino acid fragments and input into the model. Finally, the accuracy of Q8 on the dataset CB6133_filtered reaches 71%.\",\"PeriodicalId\":182292,\"journal\":{\"name\":\"International Journal of Biology and Life Sciences\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-05-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Biology and Life Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.54097/ijbls.v2i3.8642\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biology and Life Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54097/ijbls.v2i3.8642","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Prediction of Protein Secondary Structure based on Multi-scale Convolutional Neural Network
In the field of bioinformatics, the prediction of secondary structure of proteins is very important. It can be obtained from the prediction of primary structure (amino acid sequence) and can provide reference for the prediction of tertiary structure of proteins. Amino acid sequences of proteins are encoded with several features and then combined into the prediction network. Convolutional neural network has excellent performance in text and sequence information extraction. The amino acid sequence of protein is also a special sequence, so the convolutional neural network can be used to extract the information in the sequence. Moreover, the influence of amino acids on the formation of secondary structure varies with different distances, so in the experiment, convolutional neural networks with convolution nuclei of different sizes were used to form multi-scale convolution blocks to extract amino acid sequence information. At the same time, the sliding window technique is also used to show the interaction between the sequences, and a long amino acid sequence is divided into some amino acid fragments and input into the model. Finally, the accuracy of Q8 on the dataset CB6133_filtered reaches 71%.