{"title":"Fer-COCL: A Novel Method Based on Multiple Deep Learning Algorithms for Identifying Fertility-Related Proteins","authors":"Shenmin Zhang, Xinjie Li, Hongyan Shi, Yuanyuan Jing, Yunyun Liang, Yusen Zhang","doi":"10.46793/match.90-3.537z","DOIUrl":null,"url":null,"abstract":"The survival of species depends on the fertility of organisms. It is also worthwhile to study the proteins that can regulate the reproductive activity of organisms. Since biological experiments are laborious to confirm proteins, it has become a priority that develop relevant computational models to predict the function of fertility-related proteins. With the development of machine learning, pertinent various algorithms can be the key to identifying fertility-related proteins. In this work, we develop a model Fer-COCL based on deep learning. The model consists of multiple features as well as multiple deep learning algorithms. First, we extract features using Amino acid composition (AAC), Dipeptide composition (DPC), CTD transition (CTDT) and deviation between the dipeptide and the expected mean (DDE). After that, the spliced features are fed into the classifier. The data processed jointly by convolutional neural network and long short-term memory is input to the fully connected layer for classification. After evaluating the model using 10-fold cross-validation, the accuracy of the two data sets reaches 97.1% and 98.3%, respectively. The results indicate that the model is efficient and accurate, facilitating biologists' research on biological fertility. In addition, a free online tool for predicting the function of fertility-related proteins is available at http://fercocl.zhanglab.site/.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":"31 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Match-Communications in Mathematical and in Computer Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.46793/match.90-3.537z","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The survival of species depends on the fertility of organisms. It is also worthwhile to study the proteins that can regulate the reproductive activity of organisms. Since biological experiments are laborious to confirm proteins, it has become a priority that develop relevant computational models to predict the function of fertility-related proteins. With the development of machine learning, pertinent various algorithms can be the key to identifying fertility-related proteins. In this work, we develop a model Fer-COCL based on deep learning. The model consists of multiple features as well as multiple deep learning algorithms. First, we extract features using Amino acid composition (AAC), Dipeptide composition (DPC), CTD transition (CTDT) and deviation between the dipeptide and the expected mean (DDE). After that, the spliced features are fed into the classifier. The data processed jointly by convolutional neural network and long short-term memory is input to the fully connected layer for classification. After evaluating the model using 10-fold cross-validation, the accuracy of the two data sets reaches 97.1% and 98.3%, respectively. The results indicate that the model is efficient and accurate, facilitating biologists' research on biological fertility. In addition, a free online tool for predicting the function of fertility-related proteins is available at http://fercocl.zhanglab.site/.
期刊介绍:
MATCH Communications in Mathematical and in Computer Chemistry publishes papers of original research as well as reviews on chemically important mathematical results and non-routine applications of mathematical techniques to chemical problems. A paper acceptable for publication must contain non-trivial mathematics or communicate non-routine computer-based procedures AND have a clear connection to chemistry. Papers are published without any processing or publication charge.