{"title":"DeepAPArice: a deep learning model for poly(A) site intelligent prediction in rice using convolutional neural network","authors":"Haiyong He, Guoli Ji","doi":"10.1109/ICPECA53709.2022.9718901","DOIUrl":null,"url":null,"abstract":"Polyadenylation [poly(A)] is an extremely important step in the post-transcriptional process of premRNA, which plays a key role in gene regulation, protein binding and translation efficiency. Accurate identification of poly(A) sites can help understand gene expression regulation and improve the accuracy of gene annotation. Although there have been many machine learning methods used for poly(A) site prediction, most of them only take into account a limited set of features. With the increase of the amount of genomic data and the development of computing technology, deep learning technology has been used for solving many machine learning tasks. Here, we propose DeepAPArice, a deep learning model that predicts poly(A) sites in Oryza Sativa based on sequence features and RNA secondary structures around poly(A) sites. We compared DeepAPArice with both traditional machine learning algorithms and advanced deep learning algorithms, and the results show that DeepAPArice is superior to them in terms of several indicators.","PeriodicalId":244448,"journal":{"name":"2022 IEEE 2nd International Conference on Power, Electronics and Computer Applications (ICPECA)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 2nd International Conference on Power, Electronics and Computer Applications (ICPECA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPECA53709.2022.9718901","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Polyadenylation [poly(A)] is an extremely important step in the post-transcriptional process of premRNA, which plays a key role in gene regulation, protein binding and translation efficiency. Accurate identification of poly(A) sites can help understand gene expression regulation and improve the accuracy of gene annotation. Although there have been many machine learning methods used for poly(A) site prediction, most of them only take into account a limited set of features. With the increase of the amount of genomic data and the development of computing technology, deep learning technology has been used for solving many machine learning tasks. Here, we propose DeepAPArice, a deep learning model that predicts poly(A) sites in Oryza Sativa based on sequence features and RNA secondary structures around poly(A) sites. We compared DeepAPArice with both traditional machine learning algorithms and advanced deep learning algorithms, and the results show that DeepAPArice is superior to them in terms of several indicators.