PSI-MOUSE: Predicting Mouse Pseudouridine Sites From Sequence and Genome-Derived Features.

IF 1.7 4区 生物学 Q4 EVOLUTIONARY BIOLOGY
Evolutionary Bioinformatics Pub Date : 2020-06-09 eCollection Date: 2020-01-01 DOI:10.1177/1176934320925752
Bowen Song, Kunqi Chen, Yujiao Tang, Jialin Ma, Jia Meng, Zhen Wei
{"title":"PSI-MOUSE: Predicting Mouse Pseudouridine Sites From Sequence and Genome-Derived Features.","authors":"Bowen Song, Kunqi Chen, Yujiao Tang, Jialin Ma, Jia Meng, Zhen Wei","doi":"10.1177/1176934320925752","DOIUrl":null,"url":null,"abstract":"<p><p>Pseudouridine (Ψ) is the first discovered and the most prevalent posttranscriptional modification, which has been widely studied during the past decades. Pseudouridine was observed in almost all kinds of RNAs and shown to have important biological functions. Currently, the time-consuming and high-cost procedures of experimental approaches limit its uses in real-life Ψ site detection. Alternatively, by taking advantage of the explosive growth of Ψ sequencing data, the computational methods may provide a more cost-effective avenue. To date, the existing mouse Ψ site predictors were all developed based on sequence-derived features, and their performance can be further improved by adding the domain knowledge derived feature. Therefore, it is highly desirable to propose a genomic feature-based computational method to increase the accuracy and efficiency of the identification of Ψ RNA modification in the mouse transcriptome. In our study, a predictive framework PSI-MOUSE was built. Besides the conventional sequence-based features, PSI-MOUSE first introduced 38 additional genomic features derived from the mouse genome, which achieved a satisfactory improvement in the prediction performance, compared with other existing models. Moreover, PSI-MOUSE also features in automatically annotating the putative Ψ sites with diverse types of posttranscriptional regulations (RNA-binding protein [RBP]-binding regions, miRNA-RNA interactions, and splicing sites), which can serve as a useful research tool for the study of Ψ RNA modification in the mouse genome. Finally, 3282 experimentally validated mouse Ψ sites were also collected in a database with customized query functions. For the convenience of academic users, a website was built to provide a user-friendly interface for the query and analysis on the database. The website is freely accessible at www.xjtlu.edu.cn/biologicalsciences/psimouse and http://psimouse.rnamd.com. We introduced the genome-derived features to mouse for the first time, and we achieved a good performance in mouse Ψ site prediction. Compared with the existing state-of-art methods, our newly developed approach PSI-MOUSE obtained a substantial improvement in prediction accuracy, marking the reliable contributions of genomic features for the prediction of RNA modifications in a species other than human.</p>","PeriodicalId":50472,"journal":{"name":"Evolutionary Bioinformatics","volume":"16 ","pages":"1176934320925752"},"PeriodicalIF":1.7000,"publicationDate":"2020-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/f9/df/10.1177_1176934320925752.PMC7285933.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Evolutionary Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1177/1176934320925752","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/1/1 0:00:00","PubModel":"eCollection","JCR":"Q4","JCRName":"EVOLUTIONARY BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Pseudouridine (Ψ) is the first discovered and the most prevalent posttranscriptional modification, which has been widely studied during the past decades. Pseudouridine was observed in almost all kinds of RNAs and shown to have important biological functions. Currently, the time-consuming and high-cost procedures of experimental approaches limit its uses in real-life Ψ site detection. Alternatively, by taking advantage of the explosive growth of Ψ sequencing data, the computational methods may provide a more cost-effective avenue. To date, the existing mouse Ψ site predictors were all developed based on sequence-derived features, and their performance can be further improved by adding the domain knowledge derived feature. Therefore, it is highly desirable to propose a genomic feature-based computational method to increase the accuracy and efficiency of the identification of Ψ RNA modification in the mouse transcriptome. In our study, a predictive framework PSI-MOUSE was built. Besides the conventional sequence-based features, PSI-MOUSE first introduced 38 additional genomic features derived from the mouse genome, which achieved a satisfactory improvement in the prediction performance, compared with other existing models. Moreover, PSI-MOUSE also features in automatically annotating the putative Ψ sites with diverse types of posttranscriptional regulations (RNA-binding protein [RBP]-binding regions, miRNA-RNA interactions, and splicing sites), which can serve as a useful research tool for the study of Ψ RNA modification in the mouse genome. Finally, 3282 experimentally validated mouse Ψ sites were also collected in a database with customized query functions. For the convenience of academic users, a website was built to provide a user-friendly interface for the query and analysis on the database. The website is freely accessible at www.xjtlu.edu.cn/biologicalsciences/psimouse and http://psimouse.rnamd.com. We introduced the genome-derived features to mouse for the first time, and we achieved a good performance in mouse Ψ site prediction. Compared with the existing state-of-art methods, our newly developed approach PSI-MOUSE obtained a substantial improvement in prediction accuracy, marking the reliable contributions of genomic features for the prediction of RNA modifications in a species other than human.

Abstract Image

Abstract Image

Abstract Image

PSI-MOUSE:从序列和基因组衍生特征预测小鼠假尿嘧啶位点。
假尿嘧啶(Ψ)是最早发现的转录后修饰,也是最常见的转录后修饰,在过去的几十年中被广泛研究。假尿嘧啶在几乎所有种类的 RNA 中都被观察到,并被证明具有重要的生物学功能。目前,实验方法耗时长、成本高,限制了其在Ψ位点检测中的实际应用。另外,利用Ψ测序数据的爆炸性增长,计算方法可能会提供一种更具成本效益的途径。迄今为止,现有的小鼠Ψ位点预测器都是基于序列衍生特征开发的,如果加入领域知识衍生特征,其性能还能进一步提高。因此,提出一种基于基因组特征的计算方法来提高鉴定小鼠转录组中Ψ RNA修饰的准确性和效率是非常可取的。在我们的研究中,建立了一个预测框架 PSI-MOUSE。除了传统的基于序列的特征外,PSI-MOUSE首先引入了38个来自小鼠基因组的额外基因组特征,与其他现有模型相比,预测性能有了令人满意的提高。此外,PSI-MOUSE还能自动注释具有不同转录后调控类型(RNA结合蛋白[RBP]结合区、miRNA-RNA相互作用和剪接位点)的推定Ψ位点,可作为研究小鼠基因组中ΨRNA修饰的有用工具。最后,该数据库还收集了3282个经实验验证的小鼠Ψ位点,并提供定制的查询功能。为了方便学术用户,我们建立了一个网站,为数据库的查询和分析提供友好的用户界面。该网站可在 www.xjtlu.edu.cn/biologicalsciences/psimouse 和 http://psimouse.rnamd.com 免费访问。我们首次将基因组衍生特征引入小鼠,并在小鼠Ψ位点预测方面取得了良好的效果。与现有的先进方法相比,我们新开发的 PSI-MOUSE 方法大大提高了预测的准确性,这标志着基因组特征对人类以外物种的 RNA 修饰预测做出了可靠的贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Evolutionary Bioinformatics
Evolutionary Bioinformatics 生物-进化生物学
CiteScore
4.20
自引率
0.00%
发文量
25
审稿时长
12 months
期刊介绍: Evolutionary Bioinformatics is an open access, peer reviewed international journal focusing on evolutionary bioinformatics. The journal aims to support understanding of organismal form and function through use of molecular, genetic, genomic and proteomic data by giving due consideration to its evolutionary context.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信