基于支持向量机的长链非编码RNA启动子预测方法

IF 1.9 4区 生物学 Q4 CELL BIOLOGY
Guohua Huang, Taigan Xue, Weihong Chen, Liangliang Huang, Qi Dai, JinYun Jiang
{"title":"基于支持向量机的长链非编码RNA启动子预测方法","authors":"Guohua Huang,&nbsp;Taigan Xue,&nbsp;Weihong Chen,&nbsp;Liangliang Huang,&nbsp;Qi Dai,&nbsp;JinYun Jiang","doi":"10.1049/syb2.70013","DOIUrl":null,"url":null,"abstract":"<p>Long non-coding RNAs (lncRNAs) are closely associated with the regulation of gene expression, whose promoters play a crucial role in comprehensively understanding lncRNA regulatory mechanisms, functions and their roles in diseases. Due to limitations of the current techniques, accurately identifying lncRNA promoters remains a challenge. To address this challenge, we propose a support vector machine (SVM)–based method for predicting lncRNA promoters, called SVM-LncRNAPro. This method uses position-specific trinucleotide propensity based on single-strand (PSTNPss) to encode the DNA sequences and employs an SVM as the learning algorithm. The SVM-LncRNAPro achieves state-of-the-art performance with reduced complexity. Additionally, experiments demonstrate that this method exhibits a strong generalisation ability. For the convenience of academic research, we have made the source code of SVM-LncRNAPro publicly available. Researchers can download the code and perform the prediction of the lncRNA promoter via the following link: https://github.com/TG0F7/Prom/tree/master.</p>","PeriodicalId":50379,"journal":{"name":"IET Systems Biology","volume":"19 1","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2025-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/syb2.70013","citationCount":"0","resultStr":"{\"title\":\"SVM-LncRNAPro: An SVM-Based Method for Predicting Long Noncoding RNA Promoters\",\"authors\":\"Guohua Huang,&nbsp;Taigan Xue,&nbsp;Weihong Chen,&nbsp;Liangliang Huang,&nbsp;Qi Dai,&nbsp;JinYun Jiang\",\"doi\":\"10.1049/syb2.70013\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Long non-coding RNAs (lncRNAs) are closely associated with the regulation of gene expression, whose promoters play a crucial role in comprehensively understanding lncRNA regulatory mechanisms, functions and their roles in diseases. Due to limitations of the current techniques, accurately identifying lncRNA promoters remains a challenge. To address this challenge, we propose a support vector machine (SVM)–based method for predicting lncRNA promoters, called SVM-LncRNAPro. This method uses position-specific trinucleotide propensity based on single-strand (PSTNPss) to encode the DNA sequences and employs an SVM as the learning algorithm. The SVM-LncRNAPro achieves state-of-the-art performance with reduced complexity. Additionally, experiments demonstrate that this method exhibits a strong generalisation ability. For the convenience of academic research, we have made the source code of SVM-LncRNAPro publicly available. Researchers can download the code and perform the prediction of the lncRNA promoter via the following link: https://github.com/TG0F7/Prom/tree/master.</p>\",\"PeriodicalId\":50379,\"journal\":{\"name\":\"IET Systems Biology\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-04-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1049/syb2.70013\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IET Systems Biology\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1049/syb2.70013\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"CELL BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Systems Biology","FirstCategoryId":"99","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/syb2.70013","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

长链非编码rna (Long non-coding RNAs, lncRNAs)与基因表达调控密切相关,其启动子对于全面了解lncRNA调控机制、功能及其在疾病中的作用起着至关重要的作用。由于当前技术的局限性,准确识别lncRNA启动子仍然是一个挑战。为了解决这一挑战,我们提出了一种基于支持向量机(SVM)的预测lncRNA启动子的方法,称为SVM- lncrnapro。该方法采用基于单链的位置特异性三核苷酸倾向(PSTNPss)对DNA序列进行编码,并采用支持向量机作为学习算法。SVM-LncRNAPro在降低复杂性的同时实现了最先进的性能。实验表明,该方法具有较强的泛化能力。为了方便学术研究,我们公开了SVM-LncRNAPro的源代码。研究人员可以通过以下链接下载代码并对lncRNA启动子进行预测:https://github.com/TG0F7/Prom/tree/master。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

SVM-LncRNAPro: An SVM-Based Method for Predicting Long Noncoding RNA Promoters

SVM-LncRNAPro: An SVM-Based Method for Predicting Long Noncoding RNA Promoters

Long non-coding RNAs (lncRNAs) are closely associated with the regulation of gene expression, whose promoters play a crucial role in comprehensively understanding lncRNA regulatory mechanisms, functions and their roles in diseases. Due to limitations of the current techniques, accurately identifying lncRNA promoters remains a challenge. To address this challenge, we propose a support vector machine (SVM)–based method for predicting lncRNA promoters, called SVM-LncRNAPro. This method uses position-specific trinucleotide propensity based on single-strand (PSTNPss) to encode the DNA sequences and employs an SVM as the learning algorithm. The SVM-LncRNAPro achieves state-of-the-art performance with reduced complexity. Additionally, experiments demonstrate that this method exhibits a strong generalisation ability. For the convenience of academic research, we have made the source code of SVM-LncRNAPro publicly available. Researchers can download the code and perform the prediction of the lncRNA promoter via the following link: https://github.com/TG0F7/Prom/tree/master.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IET Systems Biology
IET Systems Biology 生物-数学与计算生物学
CiteScore
4.20
自引率
4.30%
发文量
17
审稿时长
>12 weeks
期刊介绍: IET Systems Biology covers intra- and inter-cellular dynamics, using systems- and signal-oriented approaches. Papers that analyse genomic data in order to identify variables and basic relationships between them are considered if the results provide a basis for mathematical modelling and simulation of cellular dynamics. Manuscripts on molecular and cell biological studies are encouraged if the aim is a systems approach to dynamic interactions within and between cells. The scope includes the following topics: Genomics, transcriptomics, proteomics, metabolomics, cells, tissue and the physiome; molecular and cellular interaction, gene, cell and protein function; networks and pathways; metabolism and cell signalling; dynamics, regulation and control; systems, signals, and information; experimental data analysis; mathematical modelling, simulation and theoretical analysis; biological modelling, simulation, prediction and control; methodologies, databases, tools and algorithms for modelling and simulation; modelling, analysis and control of biological networks; synthetic biology and bioengineering based on systems biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信