Volcano: a pipeline to characterize long terminal repeat-retrotransposons families in plants.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Bioinformatics Advances. Pub Date: 2025-07-04. eCollection Date: 2025-01-01. DOI: 10.1093/bioadv/vbaf162
Hao He, Fei Shen, Yong Hou, Xiaozeng Yang

Abstract

Motivation: Long terminal repeat retrotransposons (LTR-RTs) make up a significant portion of the repetitive sequences in many plant species. LTR-RTs are functionally important: they can affect gene family function and contribute to the formation of new genes. Investigating the abundance and activity of LTR-RTs is essential for understanding species' evolutionary dynamics and the mechanisms that drive genome evolution. Although current software can predict LTR-RTs and provide an initial classification, more comprehensive and efficient software is needed to characterize and quantify the LTR-RTs involved in burst events and to classify them in detail, especially given the surging demand for genome annotation.

Results: We developed a pipeline called Volcano to accurately classify LTR-RTs and characterize burst families in plants. To distinguish different clades of LTR-RTs, we implemented an improved depth-first search algorithm. Volcano can also quantify LTR-RT expression from RNA-seq data. In an analysis of LTR-RTs in three genomes from the Asteraceae family, larger genomes tended to contain more LTR-RTs, and Volcano effectively categorized them at the clade level.
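
The abstract does not describe how the improved depth-first search works, so the following is only a minimal sketch of a plain depth-first search, not Volcano's algorithm. It assumes, purely for illustration, that LTR-RT elements have already been linked by some pairwise similarity criterion and that each connected group is treated as one candidate clade. The element IDs, the edge list, and the function names `build_graph` and `dfs_groups` are hypothetical.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected adjacency map from (element_a, element_b) pairs."""
    graph = defaultdict(set)
    for a, b in edges:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def dfs_groups(elements, edges):
    """Group elements that are connected through similarity edges.

    Uses an explicit stack (iterative depth-first search) so that large
    groups do not hit Python's recursion limit.
    """
    graph = build_graph(edges)
    seen = set()
    groups = []
    for start in elements:
        if start in seen:
            continue
        stack = [start]
        seen.add(start)
        group = []
        while stack:
            node = stack.pop()
            group.append(node)
            # Sort neighbours only to make the traversal order deterministic.
            for neighbour in sorted(graph[node]):
                if neighbour not in seen:
                    seen.add(neighbour)
                    stack.append(neighbour)
        groups.append(sorted(group))
    return groups


# Hypothetical element IDs and similarity links, for illustration only.
elements = ["ltr1", "ltr2", "ltr3", "ltr4", "ltr5"]
edges = [("ltr1", "ltr2"), ("ltr2", "ltr3"), ("ltr4", "ltr5")]
groups = dfs_groups(elements, edges)
print(groups)  # [['ltr1', 'ltr2', 'ltr3'], ['ltr4', 'ltr5']]
```

In this sketch, an element with no similarity links forms a group of its own. How Volcano defines links between elements and how its search improves on this baseline would need to be checked against the paper and the repository.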

Availability and implementation: The Volcano pipeline can be downloaded from https://github.com/Suosihe/volcano_LTR.
