FUSION: a family-level integration approach for robust differential analysis of small non-coding RNAs.

IF 5.4
Hukam C Rawal, Qi Chen, Tong Zhou
{"title":"FUSION: a family-level integration approach for robust differential analysis of small non-coding RNAs.","authors":"Hukam C Rawal, Qi Chen, Tong Zhou","doi":"10.1093/bioinformatics/btaf526","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Beyond well-studied microRNAs, noncanonical small non-coding RNAs (sncRNAs) derived from longer parental templates such as tRNAs, rRNAs, and Y RNAs, are emerging as important regulators in various biological processes and diseases. Yet, analyzing these noncanonical sncRNAs from sequencing data remains challenging due to the intrinsic sequence heterogeneity and highly noisy nature. Conventional strategies either sum up all sequencing reads mapped to a parental RNA, which sacrifices the resolution of single sncRNA species, or treat each unique RNA species/sequence independently, which faces substantial noise in low-replicate settings.</p><p><strong>Results: </strong>Here, we introduce FUSION (Family-level Unique Small RNA Integration), a computational tool bridging these conventional approaches by first quantifying unique sncRNA species and then aggregating them into their respective parental RNA families. This family-level integration captures the contributions of individual sncRNA species while enhancing statistical power and robustness for differential abundance analysis. FUSION includes two modules: FUSION_ms, which reduces noise and amplifies signals for multiple-sample comparison to detect family-level abundance changes even with a small sample size, and FUSION_ps, which is powered by paired-sample analysis and optimized for \"1-on-1\" differential abundance analysis in single-case studies. Both modules are validated by cross-lab discoveries of dysregulated sncRNA families that could not be identified using conventional methods. In summary, FUSION provides a powerful framework for sncRNA sequencing data analysis, enhancing data interpretation and supporting small sample research.</p><p><strong>Availability and implementation: </strong>FUSION is available at https://github.com/cozyrna/FUSION and archived at https://doi.org/10.5281/zenodo.16929712.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12502913/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf526","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Beyond well-studied microRNAs, noncanonical small non-coding RNAs (sncRNAs) derived from longer parental templates such as tRNAs, rRNAs, and Y RNAs, are emerging as important regulators in various biological processes and diseases. Yet, analyzing these noncanonical sncRNAs from sequencing data remains challenging due to the intrinsic sequence heterogeneity and highly noisy nature. Conventional strategies either sum up all sequencing reads mapped to a parental RNA, which sacrifices the resolution of single sncRNA species, or treat each unique RNA species/sequence independently, which faces substantial noise in low-replicate settings.

Results: Here, we introduce FUSION (Family-level Unique Small RNA Integration), a computational tool bridging these conventional approaches by first quantifying unique sncRNA species and then aggregating them into their respective parental RNA families. This family-level integration captures the contributions of individual sncRNA species while enhancing statistical power and robustness for differential abundance analysis. FUSION includes two modules: FUSION_ms, which reduces noise and amplifies signals for multiple-sample comparison to detect family-level abundance changes even with a small sample size, and FUSION_ps, which is powered by paired-sample analysis and optimized for "1-on-1" differential abundance analysis in single-case studies. Both modules are validated by cross-lab discoveries of dysregulated sncRNA families that could not be identified using conventional methods. In summary, FUSION provides a powerful framework for sncRNA sequencing data analysis, enhancing data interpretation and supporting small sample research.

Availability and implementation: FUSION is available at https://github.com/cozyrna/FUSION and archived at https://doi.org/10.5281/zenodo.16929712.

FUSION:用于小型非编码rna的稳健差异分析的家族级集成方法。
动机:除了被充分研究的microrna之外,来自较长亲本模板的非规范小非编码rna (sncrna),如trna、RNAs和Y rna,正在成为各种生物过程和疾病的重要调节因子。然而,由于固有的序列异质性和高度嘈杂的性质,从测序数据中分析这些非规范sncrna仍然具有挑战性。传统的策略要么是将所有的测序读数汇总到一个亲本RNA上,这牺牲了单个sncRNA物种的分辨率,要么是单独处理每个独特的RNA物种/序列,这在低重复环境中面临着巨大的噪音。结果:在这里,我们引入FUSION(家族级独特小RNA整合),这是一种计算工具,通过首先量化独特的sncRNA物种,然后将它们聚集到各自的亲本RNA家族中,将这些传统方法连接起来。这种家族水平的整合捕获了单个sncRNA物种的贡献,同时增强了差异丰度分析的统计能力和稳健性。FUSION包括两个模块:FUSION_ms,它减少噪音和放大信号,用于多样本比较,以检测家庭水平的丰度变化,即使样样量小,和FUSION_ps,这是由成对样本分析和优化的“1对1”差异丰度分析在单例研究。这两个模块都通过跨实验室发现的常规方法无法识别的失调sncRNA家族得到验证。综上所述,FUSION为sncRNA测序数据分析提供了一个强大的框架,增强了数据解释和支持小样本研究。可获得性和实施:FUSION可在https://github.com/cozyrna/FUSION上获得,并在https://doi.org/10.5281/zenodo.16929712.Supplementary上存档:信息:补充表格和数据可在Bioinformatics在线上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信