{"title":"SplitFusion enables ultrasensitive gene fusion detection and reveals fusion variant-associated tumor heterogeneity.","authors":"Weiwei Bian, Baifeng Zhang, Zhengbo Song, Binyamin A Knisbacher, Yee Man Chan, Chloe Bao, Chunwei Xu, Wenxian Wang, Athena Hoi Yee Chu, Chenyu Lu, Hongxian Wang, Siyu Bao, Zhenyu Gong, Hoi Yee Keung, Zi-Ying Maggie Chow, Yiping Zhang, Wah Cheuk, Gad Getz, Valentina Nardi, Mengsu Yang, William Chi Shing Cho, Jian Wang, Juxiang Chen, Zongli Zheng","doi":"10.1016/j.patter.2025.101174","DOIUrl":null,"url":null,"abstract":"<p><p>Gene fusions are common cancer drivers and therapeutic targets, but clinical-grade open-source bioinformatic tools are lacking. Here, we introduce a fusion detection method named SplitFusion, which is fast by leveraging Burrows-Wheeler Aligner-maximal exact match (BWA-MEM) split alignments, can detect cryptic splice-site fusions (e.g., <i>EML4::ALK</i> v3b and <i>ARv7</i>), call fusions involving highly repetitive gene partners (e.g., <i>CIC::DUX4</i>), and infer frame-ness and exon-boundary alignments for functional prediction and minimizing false positives. Using 1,848 datasets of various sizes, SplitFusion demonstrated superior sensitivity and specificity compared to three other tools. In 1,076 formalin-fixed paraffin-embedded lung cancer samples, SplitFusion identified novel fusions and revealed that <i>EML4::ALK</i> variant 3 was associated with multiple fusion variants coexisting in the same tumor. Additionally, SplitFusion can call targeted splicing variants. Using data from 515 The Cancer Genome Atlas (TCGA) samples, SplitFusion showed the highest sensitivity and uncovered two cases of <i>SLC34A2::ROS1</i> that were missed in previous studies. These capabilities make SplitFusion highly suitable for clinical applications and the study of fusion-defined tumor heterogeneity.</p>","PeriodicalId":36242,"journal":{"name":"Patterns","volume":"6 2","pages":"101174"},"PeriodicalIF":6.7000,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11873004/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Patterns","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.patter.2025.101174","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Gene fusions are common cancer drivers and therapeutic targets, but clinical-grade open-source bioinformatic tools are lacking. Here, we introduce a fusion detection method named SplitFusion, which is fast by leveraging Burrows-Wheeler Aligner-maximal exact match (BWA-MEM) split alignments, can detect cryptic splice-site fusions (e.g., EML4::ALK v3b and ARv7), call fusions involving highly repetitive gene partners (e.g., CIC::DUX4), and infer frame-ness and exon-boundary alignments for functional prediction and minimizing false positives. Using 1,848 datasets of various sizes, SplitFusion demonstrated superior sensitivity and specificity compared to three other tools. In 1,076 formalin-fixed paraffin-embedded lung cancer samples, SplitFusion identified novel fusions and revealed that EML4::ALK variant 3 was associated with multiple fusion variants coexisting in the same tumor. Additionally, SplitFusion can call targeted splicing variants. Using data from 515 The Cancer Genome Atlas (TCGA) samples, SplitFusion showed the highest sensitivity and uncovered two cases of SLC34A2::ROS1 that were missed in previous studies. These capabilities make SplitFusion highly suitable for clinical applications and the study of fusion-defined tumor heterogeneity.
基因融合是常见的癌症驱动因素和治疗靶点,但缺乏临床级的开源生物信息学工具。在这里,我们介绍了一种名为SplitFusion的融合检测方法,该方法通过利用Burrows-Wheeler aligner - maximum exact match (BWA-MEM)分裂比对来快速检测隐剪接位点融合(例如,EML4::ALK v3b和ARv7),调用涉及高度重复基因伴侣的融合(例如,CIC::DUX4),并推断框架性和外显子边界比对,以进行功能预测和最小化假阳性。使用1,848个不同大小的数据集,与其他三种工具相比,SplitFusion显示出更高的灵敏度和特异性。在1076份福尔马林固定石蜡包埋肺癌样本中,SplitFusion发现了新的融合,并发现EML4::ALK变体3与同一肿瘤中共存的多种融合变体相关。此外,SplitFusion可以调用目标剪接变体。使用来自515个癌症基因组图谱(TCGA)样本的数据,SplitFusion显示出最高的灵敏度,并发现了先前研究中遗漏的2例SLC34A2::ROS1。这些功能使得SplitFusion非常适合临床应用和融合定义肿瘤异质性的研究。