Qingxiang Guo, Yangyang Li, Tingyou Wang, Abhi Ramakrishnan, Rendong Yang
{"title":"OctopusV and TentacleSV: a one-stop toolkit for multi-sample, cross-platform structural variant comparison and analysis.","authors":"Qingxiang Guo, Yangyang Li, Tingyou Wang, Abhi Ramakrishnan, Rendong Yang","doi":"10.1101/2025.03.24.645012","DOIUrl":null,"url":null,"abstract":"<p><p>Structural variants (SVs) significantly influence genomic variability and disease, but their accurate analysis across multiple samples and sequencing platforms remains challenging. We developed OctopusV, a tool that standardizes ambiguous breakend (BND) annotations into canonical SV types (inversions, duplications, translocations) and integrates variant calls using flexible set operations, such as union, intersection, difference, and complement, enabling cohort-specific variant identification. Together with TentacleSV, an automated pipeline, OctopusV provides an end-to-end solution from raw data to final callsets. Evaluations show improved precision, recall, and consistency, highlighting its value in cancer genomics and rare disease diagnostics. Both tools are available at https://github.com/ylab-hi/OctopusV and https://github.com/ylab-hi/TentacleSV.</p>","PeriodicalId":519960,"journal":{"name":"bioRxiv : the preprint server for biology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11974888/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv : the preprint server for biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.03.24.645012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Structural variants (SVs) significantly influence genomic variability and disease, but their accurate analysis across multiple samples and sequencing platforms remains challenging. We developed OctopusV, a tool that standardizes ambiguous breakend (BND) annotations into canonical SV types (inversions, duplications, translocations) and integrates variant calls using flexible set operations, such as union, intersection, difference, and complement, enabling cohort-specific variant identification. Together with TentacleSV, an automated pipeline, OctopusV provides an end-to-end solution from raw data to final callsets. Evaluations show improved precision, recall, and consistency, highlighting its value in cancer genomics and rare disease diagnostics. Both tools are available at https://github.com/ylab-hi/OctopusV and https://github.com/ylab-hi/TentacleSV.