Qingxiang Guo, Yangyang Li, Tingyou Wang, Abhi Ramakrishnan, Rendong Yang
{"title":"OctopusV和TentacleSV:用于多样本、跨平台结构变体比较和分析的一站式工具包。","authors":"Qingxiang Guo, Yangyang Li, Tingyou Wang, Abhi Ramakrishnan, Rendong Yang","doi":"10.1101/2025.03.24.645012","DOIUrl":null,"url":null,"abstract":"<p><p>Structural variants (SVs) significantly influence genomic variability and disease, but their accurate analysis across multiple samples and sequencing platforms remains challenging. We developed OctopusV, a tool that standardizes ambiguous breakend (BND) annotations into canonical SV types (inversions, duplications, translocations) and integrates variant calls using flexible set operations, such as union, intersection, difference, and complement, enabling cohort-specific variant identification. Together with TentacleSV, an automated pipeline, OctopusV provides an end-to-end solution from raw data to final callsets. Evaluations show improved precision, recall, and consistency, highlighting its value in cancer genomics and rare disease diagnostics. Both tools are available at https://github.com/ylab-hi/OctopusV and https://github.com/ylab-hi/TentacleSV.</p>","PeriodicalId":519960,"journal":{"name":"bioRxiv : the preprint server for biology","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11974888/pdf/","citationCount":"0","resultStr":"{\"title\":\"OctopusV and TentacleSV: a one-stop toolkit for multi-sample, cross-platform structural variant comparison and analysis.\",\"authors\":\"Qingxiang Guo, Yangyang Li, Tingyou Wang, Abhi Ramakrishnan, Rendong Yang\",\"doi\":\"10.1101/2025.03.24.645012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Structural variants (SVs) significantly influence genomic variability and disease, but their accurate analysis across multiple samples and sequencing platforms remains challenging. We developed OctopusV, a tool that standardizes ambiguous breakend (BND) annotations into canonical SV types (inversions, duplications, translocations) and integrates variant calls using flexible set operations, such as union, intersection, difference, and complement, enabling cohort-specific variant identification. Together with TentacleSV, an automated pipeline, OctopusV provides an end-to-end solution from raw data to final callsets. Evaluations show improved precision, recall, and consistency, highlighting its value in cancer genomics and rare disease diagnostics. Both tools are available at https://github.com/ylab-hi/OctopusV and https://github.com/ylab-hi/TentacleSV.</p>\",\"PeriodicalId\":519960,\"journal\":{\"name\":\"bioRxiv : the preprint server for biology\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-03-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11974888/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"bioRxiv : the preprint server for biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2025.03.24.645012\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"bioRxiv : the preprint server for biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.03.24.645012","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
OctopusV and TentacleSV: a one-stop toolkit for multi-sample, cross-platform structural variant comparison and analysis.
Structural variants (SVs) significantly influence genomic variability and disease, but their accurate analysis across multiple samples and sequencing platforms remains challenging. We developed OctopusV, a tool that standardizes ambiguous breakend (BND) annotations into canonical SV types (inversions, duplications, translocations) and integrates variant calls using flexible set operations, such as union, intersection, difference, and complement, enabling cohort-specific variant identification. Together with TentacleSV, an automated pipeline, OctopusV provides an end-to-end solution from raw data to final callsets. Evaluations show improved precision, recall, and consistency, highlighting its value in cancer genomics and rare disease diagnostics. Both tools are available at https://github.com/ylab-hi/OctopusV and https://github.com/ylab-hi/TentacleSV.