Systematic benchmarking of tools for structural variation detection using short- and long-read sequencing data in pigs

IF 4.6 2区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES
Sang He , Bangmin Song , Yueting Tang , Xiaolu Qu , Xingzheng Li , Xintong Yang , Qi Bao , Lingzhao Fang , Jicai Jiang , Zhonglin Tang , Guoqiang Yi
{"title":"Systematic benchmarking of tools for structural variation detection using short- and long-read sequencing data in pigs","authors":"Sang He ,&nbsp;Bangmin Song ,&nbsp;Yueting Tang ,&nbsp;Xiaolu Qu ,&nbsp;Xingzheng Li ,&nbsp;Xintong Yang ,&nbsp;Qi Bao ,&nbsp;Lingzhao Fang ,&nbsp;Jicai Jiang ,&nbsp;Zhonglin Tang ,&nbsp;Guoqiang Yi","doi":"10.1016/j.isci.2025.111983","DOIUrl":null,"url":null,"abstract":"<div><div>Evaluating diverse structural variation (SV) detection-relevant programs leveraging different algorithms has become a pressing need in humans and farm animals. We addressed this by sequencing five genetically diverse pig individuals (breeds) with short- and long-read DNA-sequencing platforms. We created the SV benchmark set for each breed and assessed the performance of 16 SV calling-relevant tools. Results showed that long-read platforms enabled detecting many SVs missed by short-read platforms with similar precision. Benchmark SVs, mainly 200–500 bp insertions/deletions, had high validation rates. The assembly-based SV calling program SVIM-asm showed superior detection performance and resource consumption. The SVs with more supporting reads, sizes under 1 kb, outside simple repeat area, in low GC content and runs of homozygosity regions, had higher detection accuracy. Alignment-based tools performed well even at 5 <span><math><mrow><mo>×</mo></mrow></math></span> depth. Our study provides systematic guidance for an optimal SV calling pipeline in pigs and other farm animals.</div></div>","PeriodicalId":342,"journal":{"name":"iScience","volume":"28 3","pages":"Article 111983"},"PeriodicalIF":4.6000,"publicationDate":"2025-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"iScience","FirstCategoryId":"103","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589004225002433","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Evaluating diverse structural variation (SV) detection-relevant programs leveraging different algorithms has become a pressing need in humans and farm animals. We addressed this by sequencing five genetically diverse pig individuals (breeds) with short- and long-read DNA-sequencing platforms. We created the SV benchmark set for each breed and assessed the performance of 16 SV calling-relevant tools. Results showed that long-read platforms enabled detecting many SVs missed by short-read platforms with similar precision. Benchmark SVs, mainly 200–500 bp insertions/deletions, had high validation rates. The assembly-based SV calling program SVIM-asm showed superior detection performance and resource consumption. The SVs with more supporting reads, sizes under 1 kb, outside simple repeat area, in low GC content and runs of homozygosity regions, had higher detection accuracy. Alignment-based tools performed well even at 5 × depth. Our study provides systematic guidance for an optimal SV calling pipeline in pigs and other farm animals.

Abstract Image

利用不同的算法评估各种结构变异(SV)检测相关程序已成为人类和农场动物的迫切需要。为此,我们利用短线程和长线程 DNA 测序平台对 5 个不同基因的猪个体(品种)进行了测序。我们为每个品种创建了 SV 基准集,并评估了 16 种 SV 调用相关工具的性能。结果表明,长读取平台能以相似的精度检测出许多被短读取平台遗漏的 SV。基准 SV(主要是 200-500 bp 插入/缺失)的验证率很高。基于汇编的 SV 调用程序 SVIM-asm 在检测性能和资源消耗方面都表现优异。具有更多支持读数、大小在 1 kb 以下、在简单重复区域之外、在低 GC 含量和同源性区域运行的 SV 具有更高的检测准确率。基于比对的工具即使在 5 × 深度时也表现良好。我们的研究为猪和其他农场动物的最佳 SV 调用管道提供了系统指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
iScience
iScience Multidisciplinary-Multidisciplinary
CiteScore
7.20
自引率
1.70%
发文量
1972
审稿时长
6 weeks
期刊介绍: Science has many big remaining questions. To address them, we will need to work collaboratively and across disciplines. The goal of iScience is to help fuel that type of interdisciplinary thinking. iScience is a new open-access journal from Cell Press that provides a platform for original research in the life, physical, and earth sciences. The primary criterion for publication in iScience is a significant contribution to a relevant field combined with robust results and underlying methodology. The advances appearing in iScience include both fundamental and applied investigations across this interdisciplinary range of topic areas. To support transparency in scientific investigation, we are happy to consider replication studies and papers that describe negative results. We know you want your work to be published quickly and to be widely visible within your community and beyond. With the strong international reputation of Cell Press behind it, publication in iScience will help your work garner the attention and recognition it merits. Like all Cell Press journals, iScience prioritizes rapid publication. Our editorial team pays special attention to high-quality author service and to efficient, clear-cut decisions based on the information available within the manuscript. iScience taps into the expertise across Cell Press journals and selected partners to inform our editorial decisions and help publish your science in a timely and seamless way.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信