低覆盖率全基因组测序对标拷贝数变异检测。

IF 7.7 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Nan Wang, Zi-Yu Tao, Tao Wu, Jinyu Wang, Weiliang Wang, Huaqiu Shi, Xue-Song Liu
{"title":"低覆盖率全基因组测序对标拷贝数变异检测。","authors":"Nan Wang, Zi-Yu Tao, Tao Wu, Jinyu Wang, Weiliang Wang, Huaqiu Shi, Xue-Song Liu","doi":"10.1093/bib/bbaf514","DOIUrl":null,"url":null,"abstract":"<p><p>Low-coverage whole-genome sequencing (lcWGS) provides a cost-effective method for genome-wide copy number variation (CNV) profiling, yet its technical limitations and analytical variability require systematic evaluation. We benchmarked five CNV detection tools using simulated and real-world datasets, focusing on sequencing depth, formalin-fixed paraffin-embedded (FFPE) artifacts, tumor purity, multi-center reproducibility, and signature-level stability. Our results demonstrate that ichorCNA outperformed other tools in precision and runtime at high purity (≥50%), making it the optimal choice for lcWGS-based workflows. Prolonged FFPE fixation induced artifactual short-segment CNVs due to formalin-driven DNA fragmentation, a bias none of the tools could computationally correct, necessitating strict fixation time control or prioritization of fresh-frozen samples. Multi-center analysis revealed high reproducibility for the same tool across sequencing facilities, but comparisons between different tools showed low concordance. Copy number features extracted by the Wang et al. method exhibited superior stability across conditions compared with the Steele et al. method and the Tao et al. method. This study establishes actionable guidelines for lcWGS: prioritize ichorCNA (ensuring ≥50% tumor purity), optimize FFPE protocol, and use Wang et al. features to ensure robust copy number profiling in precision oncology.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 5","pages":""},"PeriodicalIF":7.7000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12481693/pdf/","citationCount":"0","resultStr":"{\"title\":\"Benchmarking copy number variation detection with low-coverage whole-genome sequencing.\",\"authors\":\"Nan Wang, Zi-Yu Tao, Tao Wu, Jinyu Wang, Weiliang Wang, Huaqiu Shi, Xue-Song Liu\",\"doi\":\"10.1093/bib/bbaf514\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Low-coverage whole-genome sequencing (lcWGS) provides a cost-effective method for genome-wide copy number variation (CNV) profiling, yet its technical limitations and analytical variability require systematic evaluation. We benchmarked five CNV detection tools using simulated and real-world datasets, focusing on sequencing depth, formalin-fixed paraffin-embedded (FFPE) artifacts, tumor purity, multi-center reproducibility, and signature-level stability. Our results demonstrate that ichorCNA outperformed other tools in precision and runtime at high purity (≥50%), making it the optimal choice for lcWGS-based workflows. Prolonged FFPE fixation induced artifactual short-segment CNVs due to formalin-driven DNA fragmentation, a bias none of the tools could computationally correct, necessitating strict fixation time control or prioritization of fresh-frozen samples. Multi-center analysis revealed high reproducibility for the same tool across sequencing facilities, but comparisons between different tools showed low concordance. Copy number features extracted by the Wang et al. method exhibited superior stability across conditions compared with the Steele et al. method and the Tao et al. method. This study establishes actionable guidelines for lcWGS: prioritize ichorCNA (ensuring ≥50% tumor purity), optimize FFPE protocol, and use Wang et al. features to ensure robust copy number profiling in precision oncology.</p>\",\"PeriodicalId\":9209,\"journal\":{\"name\":\"Briefings in bioinformatics\",\"volume\":\"26 5\",\"pages\":\"\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2025-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12481693/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Briefings in bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1093/bib/bbaf514\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf514","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

低覆盖全基因组测序(lcWGS)为全基因组拷贝数变异(CNV)分析提供了一种经济有效的方法,但其技术局限性和分析可变性需要系统的评估。我们使用模拟和真实数据集对五种CNV检测工具进行基准测试,重点关注测序深度、福尔马林固定石蜡包埋(FFPE)伪像、肿瘤纯度、多中心可重复性和特征级稳定性。我们的研究结果表明,在高纯度(≥50%)下,ichorCNA在精度和运行时间方面优于其他工具,使其成为基于lcwgs的工作流的最佳选择。由于福尔马林驱动的DNA碎片化,长时间的FFPE固定会导致人工短片段CNVs,没有一种工具可以在计算上纠正这种偏差,因此需要严格的固定时间控制或对新鲜冷冻样品进行优先排序。多中心分析显示,同一工具在不同测序设备间的重复性较高,但不同工具间的一致性较低。与Steele等人的方法和Tao等人的方法相比,Wang等人的方法提取的拷贝数特征在不同条件下都表现出更好的稳定性。本研究为lcWGS建立了可操作的指导方针:优先考虑ichorCNA(确保≥50%的肿瘤纯度),优化FFPE方案,并使用Wang等人的特征来确保精确肿瘤学中拷贝数分析的可靠性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Benchmarking copy number variation detection with low-coverage whole-genome sequencing.

Low-coverage whole-genome sequencing (lcWGS) provides a cost-effective method for genome-wide copy number variation (CNV) profiling, yet its technical limitations and analytical variability require systematic evaluation. We benchmarked five CNV detection tools using simulated and real-world datasets, focusing on sequencing depth, formalin-fixed paraffin-embedded (FFPE) artifacts, tumor purity, multi-center reproducibility, and signature-level stability. Our results demonstrate that ichorCNA outperformed other tools in precision and runtime at high purity (≥50%), making it the optimal choice for lcWGS-based workflows. Prolonged FFPE fixation induced artifactual short-segment CNVs due to formalin-driven DNA fragmentation, a bias none of the tools could computationally correct, necessitating strict fixation time control or prioritization of fresh-frozen samples. Multi-center analysis revealed high reproducibility for the same tool across sequencing facilities, but comparisons between different tools showed low concordance. Copy number features extracted by the Wang et al. method exhibited superior stability across conditions compared with the Steele et al. method and the Tao et al. method. This study establishes actionable guidelines for lcWGS: prioritize ichorCNA (ensuring ≥50% tumor purity), optimize FFPE protocol, and use Wang et al. features to ensure robust copy number profiling in precision oncology.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Briefings in bioinformatics
Briefings in bioinformatics 生物-生化研究方法
CiteScore
13.20
自引率
13.70%
发文量
549
审稿时长
6 months
期刊介绍: Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data. The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信