Benchmarking Ploidy Estimation Methods for Bulk and Single-Cell Whole Genome Sequencing.

IF 14.1 1区 材料科学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Yawei Song, Zilv Mei, Qijie Zheng, Qingqing Yuan, Yu Liang, Jiaqi Gao, Lang Zhou, Shuheng Wu, Wei Wu
{"title":"Benchmarking Ploidy Estimation Methods for Bulk and Single-Cell Whole Genome Sequencing.","authors":"Yawei Song, Zilv Mei, Qijie Zheng, Qingqing Yuan, Yu Liang, Jiaqi Gao, Lang Zhou, Shuheng Wu, Wei Wu","doi":"10.1002/advs.202507839","DOIUrl":null,"url":null,"abstract":"<p><p>Maintaining cellular ploidy is critical for normal physiological processes, although gains in ploidy are frequently observed during development, tissue regeneration, and metabolism, and potentially contribute to aneuploidy, thereby promoting tumor evolution. Although numerous computational tools have been developed to estimate cellular ploidy from whole-genome sequencing (WGS) data at bulk or single-cell resolution, to the knowledge, no systematic comparison of their performance has been conducted. Here, a benchmarking study is presented of 11 methods for bulk WGS and 8 methods for single-cell WGS data, utilizing both experimental and simulated datasets derived from diploid cells mixed with aneuploid or polyploid cells. For bulk WGS tools, their performance is evaluated in estimating tumor purity and ploidy, as well as the influence of preprocessing steps, somatic mutation callers, purity, sequencing platforms, and depths. It is found that PURPLE outperforms other methods when tumor purity exceeded 30%, regardless of sequencing coverage or platform. However, all existing tools performed poorly applied to euploid samples or long-read sequencing data. For single-cell WGS tools, their ploidy detection accuracy is assessed, and SeCNV is identified as the top-performing method. These findings provide valuable guidance for future research on ploidy analysis and ongoing improvements in computational tools for single-cell sequencing data.</p>","PeriodicalId":117,"journal":{"name":"Advanced Science","volume":" ","pages":"e07839"},"PeriodicalIF":14.1000,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Science","FirstCategoryId":"88","ListUrlMain":"https://doi.org/10.1002/advs.202507839","RegionNum":1,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Maintaining cellular ploidy is critical for normal physiological processes, although gains in ploidy are frequently observed during development, tissue regeneration, and metabolism, and potentially contribute to aneuploidy, thereby promoting tumor evolution. Although numerous computational tools have been developed to estimate cellular ploidy from whole-genome sequencing (WGS) data at bulk or single-cell resolution, to the knowledge, no systematic comparison of their performance has been conducted. Here, a benchmarking study is presented of 11 methods for bulk WGS and 8 methods for single-cell WGS data, utilizing both experimental and simulated datasets derived from diploid cells mixed with aneuploid or polyploid cells. For bulk WGS tools, their performance is evaluated in estimating tumor purity and ploidy, as well as the influence of preprocessing steps, somatic mutation callers, purity, sequencing platforms, and depths. It is found that PURPLE outperforms other methods when tumor purity exceeded 30%, regardless of sequencing coverage or platform. However, all existing tools performed poorly applied to euploid samples or long-read sequencing data. For single-cell WGS tools, their ploidy detection accuracy is assessed, and SeCNV is identified as the top-performing method. These findings provide valuable guidance for future research on ploidy analysis and ongoing improvements in computational tools for single-cell sequencing data.

批量和单细胞全基因组测序的基准倍性估计方法。
维持细胞倍性对正常的生理过程至关重要,尽管在发育、组织再生和代谢过程中经常观察到倍性的增加,并可能导致非整倍性,从而促进肿瘤的进化。尽管已经开发了许多计算工具来估计细胞倍性,从整体或单细胞分辨率的全基因组测序(WGS)数据中,据了解,还没有对它们的性能进行系统的比较。本文利用二倍体细胞与非整倍体或多倍体细胞混合的实验和模拟数据集,对11种批量WGS方法和8种单细胞WGS数据进行了基准研究。对于批量WGS工具,评估其在估计肿瘤纯度和倍性方面的性能,以及预处理步骤、体细胞突变调用者、纯度、测序平台和深度的影响。发现当肿瘤纯度超过30%时,无论测序覆盖率或平台如何,PURPLE都优于其他方法。然而,所有现有的工具都表现不佳,适用于整倍体样本或长读测序数据。对于单细胞WGS工具,对其倍性检测精度进行了评估,并确定了SeCNV是表现最好的方法。这些发现为未来的倍性分析研究和单细胞测序数据计算工具的持续改进提供了有价值的指导。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advanced Science
Advanced Science CHEMISTRY, MULTIDISCIPLINARYNANOSCIENCE &-NANOSCIENCE & NANOTECHNOLOGY
CiteScore
18.90
自引率
2.60%
发文量
1602
审稿时长
1.9 months
期刊介绍: Advanced Science is a prestigious open access journal that focuses on interdisciplinary research in materials science, physics, chemistry, medical and life sciences, and engineering. The journal aims to promote cutting-edge research by employing a rigorous and impartial review process. It is committed to presenting research articles with the highest quality production standards, ensuring maximum accessibility of top scientific findings. With its vibrant and innovative publication platform, Advanced Science seeks to revolutionize the dissemination and organization of scientific knowledge.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信