Reference-informed evaluation of batch correction for single-cell omics data with overcorrection awareness.

IF 5.2 1区 生物学 Q1 BIOLOGY
Xiaoyue Hu, He Li, Ming Chen, Junbin Qian, Hangjin Jiang
{"title":"Reference-informed evaluation of batch correction for single-cell omics data with overcorrection awareness.","authors":"Xiaoyue Hu, He Li, Ming Chen, Junbin Qian, Hangjin Jiang","doi":"10.1038/s42003-025-07947-7","DOIUrl":null,"url":null,"abstract":"<p><p>Batch effect correction (BEC) is fundamental to integrate multiple single-cell RNA sequencing datasets, and its success is critical to empower in-depth interrogation for biological insights. However, no simple metric is available to evaluate BEC performance with sensitivity to data overcorrection, which erases true biological variations and leads to false biological discoveries. Here, we propose RBET, a reference-informed statistical framework for evaluating the success of BEC. Using extensive simulations and six real data examples including scRNA-seq and scATAC-seq datasets with different numbers of batches, batch effect sizes and numbers of cell types, we demonstrate that RBET evaluates the performance of BEC methods more fairly with biologically meaningful insights from data, while other methods may lead to false results. Moreover, RBET is computationally efficient, sensitive to overcorrection and robust to large batch effect sizes. Thus, RBET provides a robust guideline on selecting case-specific BEC method, and the concept of RBET is extendable to other modalities.</p>","PeriodicalId":10552,"journal":{"name":"Communications Biology","volume":"8 1","pages":"521"},"PeriodicalIF":5.2000,"publicationDate":"2025-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11954866/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications Biology","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s42003-025-07947-7","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Batch effect correction (BEC) is fundamental to integrate multiple single-cell RNA sequencing datasets, and its success is critical to empower in-depth interrogation for biological insights. However, no simple metric is available to evaluate BEC performance with sensitivity to data overcorrection, which erases true biological variations and leads to false biological discoveries. Here, we propose RBET, a reference-informed statistical framework for evaluating the success of BEC. Using extensive simulations and six real data examples including scRNA-seq and scATAC-seq datasets with different numbers of batches, batch effect sizes and numbers of cell types, we demonstrate that RBET evaluates the performance of BEC methods more fairly with biologically meaningful insights from data, while other methods may lead to false results. Moreover, RBET is computationally efficient, sensitive to overcorrection and robust to large batch effect sizes. Thus, RBET provides a robust guideline on selecting case-specific BEC method, and the concept of RBET is extendable to other modalities.

具有过校正意识的单细胞组学数据批量校正的参考评估。
批效应校正(BEC)是整合多个单细胞RNA测序数据集的基础,它的成功对深入探究生物学见解至关重要。然而,没有一个简单的指标可以用来评估BEC的性能,因为它对数据过度校正的敏感性会抹去真正的生物学变化,导致错误的生物学发现。在此,我们提出了RBET,这是一个评估BEC成功与否的参考信息统计框架。通过广泛的模拟和6个真实数据示例,包括scRNA-seq和scATAC-seq数据集,具有不同的批数,批效应大小和细胞类型数量,我们证明RBET更公平地评估BEC方法的性能,从数据中获得有生物学意义的见解,而其他方法可能导致错误的结果。此外,RBET具有计算效率高、对过校正敏感和对大批量效应的鲁棒性。因此,RBET为选择特定案例的BEC方法提供了一个健壮的指导方针,并且RBET的概念可扩展到其他模式。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Communications Biology
Communications Biology Medicine-Medicine (miscellaneous)
CiteScore
8.60
自引率
1.70%
发文量
1233
审稿时长
13 weeks
期刊介绍: Communications Biology is an open access journal from Nature Research publishing high-quality research, reviews and commentary in all areas of the biological sciences. Research papers published by the journal represent significant advances bringing new biological insight to a specialized area of research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信