从单细胞/细胞核RNA测序数据定量测量基因表达的精确性和准确性。

IF 7.9
Rujia Dai, Ming Zhang, Tianyao Chu, Richard Kopp, Chunling Zhang, Kefu Liu, Yue Wang, Xusheng Wang, Chao Chen, Chunyu Liu
{"title":"从单细胞/细胞核RNA测序数据定量测量基因表达的精确性和准确性。","authors":"Rujia Dai, Ming Zhang, Tianyao Chu, Richard Kopp, Chunling Zhang, Kefu Liu, Yue Wang, Xusheng Wang, Chao Chen, Chunyu Liu","doi":"10.1093/gpbjnl/qzaf077","DOIUrl":null,"url":null,"abstract":"<p><p>Single-cell and single-nucleus RNA sequencing (sc/snRNA-seq) have become essential tools for profiling gene expression across different cell types in biomedical research. While factors like RNA integrity, cell count, and sequencing depth are known to influence data quality, quantitative benchmarks and actionable guidelines are lacking. This gap contributes to variability in study designs and inconsistencies in downstream analyses. In this study, we systematically evaluated quantitative precision and accuracy in expression measures across 23 sc/snRNA-seq datasets comprising 3,682,576 cells from 339 samples. Precision was assessed using technical replicates based on pseudo-bulks created from subsampling. Accuracy was evaluated using sample-matched scRNA-seq and pooled-cell RNA sequencing (RNA-seq) data of mononuclear phagocytes from four species. Our results show that precision and accuracy are generally low at the single-cell level, with reproducibility being strongly influenced by cell count and RNA quality. We established data-driven thresholds for optimizing study design, recommending at least 500 cells per cell type per individual to achieve reliable quantification. Furthermore, we showed that signal-to-noise ratio is a key metric for identifying reproducible differentially expressed genes. To support future research, we developed Variability In single-Cell gene Expressions (VICE), a tool that evaluates sc/snRNA-seq data quality and estimates the true positive rate of differential expression results based on sample size, observed noise levels, and expected effect size. These findings provide practical, evidence-based guidelines to enhance the reliability and reproducibility of sc/snRNA-seq studies.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":7.9000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Precision and Accuracy in Quantitative Measurement of Gene Expression from Single-cell/nuclei RNA Sequencing Data.\",\"authors\":\"Rujia Dai, Ming Zhang, Tianyao Chu, Richard Kopp, Chunling Zhang, Kefu Liu, Yue Wang, Xusheng Wang, Chao Chen, Chunyu Liu\",\"doi\":\"10.1093/gpbjnl/qzaf077\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Single-cell and single-nucleus RNA sequencing (sc/snRNA-seq) have become essential tools for profiling gene expression across different cell types in biomedical research. While factors like RNA integrity, cell count, and sequencing depth are known to influence data quality, quantitative benchmarks and actionable guidelines are lacking. This gap contributes to variability in study designs and inconsistencies in downstream analyses. In this study, we systematically evaluated quantitative precision and accuracy in expression measures across 23 sc/snRNA-seq datasets comprising 3,682,576 cells from 339 samples. Precision was assessed using technical replicates based on pseudo-bulks created from subsampling. Accuracy was evaluated using sample-matched scRNA-seq and pooled-cell RNA sequencing (RNA-seq) data of mononuclear phagocytes from four species. Our results show that precision and accuracy are generally low at the single-cell level, with reproducibility being strongly influenced by cell count and RNA quality. We established data-driven thresholds for optimizing study design, recommending at least 500 cells per cell type per individual to achieve reliable quantification. Furthermore, we showed that signal-to-noise ratio is a key metric for identifying reproducible differentially expressed genes. To support future research, we developed Variability In single-Cell gene Expressions (VICE), a tool that evaluates sc/snRNA-seq data quality and estimates the true positive rate of differential expression results based on sample size, observed noise levels, and expected effect size. These findings provide practical, evidence-based guidelines to enhance the reliability and reproducibility of sc/snRNA-seq studies.</p>\",\"PeriodicalId\":94020,\"journal\":{\"name\":\"Genomics, proteomics & bioinformatics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":7.9000,\"publicationDate\":\"2025-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genomics, proteomics & bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/gpbjnl/qzaf077\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics, proteomics & bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/gpbjnl/qzaf077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

单细胞和单核RNA测序(sc/snRNA-seq)已成为生物医学研究中分析不同细胞类型基因表达的重要工具。虽然RNA完整性、细胞计数和测序深度等因素已知会影响数据质量,但缺乏定量基准和可操作的指导方针。这一差距导致了研究设计的可变性和下游分析的不一致性。在这项研究中,我们系统地评估了23个sc/snRNA-seq数据集表达测量的定量精度和准确性,这些数据集包括来自339个样本的3,682,576个细胞。使用基于从次抽样创建的伪批量的技术重复来评估精度。使用样本匹配的scRNA-seq和来自四个物种的单核吞噬细胞的池细胞RNA测序(RNA-seq)数据来评估准确性。我们的研究结果表明,在单细胞水平上,精度和准确性普遍较低,可重复性受到细胞计数和RNA质量的强烈影响。我们建立了数据驱动的阈值来优化研究设计,建议每个个体每种细胞类型至少500个细胞,以实现可靠的定量。此外,我们发现信噪比是鉴定可重复差异表达基因的关键指标。为了支持未来的研究,我们开发了单细胞基因表达变异性(VICE),这是一个评估sc/snRNA-seq数据质量的工具,并根据样本量、观察到的噪声水平和预期的效应大小估计差异表达结果的真阳性率。这些发现为提高sc/snRNA-seq研究的可靠性和可重复性提供了实用的、基于证据的指南。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Precision and Accuracy in Quantitative Measurement of Gene Expression from Single-cell/nuclei RNA Sequencing Data.

Single-cell and single-nucleus RNA sequencing (sc/snRNA-seq) have become essential tools for profiling gene expression across different cell types in biomedical research. While factors like RNA integrity, cell count, and sequencing depth are known to influence data quality, quantitative benchmarks and actionable guidelines are lacking. This gap contributes to variability in study designs and inconsistencies in downstream analyses. In this study, we systematically evaluated quantitative precision and accuracy in expression measures across 23 sc/snRNA-seq datasets comprising 3,682,576 cells from 339 samples. Precision was assessed using technical replicates based on pseudo-bulks created from subsampling. Accuracy was evaluated using sample-matched scRNA-seq and pooled-cell RNA sequencing (RNA-seq) data of mononuclear phagocytes from four species. Our results show that precision and accuracy are generally low at the single-cell level, with reproducibility being strongly influenced by cell count and RNA quality. We established data-driven thresholds for optimizing study design, recommending at least 500 cells per cell type per individual to achieve reliable quantification. Furthermore, we showed that signal-to-noise ratio is a key metric for identifying reproducible differentially expressed genes. To support future research, we developed Variability In single-Cell gene Expressions (VICE), a tool that evaluates sc/snRNA-seq data quality and estimates the true positive rate of differential expression results based on sample size, observed noise levels, and expected effect size. These findings provide practical, evidence-based guidelines to enhance the reliability and reproducibility of sc/snRNA-seq studies.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信