Construction of RNA reference materials for improving the quantification of transcriptomic data.

IF 13.1 1区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Ying Yu, Wanwan Hou, Qingwang Chen, Xiaorou Guo, Leqing Sang, Hao Xue, Duo Wang, Jinming Li, Xiang Fang, Rui Zhang, Lianhua Dong, Leming Shi, Yuanting Zheng
{"title":"Construction of RNA reference materials for improving the quantification of transcriptomic data.","authors":"Ying Yu, Wanwan Hou, Qingwang Chen, Xiaorou Guo, Leqing Sang, Hao Xue, Duo Wang, Jinming Li, Xiang Fang, Rui Zhang, Lianhua Dong, Leming Shi, Yuanting Zheng","doi":"10.1038/s41596-024-01111-x","DOIUrl":null,"url":null,"abstract":"<p><p>RNA reference materials and their corresponding reference datasets act as the 'ground truth' for the normalization of experimental values and are indispensable tools for reliably measuring intrinsically small differences in RNA-sequencing data, such as those between molecular subtypes of diseases in clinical samples. However, the variability in 'absolute' expression profiles measured across different batches, methods or platforms limits the use of conventional RNA reference datasets. We recently proposed a ratio-based method for constructing reference datasets. The ratio for a gene is defined as the normalized expression levels between two sample groups and produces more reliable values than the 'absolute' values obtained across diverse transcriptomic technologies and batches. Our gene ratios have been used for the successful generation of omics-wide reference datasets. Here, we describe a step-by-step process for establishing RNA reference materials and reference datasets, covering three stages: (1) reference materials, including material preparation, homogeneity testing and stability testing; (2) ratio-based reference datasets, including characterization, uncertainty estimation and orthogonal validation; and (3) applications, including definition of performance metrics, performing proficiency tests and diagnosing and correcting batch effects. This approach established the Quartet RNA reference materials and reference datasets (chinese-quartet.org) that have been approved as the first suite of nationally certified RNA reference materials by China's State Administration for Market Regulation. The protocol can be utilized to establish and apply reference materials to improve RNA-sequencing data quality in diverse clinical settings. The procedure can be completed in 2 d and requires expertise in molecular biology and bioinformatics.</p>","PeriodicalId":18901,"journal":{"name":"Nature Protocols","volume":" ","pages":""},"PeriodicalIF":13.1000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Protocols","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1038/s41596-024-01111-x","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

RNA reference materials and their corresponding reference datasets act as the 'ground truth' for the normalization of experimental values and are indispensable tools for reliably measuring intrinsically small differences in RNA-sequencing data, such as those between molecular subtypes of diseases in clinical samples. However, the variability in 'absolute' expression profiles measured across different batches, methods or platforms limits the use of conventional RNA reference datasets. We recently proposed a ratio-based method for constructing reference datasets. The ratio for a gene is defined as the normalized expression levels between two sample groups and produces more reliable values than the 'absolute' values obtained across diverse transcriptomic technologies and batches. Our gene ratios have been used for the successful generation of omics-wide reference datasets. Here, we describe a step-by-step process for establishing RNA reference materials and reference datasets, covering three stages: (1) reference materials, including material preparation, homogeneity testing and stability testing; (2) ratio-based reference datasets, including characterization, uncertainty estimation and orthogonal validation; and (3) applications, including definition of performance metrics, performing proficiency tests and diagnosing and correcting batch effects. This approach established the Quartet RNA reference materials and reference datasets (chinese-quartet.org) that have been approved as the first suite of nationally certified RNA reference materials by China's State Administration for Market Regulation. The protocol can be utilized to establish and apply reference materials to improve RNA-sequencing data quality in diverse clinical settings. The procedure can be completed in 2 d and requires expertise in molecular biology and bioinformatics.

构建RNA标准物质,提高转录组学数据的定量化。
RNA参考材料及其相应的参考数据集作为实验值归一化的“基础真相”,是可靠地测量RNA测序数据内在微小差异(例如临床样品中疾病分子亚型之间的差异)的不可或缺的工具。然而,在不同批次、方法或平台上测量的“绝对”表达谱的可变性限制了传统RNA参考数据集的使用。我们最近提出了一种基于比率的方法来构建参考数据集。一个基因的比率被定义为两个样本组之间的标准化表达水平,并且产生比通过不同转录组技术和批次获得的“绝对”值更可靠的值。我们的基因比率已被用于成功生成全组学参考数据集。在这里,我们描述了一个逐步建立RNA标准物质和标准数据集的过程,包括三个阶段:(1)标准物质,包括材料制备、均匀性测试和稳定性测试;(2)基于比例的参考数据集,包括表征、不确定度估计和正交验证;(3)应用,包括绩效指标的定义、能力测试的执行以及批量效应的诊断和纠正。该方法建立了四方RNA参考物质和参考数据集(chinesequartet.org),已被中国国家市场监督管理总局批准为第一套国家认证的RNA参考物质。该方案可用于建立和应用参考材料,以提高不同临床环境中的rna测序数据质量。该过程可以在二维中完成,需要分子生物学和生物信息学方面的专业知识。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Nature Protocols
Nature Protocols 生物-生化研究方法
CiteScore
29.10
自引率
0.70%
发文量
128
审稿时长
4 months
期刊介绍: Nature Protocols focuses on publishing protocols used to address significant biological and biomedical science research questions, including methods grounded in physics and chemistry with practical applications to biological problems. The journal caters to a primary audience of research scientists and, as such, exclusively publishes protocols with research applications. Protocols primarily aimed at influencing patient management and treatment decisions are not featured. The specific techniques covered encompass a wide range, including but not limited to: Biochemistry, Cell biology, Cell culture, Chemical modification, Computational biology, Developmental biology, Epigenomics, Genetic analysis, Genetic modification, Genomics, Imaging, Immunology, Isolation, purification, and separation, Lipidomics, Metabolomics, Microbiology, Model organisms, Nanotechnology, Neuroscience, Nucleic-acid-based molecular biology, Pharmacology, Plant biology, Protein analysis, Proteomics, Spectroscopy, Structural biology, Synthetic chemistry, Tissue culture, Toxicology, and Virology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信