Construction of RNA reference materials for improving the quantification of transcriptomic data.
Authors: Ying Yu, Wanwan Hou, Qingwang Chen, Xiaorou Guo, Leqing Sang, Hao Xue, Duo Wang, Jinming Li, Xiang Fang, Rui Zhang, Lianhua Dong, Leming Shi, Yuanting Zheng
Journal: Nature Protocols (impact factor 13.1; JCR Q1, Biochemical Research Methods)
Publication date: 2025-02-18
Publication type: Journal Article
DOI: 10.1038/s41596-024-01111-x (https://doi.org/10.1038/s41596-024-01111-x)
Citations: 0
Abstract
RNA reference materials and their corresponding reference datasets act as the 'ground truth' for the normalization of experimental values and are indispensable tools for reliably measuring intrinsically small differences in RNA-sequencing data, such as those between molecular subtypes of diseases in clinical samples. However, the variability in 'absolute' expression profiles measured across different batches, methods or platforms limits the use of conventional RNA reference datasets. We recently proposed a ratio-based method for constructing reference datasets. The ratio for a gene is defined as the quotient of its normalized expression levels in two sample groups; it yields more reliable values than the 'absolute' values obtained across diverse transcriptomic technologies and batches. Our gene ratios have been used for the successful generation of omics-wide reference datasets. Here, we describe a step-by-step process for establishing RNA reference materials and reference datasets, covering three stages: (1) reference materials, including material preparation, homogeneity testing and stability testing; (2) ratio-based reference datasets, including characterization, uncertainty estimation and orthogonal validation; and (3) applications, including definition of performance metrics, proficiency testing, and diagnosis and correction of batch effects. This approach was used to establish the Quartet RNA reference materials and reference datasets (chinese-quartet.org), which have been approved as the first suite of nationally certified RNA reference materials by China's State Administration for Market Regulation. The protocol can be used to establish and apply reference materials to improve RNA-sequencing data quality in diverse clinical settings. The procedure can be completed in 2 d and requires expertise in molecular biology and bioinformatics.
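The ratio-based idea in the abstract can be illustrated with a minimal Python sketch. It is not the protocol's implementation. The function name `log2_ratios`, the log2 transform, the averaging over replicates, the pseudocount and the example values are all assumptions made for illustration. The sketch shows that a batch-wide multiplicative bias changes the 'absolute' values but largely cancels in the per-gene ratio between two sample groups measured in the same batch.

```python
import math
from statistics import mean


def log2_ratios(test, reference, pseudocount=0.01):
    """Return per-gene log2(test / reference) from replicate expression values.

    `test` and `reference` map gene IDs to lists of normalized expression
    values for replicates of two sample groups measured in ONE batch.
    The pseudocount (an illustrative choice) avoids division by zero.
    """
    ratios = {}
    for gene in sorted(test.keys() & reference.keys()):
        t = mean(test[gene]) + pseudocount
        r = mean(reference[gene]) + pseudocount
        ratios[gene] = math.log2(t / r)
    return ratios


# Hypothetical data: batch 2 measures every value 3x higher than batch 1.
batch1 = log2_ratios({"GENE1": [10.0, 12.0]}, {"GENE1": [5.0, 6.0]})
batch2 = log2_ratios({"GENE1": [30.0, 36.0]}, {"GENE1": [15.0, 18.0]})

# Absolute values differ threefold, yet the two ratios nearly coincide.
print(batch1["GENE1"], batch2["GENE1"])
```

Computing the ratio within each batch before comparing across batches is the design choice that matters here: any scaling shared by both sample groups divides out, so only the small pseudocount keeps the two results from being identical.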
About the journal:
Nature Protocols focuses on publishing protocols used to address significant biological and biomedical science research questions, including methods grounded in physics and chemistry with practical applications to biological problems. The journal caters to a primary audience of research scientists and, as such, exclusively publishes protocols with research applications. Protocols primarily aimed at influencing patient management and treatment decisions are not featured.
The specific techniques covered encompass a wide range, including but not limited to: Biochemistry, Cell biology, Cell culture, Chemical modification, Computational biology, Developmental biology, Epigenomics, Genetic analysis, Genetic modification, Genomics, Imaging, Immunology, Isolation, purification, and separation, Lipidomics, Metabolomics, Microbiology, Model organisms, Nanotechnology, Neuroscience, Nucleic-acid-based molecular biology, Pharmacology, Plant biology, Protein analysis, Proteomics, Spectroscopy, Structural biology, Synthetic chemistry, Tissue culture, Toxicology, and Virology.