样本和基因比较证实分子通路层面的 RNA 和蛋白质表达数据具有更高的相似性,并为高通量表达数据库的数据质量检查提供了一种方法

IF 2.3 4区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Mikhail Raevskiy, Maxim Sorokin, Aleksandra Emelianova, Galina Zakharova, Elena Poddubskaya, Marianna Zolotovskaia, Anton Buzdin
{"title":"样本和基因比较证实分子通路层面的 RNA 和蛋白质表达数据具有更高的相似性,并为高通量表达数据库的数据质量检查提供了一种方法","authors":"Mikhail Raevskiy,&nbsp;Maxim Sorokin,&nbsp;Aleksandra Emelianova,&nbsp;Galina Zakharova,&nbsp;Elena Poddubskaya,&nbsp;Marianna Zolotovskaia,&nbsp;Anton Buzdin","doi":"10.1134/S0006297924040126","DOIUrl":null,"url":null,"abstract":"<p>Identification of genes and molecular pathways with congruent profiles in the proteomic and transcriptomic datasets may result in the discovery of promising transcriptomic biomarkers that would be more relevant to phenotypic changes. In this study, we conducted comparative analysis of 943 paired RNA and proteomic profiles obtained for the same samples of seven human cancer types from The Cancer Genome Atlas (TCGA) and NCI Clinical Proteomic Tumor Analysis Consortium (CPTAC) [two major open human cancer proteomic and transcriptomic databases] that included 15,112 protein-coding genes and 1611 molecular pathways. Overall, our findings demonstrated statistically significant improvement of the congruence between RNA and proteomic profiles when performing analysis at the level of molecular pathways rather than at the level of individual gene products. Transition to the molecular pathway level of data analysis increased the correlation to 0.19-0.57 (Pearson) and 0.14-057 (Spearman), or 2-3-fold for some cancer types. Evaluating the gain of the correlation upon transition to the data analysis the pathway level can be used to refine the omics data by identifying outliers that can be excluded from the comparison of RNA and proteomic profiles. We suggest using sample- and gene-wise correlations for individual genes and molecular pathways as a measure of quality of RNA/protein paired molecular data. We also provide a database of human genes, molecular pathways, and samples related to the correlation between RNA and protein products to facilitate an exploration of new cancer transcriptomic biomarkers and molecular mechanisms at different levels of human gene expression.</p>","PeriodicalId":483,"journal":{"name":"Biochemistry (Moscow)","volume":null,"pages":null},"PeriodicalIF":2.3000,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sample-Wise and Gene-Wise Comparisons Confirm a Greater Similarity of RNA and Protein Expression Data at the Level of Molecular Pathways and Suggest an Approach for the Data Quality Check in High-Throughput Expression Databases\",\"authors\":\"Mikhail Raevskiy,&nbsp;Maxim Sorokin,&nbsp;Aleksandra Emelianova,&nbsp;Galina Zakharova,&nbsp;Elena Poddubskaya,&nbsp;Marianna Zolotovskaia,&nbsp;Anton Buzdin\",\"doi\":\"10.1134/S0006297924040126\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Identification of genes and molecular pathways with congruent profiles in the proteomic and transcriptomic datasets may result in the discovery of promising transcriptomic biomarkers that would be more relevant to phenotypic changes. In this study, we conducted comparative analysis of 943 paired RNA and proteomic profiles obtained for the same samples of seven human cancer types from The Cancer Genome Atlas (TCGA) and NCI Clinical Proteomic Tumor Analysis Consortium (CPTAC) [two major open human cancer proteomic and transcriptomic databases] that included 15,112 protein-coding genes and 1611 molecular pathways. Overall, our findings demonstrated statistically significant improvement of the congruence between RNA and proteomic profiles when performing analysis at the level of molecular pathways rather than at the level of individual gene products. Transition to the molecular pathway level of data analysis increased the correlation to 0.19-0.57 (Pearson) and 0.14-057 (Spearman), or 2-3-fold for some cancer types. Evaluating the gain of the correlation upon transition to the data analysis the pathway level can be used to refine the omics data by identifying outliers that can be excluded from the comparison of RNA and proteomic profiles. We suggest using sample- and gene-wise correlations for individual genes and molecular pathways as a measure of quality of RNA/protein paired molecular data. We also provide a database of human genes, molecular pathways, and samples related to the correlation between RNA and protein products to facilitate an exploration of new cancer transcriptomic biomarkers and molecular mechanisms at different levels of human gene expression.</p>\",\"PeriodicalId\":483,\"journal\":{\"name\":\"Biochemistry (Moscow)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2024-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biochemistry (Moscow)\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://link.springer.com/article/10.1134/S0006297924040126\",\"RegionNum\":4,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biochemistry (Moscow)","FirstCategoryId":"99","ListUrlMain":"https://link.springer.com/article/10.1134/S0006297924040126","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

摘要鉴定蛋白质组和转录组数据集中具有一致特征的基因和分子通路可能有助于发现与表型变化更相关的转录组生物标记物。在这项研究中,我们对从癌症基因组图谱(TCGA)和NCI临床蛋白质组肿瘤分析联盟(CPTAC)[两个主要的开放式人类癌症蛋白质组和转录组数据库]中获得的943份相同人类癌症类型样本的配对RNA和蛋白质组图谱进行了比较分析,其中包括15112个蛋白编码基因和1611条分子通路。总体而言,我们的研究结果表明,在分子通路层面而非单个基因产物层面进行分析时,RNA 和蛋白质组图谱之间的一致性在统计学上有显著提高。转到分子通路层面进行数据分析后,相关性提高到 0.19-0.57 (Pearson)和 0.14-057(Spearman),即某些癌症类型的相关性提高了 2-3 倍。评估转换到通路层面数据分析后的相关性增益,可通过识别异常值来完善omics数据,从而将其从RNA和蛋白质组图谱的比较中排除。我们建议使用单个基因和分子通路的样本和基因相关性来衡量 RNA/ 蛋白质配对分子数据的质量。我们还提供了一个与 RNA 和蛋白质产物之间的相关性有关的人类基因、分子通路和样本数据库,以便于探索新的癌症转录组生物标志物和人类基因表达不同水平的分子机制。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Sample-Wise and Gene-Wise Comparisons Confirm a Greater Similarity of RNA and Protein Expression Data at the Level of Molecular Pathways and Suggest an Approach for the Data Quality Check in High-Throughput Expression Databases

Sample-Wise and Gene-Wise Comparisons Confirm a Greater Similarity of RNA and Protein Expression Data at the Level of Molecular Pathways and Suggest an Approach for the Data Quality Check in High-Throughput Expression Databases

Identification of genes and molecular pathways with congruent profiles in the proteomic and transcriptomic datasets may result in the discovery of promising transcriptomic biomarkers that would be more relevant to phenotypic changes. In this study, we conducted comparative analysis of 943 paired RNA and proteomic profiles obtained for the same samples of seven human cancer types from The Cancer Genome Atlas (TCGA) and NCI Clinical Proteomic Tumor Analysis Consortium (CPTAC) [two major open human cancer proteomic and transcriptomic databases] that included 15,112 protein-coding genes and 1611 molecular pathways. Overall, our findings demonstrated statistically significant improvement of the congruence between RNA and proteomic profiles when performing analysis at the level of molecular pathways rather than at the level of individual gene products. Transition to the molecular pathway level of data analysis increased the correlation to 0.19-0.57 (Pearson) and 0.14-057 (Spearman), or 2-3-fold for some cancer types. Evaluating the gain of the correlation upon transition to the data analysis the pathway level can be used to refine the omics data by identifying outliers that can be excluded from the comparison of RNA and proteomic profiles. We suggest using sample- and gene-wise correlations for individual genes and molecular pathways as a measure of quality of RNA/protein paired molecular data. We also provide a database of human genes, molecular pathways, and samples related to the correlation between RNA and protein products to facilitate an exploration of new cancer transcriptomic biomarkers and molecular mechanisms at different levels of human gene expression.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biochemistry (Moscow)
Biochemistry (Moscow) 生物-生化与分子生物学
CiteScore
4.70
自引率
3.60%
发文量
139
审稿时长
2 months
期刊介绍: Biochemistry (Moscow) is the journal that includes research papers in all fields of biochemistry as well as biochemical aspects of molecular biology, bioorganic chemistry, microbiology, immunology, physiology, and biomedical sciences. Coverage also extends to new experimental methods in biochemistry, theoretical contributions of biochemical importance, reviews of contemporary biochemical topics, and mini-reviews (News in Biochemistry).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信