Integrating Proteomics and GWAS to Identify Key Tissues and Genes Underlying Human Complex Diseases.

IF 3.6 3区 生物学 Q1 BIOLOGY
Chao Xue, Miao Zhou
{"title":"Integrating Proteomics and GWAS to Identify Key Tissues and Genes Underlying Human Complex Diseases.","authors":"Chao Xue, Miao Zhou","doi":"10.3390/biology14050554","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The tissues of origin and molecular mechanisms underlying human complex diseases remain incompletely understood. Previous studies have leveraged transcriptomic data to interpret genome-wide association studies (GWASs) for identifying disease-relevant tissues and fine-mapping causal genes. However, according to the central dogma, proteins more directly reflect cellular molecular activities than RNA. Therefore, in this study, we integrated proteomic data with GWAS to identify disease-associated tissues and genes.</p><p><strong>Methods: </strong>We compiled proteomic and paired transcriptomic data for 12,229 genes across 32 human tissues from the GTEx project. Using three tissue inference approaches-S-LDSC, MAGMA, and DESE-we analyzed GWAS data for six representative complex diseases (bipolar disorder, schizophrenia, coronary artery disease, Crohn's disease, rheumatoid arthritis, and type 2 diabetes), with an average sample size of 260 K. We systematically compared disease-associated tissues and genes identified using proteomic versus transcriptomic data.</p><p><strong>Results: </strong>Tissue-specific protein abundance showed a moderate correlation with RNA expression (mean correlation coefficient = 0.46, 95% CI: 0.42-0.49). Proteomic data accurately identified disease-relevant tissues, such as the association between brain regions and schizophrenia and between coronary arteries and coronary artery disease. Compared to GWAS-based gene association estimates alone, incorporating proteomic data significantly improved gene association detection (AUC difference test, <i>p</i> = 0.0028). Furthermore, proteomic data revealed unique disease-associated genes that were not identified using transcriptomic data, such as the association between bipolar disorder and <i>CREB1</i>.</p><p><strong>Conclusions: </strong>Integrating proteomic data enables accurate identification of disease-associated tissues and provides irreplaceable advantages in fine-mapping genes for complex diseases.</p>","PeriodicalId":48624,"journal":{"name":"Biology-Basel","volume":"14 5","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12109507/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biology-Basel","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3390/biology14050554","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: The tissues of origin and molecular mechanisms underlying human complex diseases remain incompletely understood. Previous studies have leveraged transcriptomic data to interpret genome-wide association studies (GWASs) for identifying disease-relevant tissues and fine-mapping causal genes. However, according to the central dogma, proteins more directly reflect cellular molecular activities than RNA. Therefore, in this study, we integrated proteomic data with GWAS to identify disease-associated tissues and genes.

Methods: We compiled proteomic and paired transcriptomic data for 12,229 genes across 32 human tissues from the GTEx project. Using three tissue inference approaches-S-LDSC, MAGMA, and DESE-we analyzed GWAS data for six representative complex diseases (bipolar disorder, schizophrenia, coronary artery disease, Crohn's disease, rheumatoid arthritis, and type 2 diabetes), with an average sample size of 260 K. We systematically compared disease-associated tissues and genes identified using proteomic versus transcriptomic data.

Results: Tissue-specific protein abundance showed a moderate correlation with RNA expression (mean correlation coefficient = 0.46, 95% CI: 0.42-0.49). Proteomic data accurately identified disease-relevant tissues, such as the association between brain regions and schizophrenia and between coronary arteries and coronary artery disease. Compared to GWAS-based gene association estimates alone, incorporating proteomic data significantly improved gene association detection (AUC difference test, p = 0.0028). Furthermore, proteomic data revealed unique disease-associated genes that were not identified using transcriptomic data, such as the association between bipolar disorder and CREB1.

Conclusions: Integrating proteomic data enables accurate identification of disease-associated tissues and provides irreplaceable advantages in fine-mapping genes for complex diseases.

整合蛋白质组学和GWAS鉴定人类复杂疾病的关键组织和基因。
背景:人类复杂疾病的起源组织和分子机制尚不完全清楚。先前的研究利用转录组学数据来解释全基因组关联研究(GWASs),以识别疾病相关组织和精细定位因果基因。然而,根据中心法则,蛋白质比RNA更直接地反映细胞分子活动。因此,在本研究中,我们将蛋白质组学数据与GWAS相结合,以鉴定疾病相关组织和基因。方法:我们收集了来自GTEx项目的32个人体组织的12229个基因的蛋白质组学和配对转录组学数据。使用三种组织推断方法- s- ldsc、MAGMA和desi -我们分析了六种代表性复杂疾病(双相情感障碍、精神分裂症、冠状动脉疾病、克罗恩病、类风湿性关节炎和2型糖尿病)的GWAS数据,平均样本量为260 K。我们系统地比较了用蛋白质组学和转录组学数据鉴定的疾病相关组织和基因。结果:组织特异性蛋白丰度与RNA表达呈中等相关性(平均相关系数= 0.46,95% CI: 0.42-0.49)。蛋白质组学数据准确地确定了与疾病相关的组织,例如大脑区域与精神分裂症之间的关联,以及冠状动脉与冠状动脉疾病之间的关联。与单独基于gwas的基因关联估计相比,结合蛋白质组学数据显著提高了基因关联检测(AUC差异检验,p = 0.0028)。此外,蛋白质组学数据揭示了转录组学数据未确定的独特疾病相关基因,例如双相情感障碍与CREB1之间的关联。结论:整合蛋白质组学数据可以准确识别疾病相关组织,在复杂疾病的基因精细定位方面具有不可替代的优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biology-Basel
Biology-Basel Biological Science-Biological Science
CiteScore
5.70
自引率
4.80%
发文量
1618
审稿时长
11 weeks
期刊介绍: Biology (ISSN 2079-7737) is an international, peer-reviewed, quick-refereeing open access journal of Biological Science published by MDPI online. It publishes reviews, research papers and communications in all areas of biology and at the interface of related disciplines. Our aim is to encourage scientists to publish their experimental and theoretical results in as much detail as possible. There is no restriction on the length of the papers. The full experimental details must be provided so that the results can be reproduced. Electronic files regarding the full details of the experimental procedure, if unable to be published in a normal way, can be deposited as supplementary material.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信