A multi-modal framework improves prediction of tissue-specific gene expression from a surrogate tissue.
Yue Xu, Chunfeng He, Jiayao Fan, Yuan Zhou, Chunxiao Cheng, Ran Meng, Ya Cui, Wei Li, Eric R Gamazon, Dan Zhou
EBioMedicine (impact factor 9.7; JCR Q1, Medicine, Research & Experimental). Published 2024-09-01 (Epub 2024-08-23). DOI: https://doi.org/10.1016/j.ebiom.2024.105305
Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11388271/pdf/
Citations: 0
Abstract
Background: Tissue-specific analysis of the transcriptome is critical to elucidating the molecular basis of complex traits, but central tissues are often not accessible. We propose a methodology, Multi-mOdal-based framework to bridge the Transcriptome between PEripheral and Central tissues (MOTPEC).
Methods: Multi-modal regulatory elements in peripheral blood are incorporated as features for gene expression prediction in 48 central tissues. To demonstrate its utility, we apply MOTPEC to the identification of BMI-associated genes and compare the tissue-specific results with those derived directly from surrogate blood.
Findings: MOTPEC models demonstrate superior performance compared with both baseline models in blood and existing models across the 48 central tissues. We identify a set of BMI-associated genes using the central tissue MOTPEC-predicted transcriptome data. The MOTPEC-based differential gene expression (DGE) analysis of BMI in the central tissues (including brain caudate basal ganglia and visceral omentum adipose tissue) identifies 378 genes overlapping the results from a TWAS of BMI, while only 162 overlapping genes are identified using gene expression in blood. Cellular perturbation analysis further supports the utility of MOTPEC for identifying trait-associated gene sets and narrowing the effect size divergence between peripheral blood and central tissues.
Interpretation: The MOTPEC framework improves the gene expression prediction accuracy for central tissues and enhances the identification of tissue-specific trait-associated genes.
Funding: This research is supported by the National Natural Science Foundation of China 82204118 (D.Z.), the seed funding of the Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province (2020E10004), the National Institutes of Health (NIH) Genomic Innovator Award R35HG010718 (E.R.G.), NIH/NHGRI R01HG011138 (E.R.G.), NIH/NIA R56AG068026 (E.R.G.), NIH Office of the Director U24OD035523 (E.R.G.), and NIH/NIGMS R01GM140287 (E.R.G.).
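As a rough illustration of the idea in the Methods paragraph above (predicting a gene's expression in a central tissue from several kinds of blood-derived features rather than from blood expression alone), the following is a minimal sketch on synthetic data. The abstract does not specify MOTPEC's model or feature set, so everything here is an assumption made for illustration: ridge regression stands in for the actual predictor, the five "other modality" features are hypothetical, and the simulated effect sizes are arbitrary.

```python
import numpy as np


def fit_ridge(X, y, lam=1.0):
    """Closed-form ridge regression with an unpenalized intercept."""
    x_mean = X.mean(axis=0)
    y_mean = y.mean()
    Xc = X - x_mean
    A = Xc.T @ Xc + lam * np.eye(X.shape[1])
    beta = np.linalg.solve(A, Xc.T @ (y - y_mean))
    intercept = y_mean - x_mean @ beta
    return beta, intercept


def predict(X, beta, intercept):
    return X @ beta + intercept


def pearson_r(a, b):
    return float(np.corrcoef(a, b)[0, 1])


# Synthetic data for ONE gene (hypothetical; not from the paper).
rng = np.random.default_rng(0)
n = 400
blood_expr = rng.normal(size=(n, 1))   # the gene's expression in blood
other = rng.normal(size=(n, 5))        # hypothetical extra blood-derived features
w = np.array([0.8, -0.6, 0.5, 0.0, 0.0])
central = 0.3 * blood_expr[:, 0] + other @ w + rng.normal(scale=0.5, size=n)

train, test = slice(0, 300), slice(300, n)
X_multi = np.hstack([blood_expr, other])

# Baseline: blood expression of the gene as the only predictor.
beta_b, b0_b = fit_ridge(blood_expr[train], central[train])
r_base = pearson_r(predict(blood_expr[test], beta_b, b0_b), central[test])

# Multi-feature model: blood expression plus the additional features.
beta_m, b0_m = fit_ridge(X_multi[train], central[train])
r_multi = pearson_r(predict(X_multi[test], beta_m, b0_m), central[test])

print(f"held-out Pearson r, baseline: {r_base:.3f}")
print(f"held-out Pearson r, multi-feature: {r_multi:.3f}")
```

In this toy setting the multi-feature model wins by construction, because the simulated central-tissue expression depends on the extra features. The sketch only shows the shape of the comparison (held-out correlation between predicted and observed expression, baseline versus multi-feature) and says nothing about the magnitude of improvement reported in the paper.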
EBioMedicine | Biochemistry, Genetics and Molecular Biology: General Biochemistry, Genetics and Molecular Biology
CiteScore: 17.70
Self-citation rate: 0.90%
Articles published: 579
Review time: 5 weeks
About the journal:
eBioMedicine is a comprehensive biomedical research journal that covers a wide range of studies that are relevant to human health. Our focus is on original research that explores the fundamental factors influencing human health and disease, including the discovery of new therapeutic targets and treatments, the identification of biomarkers and diagnostic tools, and the investigation and modification of disease pathways and mechanisms. We welcome studies from any biomedical discipline that contribute to our understanding of disease and aim to improve human health.