基于子集的跨组织全转录组关联研究方法提高了研究效率和可解释性

IF 3.3 Q2 GENETICS & HEREDITY
HGG Advances Pub Date : 2024-04-11 Epub Date: 2024-03-16 DOI:10.1016/j.xhgg.2024.100283
Xinyu Guo, Nilanjan Chatterjee, Diptavo Dutta
{"title":"基于子集的跨组织全转录组关联研究方法提高了研究效率和可解释性","authors":"Xinyu Guo, Nilanjan Chatterjee, Diptavo Dutta","doi":"10.1016/j.xhgg.2024.100283","DOIUrl":null,"url":null,"abstract":"<p><p>Integrating results from genome-wide association studies (GWASs) and studies of molecular phenotypes such as gene expressions can improve our understanding of the biological functions of trait-associated variants and can help prioritize candidate genes for downstream analysis. Using reference expression quantitative trait locus (eQTL) studies, several methods have been proposed to identify gene-trait associations, primarily based on gene expression imputation. To increase the statistical power by leveraging substantial eQTL sharing across tissues, meta-analysis methods aggregating such gene-based test results across multiple tissues or contexts have been developed as well. However, most existing meta-analysis methods have limited power to identify associations when the gene has weaker associations in only a few tissues and cannot identify the subset of tissues in which the gene is \"activated.\" For this, we developed a cross-tissue subset-based transcriptome-wide association study (CSTWAS) meta-analysis method that improves power under such scenarios and can extract the set of potentially associated tissues. To improve applicability, CSTWAS uses only GWAS summary statistics and pre-computed correlation matrices to identify a subset of tissues that have the maximal evidence of gene-trait association. Through numerical simulations, we found that CSTWAS can maintain a well-calibrated type-I error rate, improves power especially when there is a small number of associated tissues for a gene-trait association, and identifies an accurate associated tissue set. By analyzing GWAS summary statistics of three complex traits and diseases, we demonstrate that CSTWAS could identify biological meaningful signals while providing an interpretation of disease etiology by extracting a set of potentially associated tissues.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":null,"pages":null},"PeriodicalIF":3.3000,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10999697/pdf/","citationCount":"0","resultStr":"{\"title\":\"Subset-based method for cross-tissue transcriptome-wide association studies improves power and interpretability.\",\"authors\":\"Xinyu Guo, Nilanjan Chatterjee, Diptavo Dutta\",\"doi\":\"10.1016/j.xhgg.2024.100283\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Integrating results from genome-wide association studies (GWASs) and studies of molecular phenotypes such as gene expressions can improve our understanding of the biological functions of trait-associated variants and can help prioritize candidate genes for downstream analysis. Using reference expression quantitative trait locus (eQTL) studies, several methods have been proposed to identify gene-trait associations, primarily based on gene expression imputation. To increase the statistical power by leveraging substantial eQTL sharing across tissues, meta-analysis methods aggregating such gene-based test results across multiple tissues or contexts have been developed as well. However, most existing meta-analysis methods have limited power to identify associations when the gene has weaker associations in only a few tissues and cannot identify the subset of tissues in which the gene is \\\"activated.\\\" For this, we developed a cross-tissue subset-based transcriptome-wide association study (CSTWAS) meta-analysis method that improves power under such scenarios and can extract the set of potentially associated tissues. To improve applicability, CSTWAS uses only GWAS summary statistics and pre-computed correlation matrices to identify a subset of tissues that have the maximal evidence of gene-trait association. Through numerical simulations, we found that CSTWAS can maintain a well-calibrated type-I error rate, improves power especially when there is a small number of associated tissues for a gene-trait association, and identifies an accurate associated tissue set. By analyzing GWAS summary statistics of three complex traits and diseases, we demonstrate that CSTWAS could identify biological meaningful signals while providing an interpretation of disease etiology by extracting a set of potentially associated tissues.</p>\",\"PeriodicalId\":34530,\"journal\":{\"name\":\"HGG Advances\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2024-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10999697/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"HGG Advances\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1016/j.xhgg.2024.100283\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/3/16 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"HGG Advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xhgg.2024.100283","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/3/16 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

整合全基因组关联研究(GWAS)和基因表达等分子表型研究的结果,可以提高我们对性状相关变异的生物学功能的理解,并有助于优先选择候选基因进行下游分析。利用参考表达量性状位点(eQTL)研究,已经提出了几种方法来确定基因与性状的关联,这些方法主要基于基因表达估算。为了利用不同组织间大量的 eQTL 共享来提高统计能力,人们还开发了荟萃分析方法,将这种基于基因的测试结果汇总到多个组织或环境中。然而,大多数现有的荟萃分析方法在基因仅在少数组织中具有较弱关联时,识别关联的能力有限,而且无法识别基因在哪些组织中被 "激活"。为此,我们开发了一种基于跨组织子集的荟萃分析(CSTWAS)方法,它能提高在这种情况下的分析能力,并能提取潜在相关组织的集合。为了提高适用性,CSTWAS 只使用 GWAS 统计摘要和预先计算的相关矩阵来识别具有最大基因-性状关联证据的组织子集。通过数值模拟,我们发现 CSTWAS 可以保持良好的校准 I 型错误率,尤其是当基因-性状关联的相关组织数量较少时,它可以提高功率,并识别出准确的相关组织集。通过分析三种复杂性状和疾病的 GWAS 统计摘要,我们证明了 CSTWAS 可以识别有生物学意义的信号,同时通过提取潜在相关组织集来解释疾病的病因学。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Subset-based method for cross-tissue transcriptome-wide association studies improves power and interpretability.

Integrating results from genome-wide association studies (GWASs) and studies of molecular phenotypes such as gene expressions can improve our understanding of the biological functions of trait-associated variants and can help prioritize candidate genes for downstream analysis. Using reference expression quantitative trait locus (eQTL) studies, several methods have been proposed to identify gene-trait associations, primarily based on gene expression imputation. To increase the statistical power by leveraging substantial eQTL sharing across tissues, meta-analysis methods aggregating such gene-based test results across multiple tissues or contexts have been developed as well. However, most existing meta-analysis methods have limited power to identify associations when the gene has weaker associations in only a few tissues and cannot identify the subset of tissues in which the gene is "activated." For this, we developed a cross-tissue subset-based transcriptome-wide association study (CSTWAS) meta-analysis method that improves power under such scenarios and can extract the set of potentially associated tissues. To improve applicability, CSTWAS uses only GWAS summary statistics and pre-computed correlation matrices to identify a subset of tissues that have the maximal evidence of gene-trait association. Through numerical simulations, we found that CSTWAS can maintain a well-calibrated type-I error rate, improves power especially when there is a small number of associated tissues for a gene-trait association, and identifies an accurate associated tissue set. By analyzing GWAS summary statistics of three complex traits and diseases, we demonstrate that CSTWAS could identify biological meaningful signals while providing an interpretation of disease etiology by extracting a set of potentially associated tissues.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
HGG Advances
HGG Advances Biochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
4.30
自引率
4.50%
发文量
69
审稿时长
14 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信