S. Taylor Head, Qile Dai, Joellen Schildkraut, David J. Cutler, Jingjing Yang, Michael P. Epstein
{"title":"CADET: Enhanced transcriptome-wide association analyses in admixed samples using eQTL summary data","authors":"S. Taylor Head, Qile Dai, Joellen Schildkraut, David J. Cutler, Jingjing Yang, Michael P. Epstein","doi":"10.1016/j.ajhg.2025.05.010","DOIUrl":null,"url":null,"abstract":"A transcriptome-wide association study (TWAS) is a popular statistical method for identifying genes whose genetically regulated expression (GReX) component is associated with a trait of interest. Most TWAS approaches fundamentally assume that the training dataset (used to fit the gene expression prediction model) and target genome-wide association study (GWAS) dataset are from the same ancestrally homogeneous population. If this assumption is violated, studies have shown a marked negative impact on expression prediction accuracy as well as reduced power of the downstream gene-trait association test. These issues pose a particular problem for admixed individuals whose genomes represent a mosaic of multiple continental ancestral segments. To resolve these issues, we present CADET, which enables powerful TWAS of admixed cohorts leveraging the local-ancestry (LA) information of the cohort along with summary-level expression quantitative trait locus (eQTL) data from reference panels of different ancestral groups. CADET combines multiple polygenic risk score models based on the summary-level eQTL reference data to predict LA-aware GReX components in admixed target samples. Using simulated data, we compare the imputation accuracy, power, and type I error rate of our proposed LA-aware approach to LA-unaware methods for performing TWASs. We show that CADET performs optimally in nearly all settings regardless of whether the genetic architecture of gene expression is dependent or independent of ancestry. We further illustrate CADET by performing a TWAS of 29 common blood biochemistry phenotypes within an admixed cohort from the UK Biobank and identify 18 hits unique to our LA-aware strategy, with the majority of hits supported by existing GWAS findings.","PeriodicalId":7659,"journal":{"name":"American journal of human genetics","volume":"11 1","pages":""},"PeriodicalIF":8.1000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of human genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.ajhg.2025.05.010","RegionNum":1,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
A transcriptome-wide association study (TWAS) is a popular statistical method for identifying genes whose genetically regulated expression (GReX) component is associated with a trait of interest. Most TWAS approaches fundamentally assume that the training dataset (used to fit the gene expression prediction model) and target genome-wide association study (GWAS) dataset are from the same ancestrally homogeneous population. If this assumption is violated, studies have shown a marked negative impact on expression prediction accuracy as well as reduced power of the downstream gene-trait association test. These issues pose a particular problem for admixed individuals whose genomes represent a mosaic of multiple continental ancestral segments. To resolve these issues, we present CADET, which enables powerful TWAS of admixed cohorts leveraging the local-ancestry (LA) information of the cohort along with summary-level expression quantitative trait locus (eQTL) data from reference panels of different ancestral groups. CADET combines multiple polygenic risk score models based on the summary-level eQTL reference data to predict LA-aware GReX components in admixed target samples. Using simulated data, we compare the imputation accuracy, power, and type I error rate of our proposed LA-aware approach to LA-unaware methods for performing TWASs. We show that CADET performs optimally in nearly all settings regardless of whether the genetic architecture of gene expression is dependent or independent of ancestry. We further illustrate CADET by performing a TWAS of 29 common blood biochemistry phenotypes within an admixed cohort from the UK Biobank and identify 18 hits unique to our LA-aware strategy, with the majority of hits supported by existing GWAS findings.
期刊介绍:
The American Journal of Human Genetics (AJHG) is a monthly journal published by Cell Press, chosen by The American Society of Human Genetics (ASHG) as its premier publication starting from January 2008. AJHG represents Cell Press's first society-owned journal, and both ASHG and Cell Press anticipate significant synergies between AJHG content and that of other Cell Press titles.