Yichao Zhou, Temidayo Adeluwa, Lisha Zhu, Sofia Salazar-Magaña, Sarah Sumner, Hyunki Kim, Saideep Gona, Festus Nyasimi, Rohit Kulkarni, Joseph E Powell, Ravi Madduri, Boxiang Liu, Mengjie Chen, Hae Kyung Im
{"title":"scPrediXcan integrates deep learning methods and single-cell data into a cell-type-specific transcriptome-wide association study framework.","authors":"Yichao Zhou, Temidayo Adeluwa, Lisha Zhu, Sofia Salazar-Magaña, Sarah Sumner, Hyunki Kim, Saideep Gona, Festus Nyasimi, Rohit Kulkarni, Joseph E Powell, Ravi Madduri, Boxiang Liu, Mengjie Chen, Hae Kyung Im","doi":"10.1016/j.xgen.2025.100875","DOIUrl":null,"url":null,"abstract":"<p><p>Transcriptome-wide association studies (TWASs) help identify disease-causing genes but often fail to pinpoint disease mechanisms at the cellular level because of the limited sample sizes and sparsity of cell-type-specific expression data. Here, we propose scPrediXcan, which integrates state-of-the-art deep learning approaches that predict epigenetic features from DNA sequences with the canonical TWAS framework. Our prediction approach, ctPred, predicts cell-type-specific expression with high accuracy and captures complex gene-regulatory grammar that linear models overlook. Applied to type 2 diabetes (T2D) and systemic lupus erythematosus (SLE), scPrediXcan outperformed the canonical TWAS framework by identifying more candidate causal genes, explaining more genome-wide association study (GWAS) loci and providing insights into the cellular specificity of TWAS hits. Overall, our results demonstrate that scPrediXcan represents a significant advance, promising to deepen our understanding of the cellular mechanisms underlying complex diseases.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":"5 5","pages":"100875"},"PeriodicalIF":11.1000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12143334/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xgen.2025.100875","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Transcriptome-wide association studies (TWASs) help identify disease-causing genes but often fail to pinpoint disease mechanisms at the cellular level because of the limited sample sizes and sparsity of cell-type-specific expression data. Here, we propose scPrediXcan, which integrates state-of-the-art deep learning approaches that predict epigenetic features from DNA sequences with the canonical TWAS framework. Our prediction approach, ctPred, predicts cell-type-specific expression with high accuracy and captures complex gene-regulatory grammar that linear models overlook. Applied to type 2 diabetes (T2D) and systemic lupus erythematosus (SLE), scPrediXcan outperformed the canonical TWAS framework by identifying more candidate causal genes, explaining more genome-wide association study (GWAS) loci and providing insights into the cellular specificity of TWAS hits. Overall, our results demonstrate that scPrediXcan represents a significant advance, promising to deepen our understanding of the cellular mechanisms underlying complex diseases.