Roni Haas, Michael P Margolis, Angela Wei, Takafumi N Yamaguchi, Jeffrey Feng, Thai Tran, Veronica Tozzo, Katelyn J Queen, Mohammed Faizal Eeman Mootor, Vishakha Patil, Michael E Broudy, Paul Tung, Shafiul Alam, Danielle B Martinez, Yash Patel, Nicole Zeltser, Rupert Hugh-White, Jaron Arbet, Christa Caggiano, Ruhollah Shemirani, Mao Tian, Prapti Thapaliya, Lora Eloyan, Lawrence O Chen, Maryam Ariannejad, Clara Lajonchere, Bogdan Pasaniuc, Alex Bui, Valerie A Arboleda, Timothy S Chang, Noah Zaitlen, Paul T Spellman, Paul C Boutros, Daniel H Geschwind
{"title":"Diverse Genomes, Shared Health: Insights from a Health System Biobank.","authors":"Roni Haas, Michael P Margolis, Angela Wei, Takafumi N Yamaguchi, Jeffrey Feng, Thai Tran, Veronica Tozzo, Katelyn J Queen, Mohammed Faizal Eeman Mootor, Vishakha Patil, Michael E Broudy, Paul Tung, Shafiul Alam, Danielle B Martinez, Yash Patel, Nicole Zeltser, Rupert Hugh-White, Jaron Arbet, Christa Caggiano, Ruhollah Shemirani, Mao Tian, Prapti Thapaliya, Lora Eloyan, Lawrence O Chen, Maryam Ariannejad, Clara Lajonchere, Bogdan Pasaniuc, Alex Bui, Valerie A Arboleda, Timothy S Chang, Noah Zaitlen, Paul T Spellman, Paul C Boutros, Daniel H Geschwind","doi":"10.1101/2025.06.11.25329386","DOIUrl":null,"url":null,"abstract":"<p><p>Coupling genetic profiling with electronic health records from hospital biobanks is a foundational resource for precision medicine. However, lack of ancestral heterogeneity limits discovery and generalizability. We leveraged the UCLA ATLAS Community Health Initiative, a diverse biobank with >35% non-European participants in a single health system, to inform disease prevalence and genetic risk across five continental and 36 fine-scale ancestry groups. Analyzing clinical and genetic data for 93,937 individuals, 61,797 with whole-exome sequencing (WES), we identified novel associations between genetic variants and phenotypes, including <i>STARD7</i> with asthma risk in Mexican Americans and <i>FN3K</i> with intestinal disaccharidase deficiency across Europeans and Admixed Americans. Top decile polygenic scores (PGS) predicted patient status for many common diseases (40% of patients with Type 1 diabetes); an effect markedly diminished in non-Europeans. Exploring the distribution of ACMG ClinGen rare variants across populations demonstrated European bias in curated clinical variants. Mitigating this bias using computationally predicted deleterious variants, we identified new gene-disease associations, including <i>EXOC1L</i> and blood glucose level in East Asians. We identified <i>PTPRU</i> as a modulator of semaglutide's effects on weight loss, and additionally found variability across ancestries and a relationship with type-2-diabetes PGS. We provide an interactive web portal for accessing cross-ancestry associations at atlas-phewas.mednet.ucla.edu. Collectively, our findings support the value of ancestral diversity in advancing precision health across a broad spectrum of populations.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204455/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2025.06.11.25329386","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Coupling genetic profiling with electronic health records from hospital biobanks is a foundational resource for precision medicine. However, lack of ancestral heterogeneity limits discovery and generalizability. We leveraged the UCLA ATLAS Community Health Initiative, a diverse biobank with >35% non-European participants in a single health system, to inform disease prevalence and genetic risk across five continental and 36 fine-scale ancestry groups. Analyzing clinical and genetic data for 93,937 individuals, 61,797 with whole-exome sequencing (WES), we identified novel associations between genetic variants and phenotypes, including STARD7 with asthma risk in Mexican Americans and FN3K with intestinal disaccharidase deficiency across Europeans and Admixed Americans. Top decile polygenic scores (PGS) predicted patient status for many common diseases (40% of patients with Type 1 diabetes); an effect markedly diminished in non-Europeans. Exploring the distribution of ACMG ClinGen rare variants across populations demonstrated European bias in curated clinical variants. Mitigating this bias using computationally predicted deleterious variants, we identified new gene-disease associations, including EXOC1L and blood glucose level in East Asians. We identified PTPRU as a modulator of semaglutide's effects on weight loss, and additionally found variability across ancestries and a relationship with type-2-diabetes PGS. We provide an interactive web portal for accessing cross-ancestry associations at atlas-phewas.mednet.ucla.edu. Collectively, our findings support the value of ancestral diversity in advancing precision health across a broad spectrum of populations.