Manuel Corpas, Mkpouto Pius, Marie Poburennaya, Heinner Guio, Miriam Dwek, Shivashankar Nagaraj, Catalina Lopez-Correa, Alice Popejoy, Segun Fatumo
{"title":"Bridging genomics' greatest challenge: The diversity gap.","authors":"Manuel Corpas, Mkpouto Pius, Marie Poburennaya, Heinner Guio, Miriam Dwek, Shivashankar Nagaraj, Catalina Lopez-Correa, Alice Popejoy, Segun Fatumo","doi":"10.1016/j.xgen.2024.100724","DOIUrl":null,"url":null,"abstract":"<p><p>Achieving diverse representation in biomedical data is critical for healthcare equity. Failure to do so perpetuates health disparities and exacerbates biases that may harm patients with underrepresented ancestral backgrounds. We present a quantitative assessment of representation in datasets used across human genomics, including genome-wide association studies (GWASs), pharmacogenomics, clinical trials, and direct-to-consumer (DTC) genetic testing. We suggest that relative proportions of ancestries represented in datasets, compared to the global census population, provide insufficient representation of global ancestral genetic diversity. Some populations have greater proportional representation in data relative to their population size and the genomic diversity present in their ancestral haplotypes. As insights from genomics become increasingly integrated into evidence-based medicine, strategic inclusion and effective mechanisms to ensure representation of global genomic diversity in datasets are imperative.</p>","PeriodicalId":72539,"journal":{"name":"Cell genomics","volume":" ","pages":"100724"},"PeriodicalIF":11.1000,"publicationDate":"2024-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cell genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xgen.2024.100724","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CELL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Achieving diverse representation in biomedical data is critical for healthcare equity. Failure to do so perpetuates health disparities and exacerbates biases that may harm patients with underrepresented ancestral backgrounds. We present a quantitative assessment of representation in datasets used across human genomics, including genome-wide association studies (GWASs), pharmacogenomics, clinical trials, and direct-to-consumer (DTC) genetic testing. We suggest that relative proportions of ancestries represented in datasets, compared to the global census population, provide insufficient representation of global ancestral genetic diversity. Some populations have greater proportional representation in data relative to their population size and the genomic diversity present in their ancestral haplotypes. As insights from genomics become increasingly integrated into evidence-based medicine, strategic inclusion and effective mechanisms to ensure representation of global genomic diversity in datasets are imperative.