Dennis Khodasevich, Nicole Gladish, Saher Daredia, Anne K Bozack, Hanyang Shen, Jamaji C Nwanaji-Enwerem, Belinda L Needham, David H Rehkopf, Andres Cardenas
{"title":"Influence of race, ethnicity, and sex on the performance of epigenetic predictors of phenotypic traits.","authors":"Dennis Khodasevich, Nicole Gladish, Saher Daredia, Anne K Bozack, Hanyang Shen, Jamaji C Nwanaji-Enwerem, Belinda L Needham, David H Rehkopf, Andres Cardenas","doi":"10.1186/s13148-025-01864-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>DNA methylation-based predictors of phenotypic traits including leukocyte proportions, smoking activity, biological aging, and circulating levels of plasma proteins are widely used as biomarkers in public health research. However, limited racial and ethnic diversity of research participants is an ongoing issue for epigenetics research, and the potential downstream impacts of limited diversity in training samples on the performance of epigenetic predictors remains poorly understood. We examined the performance of epigenetic predictors of chronological age (also known as epigenetic clocks), telomere length, cell proportions, and plasma proteins within a diverse sample of adult NHANES participants during the 1999-2000 and 2001-2002 survey cycles, both overall and stratified by self-reported race/ethnicity and sex. We utilized correlation coefficients and median absolute errors (MAE) to judge predictor performance, and bootstrapping and multivariate regression to assess the significance of differences between groups.</p><p><strong>Results: </strong>All epigenetic predictors were significantly associated with their corresponding phenotypic traits in the overall population, with particularly high correlations for the epigenetic clocks and cell proportion estimates. Several significant differences in performance were observed between racial/ethnic groups, particularly for the plasma protein predictors, with a reoccurring trend of lower correlation in Mexican American and non-Hispanic Black participants compared to non-Hispanic White participants. Sex-differences in performance for several predictors were also identified but were not as pronounced. Multivariate regression models indicated that disparities in epigenetic predictor performance persisted after accounting for overall differences in epigenetic predictions related to race/ethnicity and sex, as well as further adjustment for estimated cell proportions and SES variables.</p><p><strong>Conclusions: </strong>We found evidence for substantial disparities in epigenetic predictor performance, with each predictor exhibiting at least one significant difference in correlation or MAE related to race, ethnicity, or sex.</p>","PeriodicalId":10366,"journal":{"name":"Clinical Epigenetics","volume":"17 1","pages":"59"},"PeriodicalIF":4.8000,"publicationDate":"2025-04-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11983795/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical Epigenetics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13148-025-01864-6","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: DNA methylation-based predictors of phenotypic traits including leukocyte proportions, smoking activity, biological aging, and circulating levels of plasma proteins are widely used as biomarkers in public health research. However, limited racial and ethnic diversity of research participants is an ongoing issue for epigenetics research, and the potential downstream impacts of limited diversity in training samples on the performance of epigenetic predictors remains poorly understood. We examined the performance of epigenetic predictors of chronological age (also known as epigenetic clocks), telomere length, cell proportions, and plasma proteins within a diverse sample of adult NHANES participants during the 1999-2000 and 2001-2002 survey cycles, both overall and stratified by self-reported race/ethnicity and sex. We utilized correlation coefficients and median absolute errors (MAE) to judge predictor performance, and bootstrapping and multivariate regression to assess the significance of differences between groups.
Results: All epigenetic predictors were significantly associated with their corresponding phenotypic traits in the overall population, with particularly high correlations for the epigenetic clocks and cell proportion estimates. Several significant differences in performance were observed between racial/ethnic groups, particularly for the plasma protein predictors, with a reoccurring trend of lower correlation in Mexican American and non-Hispanic Black participants compared to non-Hispanic White participants. Sex-differences in performance for several predictors were also identified but were not as pronounced. Multivariate regression models indicated that disparities in epigenetic predictor performance persisted after accounting for overall differences in epigenetic predictions related to race/ethnicity and sex, as well as further adjustment for estimated cell proportions and SES variables.
Conclusions: We found evidence for substantial disparities in epigenetic predictor performance, with each predictor exhibiting at least one significant difference in correlation or MAE related to race, ethnicity, or sex.
期刊介绍:
Clinical Epigenetics, the official journal of the Clinical Epigenetics Society, is an open access, peer-reviewed journal that encompasses all aspects of epigenetic principles and mechanisms in relation to human disease, diagnosis and therapy. Clinical trials and research in disease model organisms are particularly welcome.