Evaluating and improving health equity and fairness of polygenic scores.

IF 3.3 Q2 GENETICS & HEREDITY

HGG Advances Pub Date : 2024-04-11 Epub Date: 2024-02-23 DOI:10.1016/j.xhgg.2024.100280

Tianyu Zhang, Geyu Zhou, Lambertus Klei, Peng Liu, Alexandra Chouldechova, Hongyu Zhao, Kathryn Roeder, Max G'Sell, Bernie Devlin

{"title":"Evaluating and improving health equity and fairness of polygenic scores.","authors":"Tianyu Zhang, Geyu Zhou, Lambertus Klei, Peng Liu, Alexandra Chouldechova, Hongyu Zhao, Kathryn Roeder, Max G'Sell, Bernie Devlin","doi":"10.1016/j.xhgg.2024.100280","DOIUrl":null,"url":null,"abstract":"<p><p>Polygenic scores (PGSs) are quantitative metrics for predicting phenotypic values, such as human height or disease status. Some PGS methods require only summary statistics of a relevant genome-wide association study (GWAS) for their score. One such method is Lassosum, which inherits the model selection advantages of Lasso to select a meaningful subset of the GWAS single-nucleotide polymorphisms as predictors from their association statistics. However, even efficient scores like Lassosum, when derived from European-based GWASs, are poor predictors of phenotype for subjects of non-European ancestry; that is, they have limited portability to other ancestries. To increase the portability of Lassosum, when GWAS information and estimates of linkage disequilibrium are available for both ancestries, we propose Joint-Lassosum (JLS). In the simulation settings we explore, JLS provides more accurate PGSs compared to other methods, especially when measured in terms of fairness. In analyses of UK Biobank data, JLS was computationally more efficient but slightly less accurate than a Bayesian comparator, SDPRX. Like all PGS methods, JLS requires selection of predictors, which are determined by data-driven tuning parameters. We describe a new approach to selecting tuning parameters and note its relevance for model selection for any PGS. We also draw connections to the literature on algorithmic fairness and discuss how JLS can help mitigate fairness-related harms that might result from the use of PGSs in clinical settings. While no PGS method is likely to be universally portable, due to the diversity of human populations and unequal information content of GWASs for different ancestries, JLS is an effective approach for enhancing portability and reducing predictive bias.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100280"},"PeriodicalIF":3.3000,"publicationDate":"2024-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10937319/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"HGG Advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xhgg.2024.100280","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/2/23 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}

引用次数: 0

Abstract

Polygenic scores (PGSs) are quantitative metrics for predicting phenotypic values, such as human height or disease status. Some PGS methods require only summary statistics of a relevant genome-wide association study (GWAS) for their score. One such method is Lassosum, which inherits the model selection advantages of Lasso to select a meaningful subset of the GWAS single-nucleotide polymorphisms as predictors from their association statistics. However, even efficient scores like Lassosum, when derived from European-based GWASs, are poor predictors of phenotype for subjects of non-European ancestry; that is, they have limited portability to other ancestries. To increase the portability of Lassosum, when GWAS information and estimates of linkage disequilibrium are available for both ancestries, we propose Joint-Lassosum (JLS). In the simulation settings we explore, JLS provides more accurate PGSs compared to other methods, especially when measured in terms of fairness. In analyses of UK Biobank data, JLS was computationally more efficient but slightly less accurate than a Bayesian comparator, SDPRX. Like all PGS methods, JLS requires selection of predictors, which are determined by data-driven tuning parameters. We describe a new approach to selecting tuning parameters and note its relevance for model selection for any PGS. We also draw connections to the literature on algorithmic fairness and discuss how JLS can help mitigate fairness-related harms that might result from the use of PGSs in clinical settings. While no PGS method is likely to be universally portable, due to the diversity of human populations and unequal information content of GWASs for different ancestries, JLS is an effective approach for enhancing portability and reducing predictive bias.

查看原文本刊更多论文

评估和改善多基因评分的健康公平性和公正性。

多基因评分（PGS）是预测人类身高或疾病状况等表型值的量化指标。一些多基因评分方法只需要相关全基因组关联研究（GWAS）的汇总统计数据就能得出评分。Lassosum 就是这样一种方法，它继承了 Lasso 的模型选择优势，从关联统计中选择有意义的全基因组关联研究单核苷酸多态性子集作为预测因子。然而，即使是像 Lassosum 这样高效的分数，如果来自欧洲的 GWAS，对非欧洲血统受试者的表型预测能力也很差；也就是说，它们对其他血统的可移植性有限。为了提高 Lassosum 的可移植性，当两个祖先的 GWAS 信息和连锁不平衡估计值都可用时，我们提出了联合 Lassosum。在我们探索的模拟设置中，与其他方法相比，Joint-Lassosum 提供了更准确的 PGS，尤其是在公平性方面。在对英国生物库数据的分析中，JLS 的计算效率更高，但准确性略低于贝叶斯比较方法 SDPRX。与所有 PGS 方法一样，Joint-Lassosum 也需要选择预测因子，而预测因子是由数据驱动的调整参数决定的。我们介绍了一种选择调整参数的新方法，并指出了它与任何 PGS 模型选择的相关性。我们还引出了与算法公平性相关的文献，并讨论了 Joint-Lassosum 如何帮助减轻在临床环境中使用 PGS 评分可能导致的与公平性相关的危害。虽然任何 PGS 方法都不可能具有普遍的可移植性，但由于人类种群的多样性和不同祖先的 GWAS 信息含量不平等，Joint-Lassosum 是提高可移植性和减少预测偏差的有效方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊