{"title":"剖析群体规模生物库中推定功能缺失变异体的降低渗透率。","authors":"David R Blair, Neil Risch","doi":"10.1101/2024.09.23.24314008","DOIUrl":null,"url":null,"abstract":"<p><p>Loss-of-function genetic variants (LoFs) often result in severe phenotypes, including autosomal dominant diseases driven by haploinsufficiency. Due to low carrier frequencies, their penetrance is generally unknown but typically variable. Here, we investigate the penetrance of >6,000 predicted LoFs (pLoFs) linked to 91 haploinsufficient diseases using a cohort of ≈24,000 carriers with linked electronic health record data. We find evidence for widespread reduced penetrance, which persisted after accounting for variant annotation artifacts, missed diagnoses, and incomplete clinical data. We thus hypothesized that many pLoFs have incomplete penetrance, which may be driven by residual allelic activity. To test this, we trained machine learning models to predict pLoF penetrance using variant-specific genomic features that may correlate with incomplete loss-of-function. The models were predictive of pLoF penetrance across a range of diseases and variant types, including those with prior clinical evidence for pathogenicity. This suggests that many pLoFs have incomplete penetrance due to residual allelic activity, complicating disease prognostication in asymptomatic carriers.</p>","PeriodicalId":94281,"journal":{"name":"medRxiv : the preprint server for health sciences","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11469360/pdf/","citationCount":"0","resultStr":"{\"title\":\"Reduced Penetrance is Common Among Predicted Loss-of-Function Variants and is Likely Driven by Residual Allelic Activity.\",\"authors\":\"David R Blair, Neil Risch\",\"doi\":\"10.1101/2024.09.23.24314008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Loss-of-function genetic variants (LoFs) often result in severe phenotypes, including autosomal dominant diseases driven by haploinsufficiency. Due to low carrier frequencies, their penetrance is generally unknown but typically variable. Here, we investigate the penetrance of >6,000 predicted LoFs (pLoFs) linked to 91 haploinsufficient diseases using a cohort of ≈24,000 carriers with linked electronic health record data. We find evidence for widespread reduced penetrance, which persisted after accounting for variant annotation artifacts, missed diagnoses, and incomplete clinical data. We thus hypothesized that many pLoFs have incomplete penetrance, which may be driven by residual allelic activity. To test this, we trained machine learning models to predict pLoF penetrance using variant-specific genomic features that may correlate with incomplete loss-of-function. The models were predictive of pLoF penetrance across a range of diseases and variant types, including those with prior clinical evidence for pathogenicity. This suggests that many pLoFs have incomplete penetrance due to residual allelic activity, complicating disease prognostication in asymptomatic carriers.</p>\",\"PeriodicalId\":94281,\"journal\":{\"name\":\"medRxiv : the preprint server for health sciences\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11469360/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv : the preprint server for health sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.09.23.24314008\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv : the preprint server for health sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.23.24314008","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
功能缺失变体(LoFs)会破坏受影响基因的活性。它们通常与临床表型相关,包括由单倍体功能缺失导致的常染色体显性遗传病。最近利用生物库进行的分析表明,某些单倍体缺乏症的 LoF 渗透率可能很低,这一观察结果对人群基因组筛查具有重要意义。然而,生物库中也存在大量缺失数据,这些发现的可靠性仍不确定。在这里,我们利用从英国生物库(UK Biobank)和 "我们所有人 "研究计划(All of Us Research Program)这两个人群规模生物库中提取的≈24,000名携带者组成的队列,研究了推定LoFs(pLoFs)的穿透性。我们研究了pLoF穿透性降低的几种可能原因,包括生物库招募偏差、注释伪影、漏诊和不完整的临床记录。系统地考虑这些因素会增加穿透性,但普遍的穿透性降低仍然存在。因此,我们假设一定是其他因素导致了这一现象。为了验证这一点,我们训练了机器学习模型,利用每个变异体特有的基因组特征来识别具有高穿透性的 pLoFs。这些模型可以预测一系列疾病和 pLoF 类型的穿透性,包括那些之前已有致病性证据的疾病和 pLoF 类型。这表明,pLoF 穿透性降低的情况实际上很常见,因此在为无症状的携带者提供咨询时应小心谨慎。
Reduced Penetrance is Common Among Predicted Loss-of-Function Variants and is Likely Driven by Residual Allelic Activity.
Loss-of-function genetic variants (LoFs) often result in severe phenotypes, including autosomal dominant diseases driven by haploinsufficiency. Due to low carrier frequencies, their penetrance is generally unknown but typically variable. Here, we investigate the penetrance of >6,000 predicted LoFs (pLoFs) linked to 91 haploinsufficient diseases using a cohort of ≈24,000 carriers with linked electronic health record data. We find evidence for widespread reduced penetrance, which persisted after accounting for variant annotation artifacts, missed diagnoses, and incomplete clinical data. We thus hypothesized that many pLoFs have incomplete penetrance, which may be driven by residual allelic activity. To test this, we trained machine learning models to predict pLoF penetrance using variant-specific genomic features that may correlate with incomplete loss-of-function. The models were predictive of pLoF penetrance across a range of diseases and variant types, including those with prior clinical evidence for pathogenicity. This suggests that many pLoFs have incomplete penetrance due to residual allelic activity, complicating disease prognostication in asymptomatic carriers.