等位基因频率谱变异对囊性肾病的影响

Omid Sadeghi-Alavijeh, Melanie MY Chan, Gabriel Doctor, Catalin Voinescu, Alex Stuckey, Athanasios Kousathanas, Alexander Ho, Genomics England Research Consortium, Horia Stanescu, Horia Stanescu, Detlef Bockenhauer, Richard Sandford, Adam P Levine, Daniel P Gale
{"title":"等位基因频率谱变异对囊性肾病的影响","authors":"Omid Sadeghi-Alavijeh, Melanie MY Chan, Gabriel Doctor, Catalin Voinescu, Alex Stuckey, Athanasios Kousathanas, Alexander Ho, Genomics England Research Consortium, Horia Stanescu, Horia Stanescu, Detlef Bockenhauer, Richard Sandford, Adam P Levine, Daniel P Gale","doi":"10.1101/2024.02.14.24302377","DOIUrl":null,"url":null,"abstract":"Introduction\nCystic kidney disease (CyKD) is frequently a familial disease, with ~85% of probands receiving a monogenic diagnosis. However, gene discovery has been led by family-based and candidate gene studies, limiting the ascertainment of non-Mendelian genetic contributors to the disease. Using whole genome sequencing data provided by the 100,000 Genomes Project (100KGP), we used hypothesis-free approaches to systematically characterize and quantify the genetic contributors to CyKD across variant types and the allele frequency spectrum. Methods\nWe performed a sequencing-based genome-wide association study in 1,209 unrelated patients recruited to the 100,000 Genomes Project with CyKD and 26,096 ancestry-matched unaffected controls. The analysis was inclusive of individuals with diverse genetic ancestries. Enrichment of common, low-frequency (minor allele frequency [MAF] > 0.1%) and rare (MAF < 0.1%) single-nucleotide variants (SNV), indels and rare structural variants (SV) on a genome-wide and per-gene basis was sought using a generalised linear mixed model approach to account for population structure. Meta-analysis of CyKD cohorts from Finngen, the UK Biobank and BioBank Japan was performed. Results\nIn 995 of the 1209 (82.30%) CyKD cases a likely disease-causing monogenic variant was identified. Gene-based analysis of rare SNVs/indels predicted to be damaging revealed PKD1 (P=1.13x10-309), PKD2 (P=1.96x10-150), DNAJB11 (P=3.52x10-7), COL4A3 (P=1.26x10-6) and truncating monoallelic PKHD1 (P=2.98x10-8) variants to be significantly associated with disease. Depleting for solved cases led to the emergence of a significant association at IFT140 (P=3.46x10-17) and strengthening of the COL4A3 (P=9.27x10-7) association, driven exclusively by heterozygous variants for both genes. After depleting for those harbouring IFT140 and COL4A3 variants , no other genes were identified. Risk of disease attributable to monoallelic defects of multiple genes linked with CyKD was quantified, with lower risk seen in rarer and more recently described genetic diagnoses.\nGenome-wide structural variant associations highlighted deletions in PKD1 (P=2.17x10-22), PKD2 (P=7.48x10-12) and the 17q12 locus containing HNF1B (P=4.12x10-8) as statistically significant contributors to disease. Genome-wide analysis of over 18 million common and low-frequency variants in the Finnish population revealed evidence of association (P=1.4x10-149) of a heterozygous stop-gain variant in PKHD1 that is endemic (MAF=4.7x10-03) in this population. Meta-analysis of 2,923 cases and 900,824 controls across 6,641,351 common and low frequency variants including UK, Japanese and Finnish biobanks did not reveal any novel significant associations. SNVs with a MAF>0.1% accounted for between 3 and 9% of the heritability of CyKD across three different European ancestry cohorts. Conclusions\nThese findings represent an unbiased examination of the genetic architecture of a national CyKD cohort using robust statistical methodology. Causative monoallelic mutations in IFT140 have recently been reported in other cohorts associated with a milder phenotype than PKD1/2-associated disease. The association with COL4A3 suggests that in some circumstances CyKD may be the presenting feature of collagen IV-related kidney disease and the significant association observed with monoallelic predicted loss-of-function PKHD1 variants extends the spectrum of phenotypic abnormalities associated with this gene. In addition to quantification of the contribution of non-coding and structural variants to CyKD, the per gene quantification of CyKD risk presented could be used to inform genetic testing and counselling strategies clinically and we also show that common variants make a small contribution to CyKD heritability. Keywords: genomics, cystic kidney disease, renal, ADPKD","PeriodicalId":501513,"journal":{"name":"medRxiv - Nephrology","volume":"17 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Contribution of variants across the allelic frequency spectrum to cystic kidney disease\",\"authors\":\"Omid Sadeghi-Alavijeh, Melanie MY Chan, Gabriel Doctor, Catalin Voinescu, Alex Stuckey, Athanasios Kousathanas, Alexander Ho, Genomics England Research Consortium, Horia Stanescu, Horia Stanescu, Detlef Bockenhauer, Richard Sandford, Adam P Levine, Daniel P Gale\",\"doi\":\"10.1101/2024.02.14.24302377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction\\nCystic kidney disease (CyKD) is frequently a familial disease, with ~85% of probands receiving a monogenic diagnosis. However, gene discovery has been led by family-based and candidate gene studies, limiting the ascertainment of non-Mendelian genetic contributors to the disease. Using whole genome sequencing data provided by the 100,000 Genomes Project (100KGP), we used hypothesis-free approaches to systematically characterize and quantify the genetic contributors to CyKD across variant types and the allele frequency spectrum. Methods\\nWe performed a sequencing-based genome-wide association study in 1,209 unrelated patients recruited to the 100,000 Genomes Project with CyKD and 26,096 ancestry-matched unaffected controls. The analysis was inclusive of individuals with diverse genetic ancestries. Enrichment of common, low-frequency (minor allele frequency [MAF] > 0.1%) and rare (MAF < 0.1%) single-nucleotide variants (SNV), indels and rare structural variants (SV) on a genome-wide and per-gene basis was sought using a generalised linear mixed model approach to account for population structure. Meta-analysis of CyKD cohorts from Finngen, the UK Biobank and BioBank Japan was performed. Results\\nIn 995 of the 1209 (82.30%) CyKD cases a likely disease-causing monogenic variant was identified. Gene-based analysis of rare SNVs/indels predicted to be damaging revealed PKD1 (P=1.13x10-309), PKD2 (P=1.96x10-150), DNAJB11 (P=3.52x10-7), COL4A3 (P=1.26x10-6) and truncating monoallelic PKHD1 (P=2.98x10-8) variants to be significantly associated with disease. Depleting for solved cases led to the emergence of a significant association at IFT140 (P=3.46x10-17) and strengthening of the COL4A3 (P=9.27x10-7) association, driven exclusively by heterozygous variants for both genes. After depleting for those harbouring IFT140 and COL4A3 variants , no other genes were identified. Risk of disease attributable to monoallelic defects of multiple genes linked with CyKD was quantified, with lower risk seen in rarer and more recently described genetic diagnoses.\\nGenome-wide structural variant associations highlighted deletions in PKD1 (P=2.17x10-22), PKD2 (P=7.48x10-12) and the 17q12 locus containing HNF1B (P=4.12x10-8) as statistically significant contributors to disease. Genome-wide analysis of over 18 million common and low-frequency variants in the Finnish population revealed evidence of association (P=1.4x10-149) of a heterozygous stop-gain variant in PKHD1 that is endemic (MAF=4.7x10-03) in this population. Meta-analysis of 2,923 cases and 900,824 controls across 6,641,351 common and low frequency variants including UK, Japanese and Finnish biobanks did not reveal any novel significant associations. SNVs with a MAF>0.1% accounted for between 3 and 9% of the heritability of CyKD across three different European ancestry cohorts. Conclusions\\nThese findings represent an unbiased examination of the genetic architecture of a national CyKD cohort using robust statistical methodology. Causative monoallelic mutations in IFT140 have recently been reported in other cohorts associated with a milder phenotype than PKD1/2-associated disease. The association with COL4A3 suggests that in some circumstances CyKD may be the presenting feature of collagen IV-related kidney disease and the significant association observed with monoallelic predicted loss-of-function PKHD1 variants extends the spectrum of phenotypic abnormalities associated with this gene. In addition to quantification of the contribution of non-coding and structural variants to CyKD, the per gene quantification of CyKD risk presented could be used to inform genetic testing and counselling strategies clinically and we also show that common variants make a small contribution to CyKD heritability. Keywords: genomics, cystic kidney disease, renal, ADPKD\",\"PeriodicalId\":501513,\"journal\":{\"name\":\"medRxiv - Nephrology\",\"volume\":\"17 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"medRxiv - Nephrology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1101/2024.02.14.24302377\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Nephrology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.02.14.24302377","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

导言囊性肾病(CyKD)通常是一种家族性疾病,约有 85% 的原发性肾病患者被诊断为单基因遗传病。然而,基因的发现一直以基于家族和候选基因的研究为主导,从而限制了对该疾病非孟德尔遗传因素的确定。利用十万基因组计划(100,000 Genomes Project,100KGP)提供的全基因组测序数据,我们采用了无假设方法,系统地描述和量化了不同变异类型和等位基因频率谱中导致 CyKD 的遗传因素。方法我们对 100,000 基因组计划中招募的 1,209 名非亲属关系的 CyKD 患者和 26,096 名祖先匹配的未受影响对照进行了基于测序的全基因组关联研究。该分析涵盖了具有不同遗传血统的个体。利用广义线性混合模型方法,在全基因组和每个基因的基础上对常见、低频(小等位基因频率 [MAF] > 0.1%)和罕见(MAF < 0.1%)单核苷酸变异 (SNV)、嵌合体和罕见结构变异 (SV) 进行了富集,以考虑群体结构。对 Finngen、英国生物库(UK Biobank)和日本生物库(BioBank Japan)的 CyKD 队列进行了元分析。结果 在 1209 例 CyKD 病例中的 995 例(82.30%)发现了可能致病的单基因变异。对预测具有损伤性的罕见 SNV/嵌段进行的基于基因的分析表明,PKD1(P=1.13x10-309)、PKD2(P=1.96x10-150)、DNAJB11(P=3.52x10-7)、COL4A3(P=1.26x10-6)和截短单偶 PKHD1(P=2.98x10-8)变异与疾病显著相关。对已解决的病例进行筛选后,IFT140(P=3.46x10-17)出现了明显的关联,COL4A3(P=9.27x10-7)的关联也得到了加强,这完全是由这两个基因的杂合变异引起的。在筛选出携带 IFT140 和 COL4A3 变体的人之后,没有发现其他基因。全基因组结构变异关联强调了 PKD1(P=2.17x10-22)、PKD2(P=7.48x10-12)和包含 HNF1B 的 17q12 位点(P=4.12x10-8)的缺失,这些缺失在统计学上对疾病的发生有重要影响。对芬兰人群中 1,800 多万个常见和低频变异的全基因组分析显示,PKHD1 中的一个杂合停止-增益变异与该疾病有关联(P=1.4x10-149),该变异在该人群中流行(MAF=4.7x10-03)。对英国、日本和芬兰生物库中6,641,351个常见和低频变异中的2,923个病例和900,824个对照进行的元分析没有发现任何新的显著关联。在三个不同的欧洲血统队列中,MAF>0.1%的SNV占CyKD遗传率的3%至9%。结论这些发现代表了利用可靠的统计方法对全国 CyKD 队列的遗传结构进行的无偏见检查。最近在其他队列中报告了 IFT140 的致病单倍突变,与 PKD1/2 相关疾病相比,其表型较轻。与 COL4A3 的关联表明,在某些情况下,CyKD 可能是胶原 IV 相关肾病的表现特征,而与单等位基因预测功能缺失 PKHD1 变异的显著关联则扩展了与该基因相关的表型异常的范围。除了量化非编码变异和结构变异对 CyKD 的贡献外,每个基因对 CyKD 风险的量化还可用于为临床上的基因检测和咨询策略提供信息,我们还表明常见变异对 CyKD 遗传性的贡献很小。关键词:基因组学;囊性肾病;肾脏;ADPKD
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Contribution of variants across the allelic frequency spectrum to cystic kidney disease
Introduction Cystic kidney disease (CyKD) is frequently a familial disease, with ~85% of probands receiving a monogenic diagnosis. However, gene discovery has been led by family-based and candidate gene studies, limiting the ascertainment of non-Mendelian genetic contributors to the disease. Using whole genome sequencing data provided by the 100,000 Genomes Project (100KGP), we used hypothesis-free approaches to systematically characterize and quantify the genetic contributors to CyKD across variant types and the allele frequency spectrum. Methods We performed a sequencing-based genome-wide association study in 1,209 unrelated patients recruited to the 100,000 Genomes Project with CyKD and 26,096 ancestry-matched unaffected controls. The analysis was inclusive of individuals with diverse genetic ancestries. Enrichment of common, low-frequency (minor allele frequency [MAF] > 0.1%) and rare (MAF < 0.1%) single-nucleotide variants (SNV), indels and rare structural variants (SV) on a genome-wide and per-gene basis was sought using a generalised linear mixed model approach to account for population structure. Meta-analysis of CyKD cohorts from Finngen, the UK Biobank and BioBank Japan was performed. Results In 995 of the 1209 (82.30%) CyKD cases a likely disease-causing monogenic variant was identified. Gene-based analysis of rare SNVs/indels predicted to be damaging revealed PKD1 (P=1.13x10-309), PKD2 (P=1.96x10-150), DNAJB11 (P=3.52x10-7), COL4A3 (P=1.26x10-6) and truncating monoallelic PKHD1 (P=2.98x10-8) variants to be significantly associated with disease. Depleting for solved cases led to the emergence of a significant association at IFT140 (P=3.46x10-17) and strengthening of the COL4A3 (P=9.27x10-7) association, driven exclusively by heterozygous variants for both genes. After depleting for those harbouring IFT140 and COL4A3 variants , no other genes were identified. Risk of disease attributable to monoallelic defects of multiple genes linked with CyKD was quantified, with lower risk seen in rarer and more recently described genetic diagnoses. Genome-wide structural variant associations highlighted deletions in PKD1 (P=2.17x10-22), PKD2 (P=7.48x10-12) and the 17q12 locus containing HNF1B (P=4.12x10-8) as statistically significant contributors to disease. Genome-wide analysis of over 18 million common and low-frequency variants in the Finnish population revealed evidence of association (P=1.4x10-149) of a heterozygous stop-gain variant in PKHD1 that is endemic (MAF=4.7x10-03) in this population. Meta-analysis of 2,923 cases and 900,824 controls across 6,641,351 common and low frequency variants including UK, Japanese and Finnish biobanks did not reveal any novel significant associations. SNVs with a MAF>0.1% accounted for between 3 and 9% of the heritability of CyKD across three different European ancestry cohorts. Conclusions These findings represent an unbiased examination of the genetic architecture of a national CyKD cohort using robust statistical methodology. Causative monoallelic mutations in IFT140 have recently been reported in other cohorts associated with a milder phenotype than PKD1/2-associated disease. The association with COL4A3 suggests that in some circumstances CyKD may be the presenting feature of collagen IV-related kidney disease and the significant association observed with monoallelic predicted loss-of-function PKHD1 variants extends the spectrum of phenotypic abnormalities associated with this gene. In addition to quantification of the contribution of non-coding and structural variants to CyKD, the per gene quantification of CyKD risk presented could be used to inform genetic testing and counselling strategies clinically and we also show that common variants make a small contribution to CyKD heritability. Keywords: genomics, cystic kidney disease, renal, ADPKD
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信