Naren Ramesh, Alexandria Evans, Kevin Wojta, Zhongan Yang, Marco P Boks, René S Kahn, Sterre C M de Boer, Sven J van der Lee, Yolande A L Pijnenburg, Lianne M Reus, Roel A Ophoff
{"title":"Accurate DNA Methylation Predictor for C9orf72 Repeat Expansion Alleles in the Pathogenic Range.","authors":"Naren Ramesh, Alexandria Evans, Kevin Wojta, Zhongan Yang, Marco P Boks, René S Kahn, Sterre C M de Boer, Sven J van der Lee, Yolande A L Pijnenburg, Lianne M Reus, Roel A Ophoff","doi":"10.1016/j.xhgg.2025.100522","DOIUrl":null,"url":null,"abstract":"<p><p>The hexanucleotide (G<sub>4</sub>C<sub>2</sub>) repeat expansion in the promoter region of C9orf72 is the most frequent genetic cause of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS). In this study, we conducted a genome-wide DNA methylation (DNAm) analysis using EPIC version 2 (EPICv2) arrays on an FTD cohort comprising 27 carriers and 250 non-carriers of the pathogenic C9orf72 repeat expansion from the Amsterdam Dementia Cohort. We identified differentially methylated CpGs probes associated with the pathogenic C9orf72 expansion and used these findings to create a DNAm Least Absolute Shrinkage and Selection Operator (LASSO) predictor to identify repeat expansion carriers. Eight CpG sites at the C9orf72 locus were significantly differentially hypermethylated in repeat expansion carriers compared to non-carriers. The LASSO model predicted repeat expansion status with an average accuracy of 98.6%. The LASSO predictor was further validated in a separate, independent validation cohort containing 1,589 bipolar disorder patients, 580 first degree relatives, and 289 independent controls with available EPICv2 data, identifying four C9orf72 repeat expansion carriers, subsequently confirmed by repeat-primed PCR. This result highlights the accuracy and generalizability of the DNAm predictor of C9orf72 repeat expansion carriers. The identification of a highly accurate DNAm biomarker for a repeat expansion locus associated with neurodegenerative disorders may provide great value for studying this locus. The approach holds significant promise for investigating this and other repeat expansion loci, particularly given the growing interest in epigenetic epidemiological studies involving large cohorts with available DNAm data.</p>","PeriodicalId":34530,"journal":{"name":"HGG Advances","volume":" ","pages":"100522"},"PeriodicalIF":3.6000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"HGG Advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1016/j.xhgg.2025.100522","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
The hexanucleotide (G4C2) repeat expansion in the promoter region of C9orf72 is the most frequent genetic cause of frontotemporal dementia (FTD) and amyotrophic lateral sclerosis (ALS). In this study, we conducted a genome-wide DNA methylation (DNAm) analysis using EPIC version 2 (EPICv2) arrays on an FTD cohort comprising 27 carriers and 250 non-carriers of the pathogenic C9orf72 repeat expansion from the Amsterdam Dementia Cohort. We identified differentially methylated CpGs probes associated with the pathogenic C9orf72 expansion and used these findings to create a DNAm Least Absolute Shrinkage and Selection Operator (LASSO) predictor to identify repeat expansion carriers. Eight CpG sites at the C9orf72 locus were significantly differentially hypermethylated in repeat expansion carriers compared to non-carriers. The LASSO model predicted repeat expansion status with an average accuracy of 98.6%. The LASSO predictor was further validated in a separate, independent validation cohort containing 1,589 bipolar disorder patients, 580 first degree relatives, and 289 independent controls with available EPICv2 data, identifying four C9orf72 repeat expansion carriers, subsequently confirmed by repeat-primed PCR. This result highlights the accuracy and generalizability of the DNAm predictor of C9orf72 repeat expansion carriers. The identification of a highly accurate DNAm biomarker for a repeat expansion locus associated with neurodegenerative disorders may provide great value for studying this locus. The approach holds significant promise for investigating this and other repeat expansion loci, particularly given the growing interest in epigenetic epidemiological studies involving large cohorts with available DNAm data.
C9orf72启动子区域的六核苷酸(G4C2)重复扩增是额颞叶痴呆(FTD)和肌萎缩侧索硬化症(ALS)最常见的遗传原因。在这项研究中,我们使用EPIC version 2 (EPICv2)阵列对一个FTD队列进行了全基因组DNA甲基化(DNAm)分析,该队列包括来自阿姆斯特丹痴呆队列的27名致病C9orf72重复扩增的携带者和250名非携带者。我们发现了与致病性C9orf72扩增相关的差异甲基化CpGs探针,并利用这些发现创建了DNAm最小绝对收缩和选择算子(LASSO)预测因子,以识别重复扩增载体。重复扩增基因携带者与非携带者相比,C9orf72位点的8个CpG位点的高甲基化显著差异。LASSO模型预测重复扩张状态的平均准确率为98.6%。LASSO预测因子在一个独立的独立验证队列中进一步验证,该队列包含1,589名双相情感障碍患者,580名一级亲属和289名具有可用EPICv2数据的独立对照,确定了4个C9orf72重复扩增载体,随后通过重复引物PCR证实。这一结果突出了C9orf72重复扩增载体DNAm预测的准确性和通用性。识别与神经退行性疾病相关的重复扩增位点的高精度DNAm生物标志物可能为该位点的研究提供重要价值。该方法对研究这一重复扩增位点和其他重复扩增位点具有重要的前景,特别是考虑到对涉及具有可用DNAm数据的大型队列的表观遗传流行病学研究日益增长的兴趣。