Yuanchao Zheng, Kathryn L Lunetta, Chunyu Liu, Alicia K Smith, Richard Sherva, Mark W Miller, Mark W Logue
{"title":"A novel principal component based method for identifying differentially methylated regions in Illumina Infinium MethylationEPIC BeadChip data.","authors":"Yuanchao Zheng, Kathryn L Lunetta, Chunyu Liu, Alicia K Smith, Richard Sherva, Mark W Miller, Mark W Logue","doi":"10.1080/15592294.2023.2207959","DOIUrl":null,"url":null,"abstract":"<p><p>Differentially methylated regions (DMRs) are genomic regions with methylation patterns across multiple CpG sites that are associated with a phenotype. In this study, we proposed a Principal Component (PC) based DMR analysis method for use with data generated using the Illumina Infinium MethylationEPIC BeadChip (EPIC) array. We obtained methylation residuals by regressing the M-values of CpGs within a region on covariates, extracted PCs of the residuals, and then combined association information across PCs to obtain regional significance. Simulation-based genome-wide false positive (GFP) rates and true positive rates were estimated under a variety of conditions before determining the final version of our method, which we have named DMR<sub>PC</sub>. Then, DMR<sub>PC</sub> and another DMR method, coMethDMR, were used to perform epigenome-wide analyses of several phenotypes known to have multiple associated methylation loci (age, sex, and smoking) in a discovery and a replication cohort. Among regions that were analysed by both methods, DMR<sub>PC</sub> identified 50% more genome-wide significant age-associated DMRs than coMethDMR. The replication rate for the loci that were identified by only DMR<sub>PC</sub> was higher than the rate for those that were identified by only coMethDMR (90% for DMRPC vs. 76% for coMethDMR). Furthermore, DMR<sub>PC</sub> identified replicable associations in regions of moderate between-CpG correlation which are typically not analysed by coMethDMR. For the analyses of sex and smoking, the advantage of DMR<sub>PC</sub> was less clear. In conclusion, DMR<sub>PC</sub> is a new powerful DMR discovery tool that retains power in genomic regions with moderate correlation across CpGs.</p>","PeriodicalId":11767,"journal":{"name":"Epigenetics","volume":"18 1","pages":"2207959"},"PeriodicalIF":2.9000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10193914/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epigenetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1080/15592294.2023.2207959","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Differentially methylated regions (DMRs) are genomic regions with methylation patterns across multiple CpG sites that are associated with a phenotype. In this study, we proposed a Principal Component (PC) based DMR analysis method for use with data generated using the Illumina Infinium MethylationEPIC BeadChip (EPIC) array. We obtained methylation residuals by regressing the M-values of CpGs within a region on covariates, extracted PCs of the residuals, and then combined association information across PCs to obtain regional significance. Simulation-based genome-wide false positive (GFP) rates and true positive rates were estimated under a variety of conditions before determining the final version of our method, which we have named DMRPC. Then, DMRPC and another DMR method, coMethDMR, were used to perform epigenome-wide analyses of several phenotypes known to have multiple associated methylation loci (age, sex, and smoking) in a discovery and a replication cohort. Among regions that were analysed by both methods, DMRPC identified 50% more genome-wide significant age-associated DMRs than coMethDMR. The replication rate for the loci that were identified by only DMRPC was higher than the rate for those that were identified by only coMethDMR (90% for DMRPC vs. 76% for coMethDMR). Furthermore, DMRPC identified replicable associations in regions of moderate between-CpG correlation which are typically not analysed by coMethDMR. For the analyses of sex and smoking, the advantage of DMRPC was less clear. In conclusion, DMRPC is a new powerful DMR discovery tool that retains power in genomic regions with moderate correlation across CpGs.
期刊介绍:
Epigenetics publishes peer-reviewed original research and review articles that provide an unprecedented forum where epigenetic mechanisms and their role in diverse biological processes can be revealed, shared, and discussed.
Epigenetics research studies heritable changes in gene expression caused by mechanisms others than the modification of the DNA sequence. Epigenetics therefore plays critical roles in a variety of biological systems, diseases, and disciplines. Topics of interest include (but are not limited to):
DNA methylation
Nucleosome positioning and modification
Gene silencing
Imprinting
Nuclear reprogramming
Chromatin remodeling
Non-coding RNA
Non-histone chromosomal elements
Dosage compensation
Nuclear organization
Epigenetic therapy and diagnostics
Nutrition and environmental epigenetics
Cancer epigenetics
Neuroepigenetics