Direction-aware functional class scoring enrichment analysis of infinium DNA methylation data.

IF 2.9 3区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Epigenetics Pub Date : 2024-12-01 Epub Date: 2024-07-05 DOI:10.1080/15592294.2024.2375022
Mark Ziemann, Mandhri Abeysooriya, Anusuiya Bora, Séverine Lamon, Mary Sravya Kasu, Mitchell W Norris, Yen Ting Wong, Jeffrey M Craig
{"title":"Direction-aware functional class scoring enrichment analysis of infinium DNA methylation data.","authors":"Mark Ziemann, Mandhri Abeysooriya, Anusuiya Bora, Séverine Lamon, Mary Sravya Kasu, Mitchell W Norris, Yen Ting Wong, Jeffrey M Craig","doi":"10.1080/15592294.2024.2375022","DOIUrl":null,"url":null,"abstract":"<p><p>Infinium Methylation BeadChip arrays remain one of the most popular platforms for epigenome-wide association studies, but tools for downstream pathway analysis have their limitations. Functional class scoring (FCS) is a group of pathway enrichment techniques that involve the ranking of genes and evaluation of their collective regulation in biological systems, but the implementations described for Infinium methylation array data do not retain direction information, which is important for mechanistic understanding of genomic regulation. Here, we evaluate several candidate FCS methods that retain directional information. According to simulation results, the best-performing method involves the mean aggregation of probe limma t-statistics by gene followed by a rank-ANOVA enrichment test using the mitch package. This method, which we call 'LAM,' outperformed an existing over-representation analysis method in simulations, and showed higher sensitivity and robustness in an analysis of real lung tumour-normal paired datasets. Using matched RNA-seq data, we examine the relationship of methylation differences at promoters and gene bodies with RNA expression at the level of pathways in lung cancer. To demonstrate the utility of our approach, we apply it to three other contexts where public data were available. First, we examine the differential pathway methylation associated with chronological age. Second, we investigate pathway methylation differences in infants conceived with in vitro fertilization. Lastly, we analyse differential pathway methylation in 19 disease states, identifying hundreds of novel associations. These results show LAM is a powerful method for the detection of differential pathway methylation complementing existing methods. A reproducible vignette is provided to illustrate how to implement this method.</p>","PeriodicalId":11767,"journal":{"name":"Epigenetics","volume":"19 1","pages":"2375022"},"PeriodicalIF":2.9000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11229754/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Epigenetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1080/15592294.2024.2375022","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/5 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Infinium Methylation BeadChip arrays remain one of the most popular platforms for epigenome-wide association studies, but tools for downstream pathway analysis have their limitations. Functional class scoring (FCS) is a group of pathway enrichment techniques that involve the ranking of genes and evaluation of their collective regulation in biological systems, but the implementations described for Infinium methylation array data do not retain direction information, which is important for mechanistic understanding of genomic regulation. Here, we evaluate several candidate FCS methods that retain directional information. According to simulation results, the best-performing method involves the mean aggregation of probe limma t-statistics by gene followed by a rank-ANOVA enrichment test using the mitch package. This method, which we call 'LAM,' outperformed an existing over-representation analysis method in simulations, and showed higher sensitivity and robustness in an analysis of real lung tumour-normal paired datasets. Using matched RNA-seq data, we examine the relationship of methylation differences at promoters and gene bodies with RNA expression at the level of pathways in lung cancer. To demonstrate the utility of our approach, we apply it to three other contexts where public data were available. First, we examine the differential pathway methylation associated with chronological age. Second, we investigate pathway methylation differences in infants conceived with in vitro fertilization. Lastly, we analyse differential pathway methylation in 19 disease states, identifying hundreds of novel associations. These results show LAM is a powerful method for the detection of differential pathway methylation complementing existing methods. A reproducible vignette is provided to illustrate how to implement this method.

对无限DNA甲基化数据进行方向感知功能分类评分富集分析。
Infinium 甲基化芯片阵列仍然是最流行的表观基因组关联研究平台之一,但用于下游通路分析的工具有其局限性。功能分类评分(FCS)是一组通路富集技术,涉及基因排序和评估它们在生物系统中的集体调控,但针对 Infinium 甲基化阵列数据描述的实现方法不保留方向信息,而方向信息对于从机理上理解基因组调控非常重要。在此,我们评估了几种保留方向信息的候选 FCS 方法。根据模拟结果,表现最好的方法是按基因对探针 limma t 统计量进行平均聚合,然后使用 mitch 软件包进行秩-ANOVA 富集检验。我们称这种方法为 "LAM",它在模拟中的表现优于现有的过度代表性分析方法,并在真实肺部肿瘤-正常配对数据集的分析中表现出更高的灵敏度和稳健性。利用匹配的 RNA-seq 数据,我们研究了启动子和基因体的甲基化差异与肺癌通路水平的 RNA 表达之间的关系。为了证明我们的方法的实用性,我们将其应用到了其他三个有公开数据的领域。首先,我们研究了与年龄相关的不同通路甲基化。其次,我们研究了体外受精婴儿的通路甲基化差异。最后,我们分析了 19 种疾病状态中不同的通路甲基化,发现了数百种新的关联。这些结果表明,LAM 是一种检测不同通路甲基化的强大方法,是对现有方法的补充。我们提供了一个可重复的小故事来说明如何实施这种方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Epigenetics
Epigenetics 生物-生化与分子生物学
CiteScore
6.80
自引率
2.70%
发文量
82
审稿时长
3-8 weeks
期刊介绍: Epigenetics publishes peer-reviewed original research and review articles that provide an unprecedented forum where epigenetic mechanisms and their role in diverse biological processes can be revealed, shared, and discussed. Epigenetics research studies heritable changes in gene expression caused by mechanisms others than the modification of the DNA sequence. Epigenetics therefore plays critical roles in a variety of biological systems, diseases, and disciplines. Topics of interest include (but are not limited to): DNA methylation Nucleosome positioning and modification Gene silencing Imprinting Nuclear reprogramming Chromatin remodeling Non-coding RNA Non-histone chromosomal elements Dosage compensation Nuclear organization Epigenetic therapy and diagnostics Nutrition and environmental epigenetics Cancer epigenetics Neuroepigenetics
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信