基于通用复合统计框架的DNA甲基化流行病学数据分析

I. Valavanis, E. Sifakis, P. Georgiadis, S. Kyrtopoulos, A. Chatziioannou
{"title":"基于通用复合统计框架的DNA甲基化流行病学数据分析","authors":"I. Valavanis, E. Sifakis, P. Georgiadis, S. Kyrtopoulos, A. Chatziioannou","doi":"10.1109/BIBE.2012.6399775","DOIUrl":null,"url":null,"abstract":"DNA methylation events represent epigenetic heritable modifications that regulate gene expression by affecting chromatin remodeling. They are encountered more often in CpG rich promoter regions, while they do not alter the DNA sequence itself. High-volume DNA methylation profiling methods exploit microarray technologies and provide a wealth of data. This data solicits rigorous, generic, yet ad-hoc adjusted, analytical pipelines for the meaningful systems-level analysis and interpretation. In this work, the Illumina Infinium HumanMethylation450 BeadChip platform is utilized in an epidemiological cohort from Italy in an effort to correlate interesting methylation patterns with breast cancer predisposition. The composite computational framework proposed here builds upon well established, analytical techniques, employed in mRNA analysis. For analysis purposes, the log2(ratio) of the intensities of a Methylated probe (IMeth) versus an UnMethylated probe (IUn-Meth), quoted as M-value, is used. Intensity based correction of the M-signal distribution is systematically applied, based upon Intensity-related error measures from quality controls samples incorporated in each chip. Thus, batch effects are corrected, while probe-specific, intensity-related, error measures are considered too. Robust, (based on bootstrapping) statistical measures measuring biological variation at the probe level, are derived in order to propose candidate biomarkers. To this end, coefficient variation measurements of DNA methylation between controls and cases are utilized, alleviating simultaneously the impact of technical variation, and are juxtaposed to classical statistical differential analysis measures.","PeriodicalId":330164,"journal":{"name":"2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Analysis of DNA methylation epidemiological data through a generic composite statistical framework\",\"authors\":\"I. Valavanis, E. Sifakis, P. Georgiadis, S. Kyrtopoulos, A. Chatziioannou\",\"doi\":\"10.1109/BIBE.2012.6399775\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"DNA methylation events represent epigenetic heritable modifications that regulate gene expression by affecting chromatin remodeling. They are encountered more often in CpG rich promoter regions, while they do not alter the DNA sequence itself. High-volume DNA methylation profiling methods exploit microarray technologies and provide a wealth of data. This data solicits rigorous, generic, yet ad-hoc adjusted, analytical pipelines for the meaningful systems-level analysis and interpretation. In this work, the Illumina Infinium HumanMethylation450 BeadChip platform is utilized in an epidemiological cohort from Italy in an effort to correlate interesting methylation patterns with breast cancer predisposition. The composite computational framework proposed here builds upon well established, analytical techniques, employed in mRNA analysis. For analysis purposes, the log2(ratio) of the intensities of a Methylated probe (IMeth) versus an UnMethylated probe (IUn-Meth), quoted as M-value, is used. Intensity based correction of the M-signal distribution is systematically applied, based upon Intensity-related error measures from quality controls samples incorporated in each chip. Thus, batch effects are corrected, while probe-specific, intensity-related, error measures are considered too. Robust, (based on bootstrapping) statistical measures measuring biological variation at the probe level, are derived in order to propose candidate biomarkers. To this end, coefficient variation measurements of DNA methylation between controls and cases are utilized, alleviating simultaneously the impact of technical variation, and are juxtaposed to classical statistical differential analysis measures.\",\"PeriodicalId\":330164,\"journal\":{\"name\":\"2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)\",\"volume\":\"39 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BIBE.2012.6399775\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2012.6399775","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

DNA甲基化事件代表表观遗传修饰,通过影响染色质重塑来调节基因表达。它们更常出现在富含CpG的启动子区域,而它们本身并不改变DNA序列。大容量DNA甲基化分析方法利用微阵列技术并提供丰富的数据。这些数据需要严格的、通用的、临时调整的分析管道来进行有意义的系统级分析和解释。在这项工作中,Illumina Infinium HumanMethylation450 BeadChip平台被用于意大利的一个流行病学队列,以努力将有趣的甲基化模式与乳腺癌易感性联系起来。本文提出的复合计算框架建立在mRNA分析中采用的成熟的分析技术之上。为了分析目的,使用甲基化探针(IMeth)与未甲基化探针(IUn-Meth)强度的log2(比率),引用为m值。基于强度的m信号分布的校正是系统地应用,基于强度相关的误差措施,从质量控制样品纳入每个芯片。因此,在校正批效应的同时,还考虑了特定于探针的、与强度相关的误差度量。为了提出候选生物标记物,推导了在探针水平上测量生物变异的鲁棒(基于自举)统计度量。为此,利用对照和病例之间DNA甲基化的系数变异测量,同时减轻技术变异的影响,并将其与经典的统计差异分析方法并列。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Analysis of DNA methylation epidemiological data through a generic composite statistical framework
DNA methylation events represent epigenetic heritable modifications that regulate gene expression by affecting chromatin remodeling. They are encountered more often in CpG rich promoter regions, while they do not alter the DNA sequence itself. High-volume DNA methylation profiling methods exploit microarray technologies and provide a wealth of data. This data solicits rigorous, generic, yet ad-hoc adjusted, analytical pipelines for the meaningful systems-level analysis and interpretation. In this work, the Illumina Infinium HumanMethylation450 BeadChip platform is utilized in an epidemiological cohort from Italy in an effort to correlate interesting methylation patterns with breast cancer predisposition. The composite computational framework proposed here builds upon well established, analytical techniques, employed in mRNA analysis. For analysis purposes, the log2(ratio) of the intensities of a Methylated probe (IMeth) versus an UnMethylated probe (IUn-Meth), quoted as M-value, is used. Intensity based correction of the M-signal distribution is systematically applied, based upon Intensity-related error measures from quality controls samples incorporated in each chip. Thus, batch effects are corrected, while probe-specific, intensity-related, error measures are considered too. Robust, (based on bootstrapping) statistical measures measuring biological variation at the probe level, are derived in order to propose candidate biomarkers. To this end, coefficient variation measurements of DNA methylation between controls and cases are utilized, alleviating simultaneously the impact of technical variation, and are juxtaposed to classical statistical differential analysis measures.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信