Methylation-driven model for analysis of dinucleotide evolution in genomes.

Q1 Mathematics
Jian-Hong Sun, Shi-Meng Ai, Shu-Qun Liu
{"title":"Methylation-driven model for analysis of dinucleotide evolution in genomes.","authors":"Jian-Hong Sun,&nbsp;Shi-Meng Ai,&nbsp;Shu-Qun Liu","doi":"10.1186/s12976-020-00122-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>CpGs, the major methylation sites in vertebrate genomes, exhibit a high mutation rate from the methylated form of CpG to TpG/CpA and, therefore, influence the evolution of genome composition. However, the quantitative effects of CpG to TpG/CpA mutations on the evolution of genome composition in terms of the dinucleotide frequencies/proportions remain poorly understood.</p><p><strong>Results: </strong>Based on the neutral theory of molecular evolution, we propose a methylation-driven model (MDM) that allows predicting the changes in frequencies/proportions of the 16 dinucleotides and in the GC content of a genome given the known number of CpG to TpG/CpA mutations. The application of MDM to the 10 published vertebrate genomes shows that, for most of the 16 dinucleotides and the GC content, a good consistency is achieved between the predicted and observed trends of changes in the frequencies and content relative to the assumed initial values, and that the model performs better on the mammalian genomes than it does on the lower-vertebrate genomes. The model's performance depends on the genome composition characteristics, the assumed initial state of the genome, and the estimated parameters, one or more of which are responsible for the different application effects on the mammalian and lower-vertebrate genomes and for the large deviations of the predicted frequencies of a few dinucleotides from their observed frequencies.</p><p><strong>Conclusions: </strong>Despite certain limitations of the current model, the successful application to the higher-vertebrate (mammalian) genomes witnesses its potential for facilitating studies aimed at understanding the role of methylation in driving the evolution of genome dinucleotide composition.</p>","PeriodicalId":51195,"journal":{"name":"Theoretical Biology and Medical Modelling","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2020-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1186/s12976-020-00122-x","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Theoretical Biology and Medical Modelling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1186/s12976-020-00122-x","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Mathematics","Score":null,"Total":0}
引用次数: 1

Abstract

Background: CpGs, the major methylation sites in vertebrate genomes, exhibit a high mutation rate from the methylated form of CpG to TpG/CpA and, therefore, influence the evolution of genome composition. However, the quantitative effects of CpG to TpG/CpA mutations on the evolution of genome composition in terms of the dinucleotide frequencies/proportions remain poorly understood.

Results: Based on the neutral theory of molecular evolution, we propose a methylation-driven model (MDM) that allows predicting the changes in frequencies/proportions of the 16 dinucleotides and in the GC content of a genome given the known number of CpG to TpG/CpA mutations. The application of MDM to the 10 published vertebrate genomes shows that, for most of the 16 dinucleotides and the GC content, a good consistency is achieved between the predicted and observed trends of changes in the frequencies and content relative to the assumed initial values, and that the model performs better on the mammalian genomes than it does on the lower-vertebrate genomes. The model's performance depends on the genome composition characteristics, the assumed initial state of the genome, and the estimated parameters, one or more of which are responsible for the different application effects on the mammalian and lower-vertebrate genomes and for the large deviations of the predicted frequencies of a few dinucleotides from their observed frequencies.

Conclusions: Despite certain limitations of the current model, the successful application to the higher-vertebrate (mammalian) genomes witnesses its potential for facilitating studies aimed at understanding the role of methylation in driving the evolution of genome dinucleotide composition.

Abstract Image

Abstract Image

基因组中二核苷酸进化分析的甲基化驱动模型。
背景:CpGs是脊椎动物基因组中主要的甲基化位点,从CpG的甲基化形式到TpG/CpA具有很高的突变率,因此影响着基因组组成的进化。然而,CpG到TpG/CpA突变对二核苷酸频率/比例的基因组组成进化的定量影响仍然知之甚少。结果:基于分子进化的中性理论,我们提出了一个甲基化驱动模型(MDM),该模型可以在已知CpG到TpG/CpA突变数量的情况下预测基因组中16个二核苷酸的频率/比例和GC含量的变化。MDM对10个已发表的脊椎动物基因组的应用表明,对于16种二核苷酸和GC含量中的大多数,相对于假设的初始值,预测的频率和含量的变化趋势与观测到的趋势之间实现了很好的一致性,并且该模型在哺乳动物基因组上的表现优于在低等脊椎动物基因组上的表现。该模型的性能取决于基因组组成特征、假设的基因组初始状态和估计的参数,其中一个或多个参数对哺乳动物和低等脊椎动物基因组的不同应用效果负责,并且导致一些二核苷酸的预测频率与观测频率存在较大偏差。结论:尽管目前的模型有一定的局限性,但在高等脊椎动物(哺乳动物)基因组中的成功应用证明了其促进旨在理解甲基化在驱动基因组二核苷酸组成进化中的作用的研究的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Theoretical Biology and Medical Modelling
Theoretical Biology and Medical Modelling MATHEMATICAL & COMPUTATIONAL BIOLOGY-
自引率
0.00%
发文量
0
审稿时长
6-12 weeks
期刊介绍: Theoretical Biology and Medical Modelling is an open access peer-reviewed journal adopting a broad definition of "biology" and focusing on theoretical ideas and models associated with developments in biology and medicine. Mathematicians, biologists and clinicians of various specialisms, philosophers and historians of science are all contributing to the emergence of novel concepts in an age of systems biology, bioinformatics and computer modelling. This is the field in which Theoretical Biology and Medical Modelling operates. We welcome submissions that are technically sound and offering either improved understanding in biology and medicine or progress in theory or method.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信