BayesAge: A maximum likelihood algorithm to predict epigenetic age.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Frontiers in bioinformatics Pub Date : 2024-04-04 eCollection Date: 2024-01-01 DOI:10.3389/fbinf.2024.1329144
Lajoyce Mboning, Liudmilla Rubbi, Michael Thompson, Louis-S Bouchard, Matteo Pellegrini
{"title":"BayesAge: A maximum likelihood algorithm to predict epigenetic age.","authors":"Lajoyce Mboning, Liudmilla Rubbi, Michael Thompson, Louis-S Bouchard, Matteo Pellegrini","doi":"10.3389/fbinf.2024.1329144","DOIUrl":null,"url":null,"abstract":"<p><p><b>Introduction:</b> DNA methylation, specifically the formation of 5-methylcytosine at the C5 position of cytosine, undergoes reproducible changes as organisms age, establishing it as a significant biomarker in aging studies. Epigenetic clocks, which integrate methylation patterns to predict age, often employ linear models based on penalized regression, yet they encounter challenges in handling missing data, count-based bisulfite sequence data, and interpretation. <b>Methods:</b> To address these limitations, we introduce BayesAge, an extension of the scAge methodology originally designed for single-cell DNA methylation analysis. BayesAge employs maximum likelihood estimation (MLE) for age inference, models count data using binomial distributions, and incorporates LOWESS smoothing to capture non-linear methylation-age dynamics. This approach is tailored for bulk bisulfite sequencing datasets. <b>Results:</b> BayesAge demonstrates superior performance compared to scAge. Notably, its age residuals exhibit no age association, offering a less biased representation of epigenetic age variation across populations. Furthermore, BayesAge facilitates the estimation of error bounds on age inference. When applied to down-sampled data, BayesAge achieves a higher coefficient of determination between predicted and actual ages compared to both scAge and penalized regression. <b>Discussion:</b> BayesAge presents a promising advancement in epigenetic age prediction, addressing key challenges encountered by existing models. By integrating robust statistical techniques and tailored methodologies for count-based data, BayesAge offers improved accuracy and interpretability in predicting age from bulk bisulfite sequencing datasets. Its ability to estimate error bounds enhances the reliability of age inference, thereby contributing to a more comprehensive understanding of epigenetic aging processes.</p>","PeriodicalId":73066,"journal":{"name":"Frontiers in bioinformatics","volume":"4 ","pages":"1329144"},"PeriodicalIF":2.8000,"publicationDate":"2024-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11024280/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fbinf.2024.1329144","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: DNA methylation, specifically the formation of 5-methylcytosine at the C5 position of cytosine, undergoes reproducible changes as organisms age, establishing it as a significant biomarker in aging studies. Epigenetic clocks, which integrate methylation patterns to predict age, often employ linear models based on penalized regression, yet they encounter challenges in handling missing data, count-based bisulfite sequence data, and interpretation. Methods: To address these limitations, we introduce BayesAge, an extension of the scAge methodology originally designed for single-cell DNA methylation analysis. BayesAge employs maximum likelihood estimation (MLE) for age inference, models count data using binomial distributions, and incorporates LOWESS smoothing to capture non-linear methylation-age dynamics. This approach is tailored for bulk bisulfite sequencing datasets. Results: BayesAge demonstrates superior performance compared to scAge. Notably, its age residuals exhibit no age association, offering a less biased representation of epigenetic age variation across populations. Furthermore, BayesAge facilitates the estimation of error bounds on age inference. When applied to down-sampled data, BayesAge achieves a higher coefficient of determination between predicted and actual ages compared to both scAge and penalized regression. Discussion: BayesAge presents a promising advancement in epigenetic age prediction, addressing key challenges encountered by existing models. By integrating robust statistical techniques and tailored methodologies for count-based data, BayesAge offers improved accuracy and interpretability in predicting age from bulk bisulfite sequencing datasets. Its ability to estimate error bounds enhances the reliability of age inference, thereby contributing to a more comprehensive understanding of epigenetic aging processes.

BayesAge:预测表观遗传年龄的最大似然法算法。
导言:DNA 甲基化,特别是胞嘧啶 C5 位上 5-甲基胞嘧啶的形成,会随着生物体年龄的增长而发生可重复的变化,从而成为衰老研究中的重要生物标志物。表观遗传时钟整合了甲基化模式来预测年龄,通常采用基于惩罚回归的线性模型,但在处理缺失数据、基于计数的亚硫酸氢盐序列数据和解释方面遇到了挑战。方法:为了解决这些局限性,我们引入了贝叶斯年龄(BayesAge),它是最初为单细胞 DNA 甲基化分析而设计的 scAge 方法的扩展。BayesAge 采用最大似然估计(MLE)进行年龄推断,使用二项分布建立计数数据模型,并结合 LOWESS 平滑法捕捉甲基化-年龄的非线性动态变化。这种方法专为大量亚硫酸氢盐测序数据集定制。结果BayesAge 的性能优于 scAge。值得注意的是,它的年龄残差与年龄无关,对不同人群的表观遗传年龄变化的描述偏差较小。此外,BayesAge 还有助于估计年龄推断的误差范围。与 scAge 和惩罚回归相比,BayesAge 在预测年龄和实际年龄之间取得了更高的决定系数。讨论贝叶斯年龄在表观遗传年龄预测方面取得了可喜的进步,解决了现有模型所面临的关键挑战。通过整合稳健的统计技术和为基于计数的数据定制的方法,BayesAge 提高了从大量亚硫酸氢盐测序数据集预测年龄的准确性和可解释性。它估计误差范围的能力增强了年龄推断的可靠性,从而有助于更全面地了解表观遗传衰老过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
2.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信