Identification of Whole-Blood DNA Methylation Signatures and Rules Associated with COVID-19 Severity

Fei Yuan, JingXin Ren, HuiPing Liao, Wei Guo, Lei Chen, KaiYan Feng, Tao Huang, Yu-Dong Cai
{"title":"Identification of Whole-Blood DNA Methylation Signatures and Rules Associated with COVID-19 Severity","authors":"Fei Yuan, JingXin Ren, HuiPing Liao, Wei Guo, Lei Chen, KaiYan Feng, Tao Huang, Yu-Dong Cai","doi":"10.31083/j.fbl2811284","DOIUrl":null,"url":null,"abstract":"Background: Different severities of coronavirus disease 2019 (COVID-19) cause different levels of respiratory symptoms and systemic inflammation. DNA methylation, a heritable epigenetic process, also shows differential changes in different severities of COVID-19. DNA methylation is involved in regulating the activity of various immune cells and influences immune pathways associated with viral infections. It may also be involved in regulating the expression of genes associated with the progression of COVID-19. Methods: In this study, a sophisticated machine-learning workflow was designed to analyze whole-blood DNA methylation data from COVID-19 patients with different severities versus healthy controls. We aimed to understand the role of DNA methylation in the development of COVID-19. The sample set contained 101 negative controls, 360 mildly infected individuals, and 113 severely infected individuals. Each sample involved 768,067 methylation sites. Three feature-ranking algorithms (least absolute shrinkage and selection operator (LASSO), light gradient-boosting machine (LightGBM), and Monte Carlo feature selection (MCFS)) were used to rank and filter out sites highly correlated with COVID-19. Based on the obtained ranking results, a high-performance classification model was constructed by combining the feature incremental approach with four classification algorithms (decision tree (DT), k-nearest neighbor (kNN), random forest (RF), and support vector machine (SVM)). Results: Some essential methylation sites and decision rules were obtained. Conclusions: The genes (IGSF6, CD38, and TLR2) of some essential methylation sites were confirmed to play important roles in the immune system.","PeriodicalId":12366,"journal":{"name":"Frontiers in bioscience","volume":"158 1‐8","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in bioscience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31083/j.fbl2811284","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Different severities of coronavirus disease 2019 (COVID-19) cause different levels of respiratory symptoms and systemic inflammation. DNA methylation, a heritable epigenetic process, also shows differential changes in different severities of COVID-19. DNA methylation is involved in regulating the activity of various immune cells and influences immune pathways associated with viral infections. It may also be involved in regulating the expression of genes associated with the progression of COVID-19. Methods: In this study, a sophisticated machine-learning workflow was designed to analyze whole-blood DNA methylation data from COVID-19 patients with different severities versus healthy controls. We aimed to understand the role of DNA methylation in the development of COVID-19. The sample set contained 101 negative controls, 360 mildly infected individuals, and 113 severely infected individuals. Each sample involved 768,067 methylation sites. Three feature-ranking algorithms (least absolute shrinkage and selection operator (LASSO), light gradient-boosting machine (LightGBM), and Monte Carlo feature selection (MCFS)) were used to rank and filter out sites highly correlated with COVID-19. Based on the obtained ranking results, a high-performance classification model was constructed by combining the feature incremental approach with four classification algorithms (decision tree (DT), k-nearest neighbor (kNN), random forest (RF), and support vector machine (SVM)). Results: Some essential methylation sites and decision rules were obtained. Conclusions: The genes (IGSF6, CD38, and TLR2) of some essential methylation sites were confirmed to play important roles in the immune system.
与COVID-19严重程度相关的全血DNA甲基化特征和规则的鉴定
背景:不同程度的冠状病毒病2019 (COVID-19)导致不同程度的呼吸道症状和全身性炎症。DNA甲基化是一种可遗传的表观遗传过程,在不同严重程度的COVID-19中也表现出不同的变化。DNA甲基化参与调节各种免疫细胞的活性,并影响与病毒感染相关的免疫途径。它还可能参与调节与COVID-19进展相关的基因表达。方法:在本研究中,设计了一个复杂的机器学习工作流程来分析不同严重程度的COVID-19患者与健康对照组的全血DNA甲基化数据。我们的目的是了解DNA甲基化在COVID-19发展中的作用。对照组101例,轻度感染者360例,重度感染者113例。每个样本涉及768,067个甲基化位点。使用最小绝对收缩和选择算子(LASSO)、光梯度增强机(LightGBM)和蒙特卡罗特征选择(MCFS)三种特征排序算法对与COVID-19高度相关的站点进行排序和过滤。基于获得的分类结果,将特征增量法与决策树(DT)、k近邻(kNN)、随机森林(RF)和支持向量机(SVM)四种分类算法相结合,构建了高性能的分类模型。结果:获得了一些必要的甲基化位点和判定规则。结论:一些必需甲基化位点的基因(IGSF6、CD38和TLR2)已被证实在免疫系统中发挥重要作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信