Identification of Biomarkers Associated with Heart Failure Caused by Idiopathic Dilated Cardiomyopathy Using WGCNA and Machine Learning Algorithms.

IF 2.6 4区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Mengyi Sun, Linping Li
{"title":"Identification of Biomarkers Associated with Heart Failure Caused by Idiopathic Dilated Cardiomyopathy Using WGCNA and Machine Learning Algorithms.","authors":"Mengyi Sun,&nbsp;Linping Li","doi":"10.1155/2023/2250772","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The genetic factors and pathogenesis of idiopathic dilated cardiomyopathy-induced heart failure (IDCM-HF) have not been understood thoroughly; there is a lack of specific diagnostic markers and treatment methods for the disease. Hence, we aimed to identify the mechanisms of action at the molecular level and potential molecular markers for this disease.</p><p><strong>Methods: </strong>Gene expression profiles of IDCM-HF and non-heart failure (NF) specimens were acquired from the database of Gene Expression Omnibus (GEO). We then identified the differentially expressed genes (DEGs) and analyzed their functions and related pathways by using \"Metascape\". Weighted gene co-expression network analysis (WGCNA) was utilized to search for key module genes. Candidate genes were identified by intersecting the key module genes identified via WGCNA with DEGs and further screened via the support vector machine-recursive feature elimination (SVM-RFE) method and the least absolute shrinkage and selection operator (LASSO) algorithm. At last, the biomarkers were validated and evaluated the diagnostic efficacy by the area under curve (AUC) value and further confirmed the differential expression in the IDCM-HF and NF groups using an external database.</p><p><strong>Results: </strong>We detected 490 genes exhibiting differential expression between IDCM-HF and NF specimens from the GSE57338 dataset, with most of them being concentrated in the extracellular matrix (ECM) of cells related to biological processes and pathways. After screening, 13 candidate genes were identified. Aquaporin 3 (AQP3) and cytochrome P450 2J2 (CYP2J2) showed high diagnostic efficacy in the GSE57338 and GSE6406 datasets, respectively. In comparison to the NF group, AQP3 was significantly down-regulated in the IDCM-HF group, while CYP2J2 was significantly up-regulated.</p><p><strong>Conclusion: </strong>As far as we know, this is the first study that combines WGCNA and machine learning algorithms to screen for potential biomarkers of IDCM-HF. Our findings suggest that AQP3 and CYP2J2 could be used as novel diagnostic markers and treatment targets of IDCM-HF.</p>","PeriodicalId":13988,"journal":{"name":"International Journal of Genomics","volume":"2023 ","pages":"2250772"},"PeriodicalIF":2.6000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10154102/pdf/","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1155/2023/2250772","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 2

Abstract

Background: The genetic factors and pathogenesis of idiopathic dilated cardiomyopathy-induced heart failure (IDCM-HF) have not been understood thoroughly; there is a lack of specific diagnostic markers and treatment methods for the disease. Hence, we aimed to identify the mechanisms of action at the molecular level and potential molecular markers for this disease.

Methods: Gene expression profiles of IDCM-HF and non-heart failure (NF) specimens were acquired from the database of Gene Expression Omnibus (GEO). We then identified the differentially expressed genes (DEGs) and analyzed their functions and related pathways by using "Metascape". Weighted gene co-expression network analysis (WGCNA) was utilized to search for key module genes. Candidate genes were identified by intersecting the key module genes identified via WGCNA with DEGs and further screened via the support vector machine-recursive feature elimination (SVM-RFE) method and the least absolute shrinkage and selection operator (LASSO) algorithm. At last, the biomarkers were validated and evaluated the diagnostic efficacy by the area under curve (AUC) value and further confirmed the differential expression in the IDCM-HF and NF groups using an external database.

Results: We detected 490 genes exhibiting differential expression between IDCM-HF and NF specimens from the GSE57338 dataset, with most of them being concentrated in the extracellular matrix (ECM) of cells related to biological processes and pathways. After screening, 13 candidate genes were identified. Aquaporin 3 (AQP3) and cytochrome P450 2J2 (CYP2J2) showed high diagnostic efficacy in the GSE57338 and GSE6406 datasets, respectively. In comparison to the NF group, AQP3 was significantly down-regulated in the IDCM-HF group, while CYP2J2 was significantly up-regulated.

Conclusion: As far as we know, this is the first study that combines WGCNA and machine learning algorithms to screen for potential biomarkers of IDCM-HF. Our findings suggest that AQP3 and CYP2J2 could be used as novel diagnostic markers and treatment targets of IDCM-HF.

Abstract Image

Abstract Image

Abstract Image

使用WGCNA和机器学习算法识别特发性扩张型心肌病引起的心力衰竭相关生物标志物
背景:特发性扩张型心肌病诱发心力衰竭(IDCM-HF)的遗传因素和发病机制尚未完全了解;目前缺乏特异性的诊断标记物和治疗方法。因此,我们的目的是在分子水平上确定作用机制和潜在的分子标记。方法:从Gene expression Omnibus (GEO)数据库中获取IDCM-HF和非心力衰竭(NF)标本的基因表达谱。利用“meta - scape”软件对差异表达基因(differential expression genes, deg)进行鉴定,分析其功能和相关途径。采用加权基因共表达网络分析(Weighted gene co-expression network analysis, WGCNA)搜索关键模块基因。将WGCNA识别出的关键模块基因与deg相交,确定候选基因,并通过支持向量机递归特征消除(SVM-RFE)方法和最小绝对收缩选择算子(LASSO)算法进行筛选。最后,通过曲线下面积(area under curve, AUC)值对生物标志物进行验证和诊断效能评估,并利用外部数据库进一步确认IDCM-HF组和NF组的差异表达。结果:我们从GSE57338数据集中检测到490个基因在IDCM-HF和NF样本中表现出差异表达,其中大多数基因集中在与生物过程和途径相关的细胞外基质(ECM)中。经筛选,共鉴定出13个候选基因。水通道蛋白3 (AQP3)和细胞色素P450 2J2 (CYP2J2)分别在GSE57338和GSE6406数据集中显示出较高的诊断效能。与NF组比较,IDCM-HF组AQP3显著下调,CYP2J2显著上调。结论:据我们所知,这是第一个结合WGCNA和机器学习算法筛选IDCM-HF潜在生物标志物的研究。提示AQP3和CYP2J2可作为IDCM-HF新的诊断标记物和治疗靶点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Genomics
International Journal of Genomics BIOCHEMISTRY & MOLECULAR BIOLOGY-BIOTECHNOLOGY & APPLIED MICROBIOLOGY
CiteScore
5.40
自引率
0.00%
发文量
33
审稿时长
17 weeks
期刊介绍: International Journal of Genomics is a peer-reviewed, Open Access journal that publishes research articles as well as review articles in all areas of genome-scale analysis. Topics covered by the journal include, but are not limited to: bioinformatics, clinical genomics, disease genomics, epigenomics, evolutionary genomics, functional genomics, genome engineering, and synthetic genomics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信