Yanlong Zhang, Yanming Fan, Fei Cheng, Dan Chen, Hualong Zhang
{"title":"基于机器学习的心衰诊断特征基因和亚型识别。","authors":"Yanlong Zhang, Yanming Fan, Fei Cheng, Dan Chen, Hualong Zhang","doi":"10.3389/fcvm.2025.1492192","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Heart failure (HF) is a multifaceted clinical condition, and our comprehension of its genetic pathogenesis continues to be significantly limited. Consequently, identifying specific genes for HF at the transcriptomic level may enhance early detection and allow for more targeted therapies for these individuals.</p><p><strong>Methods: </strong>HF datasets were acquired from the Gene Expression Omnibus (GEO) database (GSE57338), and through the application of bioinformatics and machine-learning algorithms. We identified four candidate genes (<i>FCN3</i>, <i>MNS1</i>, <i>SMOC2</i>, and <i>FREM1</i>) that may serve as potential diagnostics for HF. Furthermore, we validated the diagnostic value of these genes on additional GEO datasets (GSE21610 and GSE76701). In addition, we assessed the different subtypes of heart failure through unsupervised clustering, and investigations were conducted on the differences in the immunological microenvironment, improved functions, and pathways among these subtypes. Finally, a comprehensive analysis of the expression profile, prognostic value, and genetic and epigenetic alterations of four potential diagnostic candidate genes was performed based on The Cancer Genome Atlas pan-cancer database.</p><p><strong>Results: </strong>A total of 295 differential genes were identified in the HF dataset, and intersected with the blue module gene with the highest correlation to HF identified by weighted correlation network analysis (<i>r</i> = 0.72, <i>p</i> = 1.3 × 10<sup>-43</sup>), resulting in a total of 114 key HF genes. Furthermore, based on random forest, least absolute shrinkage and selection operator, and support vector machine algorithms, we finally identified four hub genes (<i>FCN3</i>, <i>FREM1</i>, <i>MNS1,</i> and <i>SMOC2</i>) that had good potential for diagnosis in HF (area under the curve > 0.7). Meanwhile, three subgroups for patients with HF were identified (C1, C2, and C3). Compared with the C1 and C2 groups, we eventually identified C3 as an immune subtype. Moreover, the pan-cancer study revealed that these four genes are closely associated with tumor development.</p><p><strong>Conclusions: </strong>Our research identified four unique genes (<i>FCN3</i>, <i>FREM1</i>, <i>MNS1</i>, and <i>SMOC2</i>), enhancing our comprehension of the causes of HF. This provides new diagnostic insights and potentially establishes a tailored approach for individualized HF treatment.</p>","PeriodicalId":12414,"journal":{"name":"Frontiers in Cardiovascular Medicine","volume":"12 ","pages":"1492192"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12034685/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identification of signature genes and subtypes for heart failure diagnosis based on machine learning.\",\"authors\":\"Yanlong Zhang, Yanming Fan, Fei Cheng, Dan Chen, Hualong Zhang\",\"doi\":\"10.3389/fcvm.2025.1492192\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Heart failure (HF) is a multifaceted clinical condition, and our comprehension of its genetic pathogenesis continues to be significantly limited. Consequently, identifying specific genes for HF at the transcriptomic level may enhance early detection and allow for more targeted therapies for these individuals.</p><p><strong>Methods: </strong>HF datasets were acquired from the Gene Expression Omnibus (GEO) database (GSE57338), and through the application of bioinformatics and machine-learning algorithms. We identified four candidate genes (<i>FCN3</i>, <i>MNS1</i>, <i>SMOC2</i>, and <i>FREM1</i>) that may serve as potential diagnostics for HF. Furthermore, we validated the diagnostic value of these genes on additional GEO datasets (GSE21610 and GSE76701). In addition, we assessed the different subtypes of heart failure through unsupervised clustering, and investigations were conducted on the differences in the immunological microenvironment, improved functions, and pathways among these subtypes. Finally, a comprehensive analysis of the expression profile, prognostic value, and genetic and epigenetic alterations of four potential diagnostic candidate genes was performed based on The Cancer Genome Atlas pan-cancer database.</p><p><strong>Results: </strong>A total of 295 differential genes were identified in the HF dataset, and intersected with the blue module gene with the highest correlation to HF identified by weighted correlation network analysis (<i>r</i> = 0.72, <i>p</i> = 1.3 × 10<sup>-43</sup>), resulting in a total of 114 key HF genes. Furthermore, based on random forest, least absolute shrinkage and selection operator, and support vector machine algorithms, we finally identified four hub genes (<i>FCN3</i>, <i>FREM1</i>, <i>MNS1,</i> and <i>SMOC2</i>) that had good potential for diagnosis in HF (area under the curve > 0.7). Meanwhile, three subgroups for patients with HF were identified (C1, C2, and C3). Compared with the C1 and C2 groups, we eventually identified C3 as an immune subtype. Moreover, the pan-cancer study revealed that these four genes are closely associated with tumor development.</p><p><strong>Conclusions: </strong>Our research identified four unique genes (<i>FCN3</i>, <i>FREM1</i>, <i>MNS1</i>, and <i>SMOC2</i>), enhancing our comprehension of the causes of HF. This provides new diagnostic insights and potentially establishes a tailored approach for individualized HF treatment.</p>\",\"PeriodicalId\":12414,\"journal\":{\"name\":\"Frontiers in Cardiovascular Medicine\",\"volume\":\"12 \",\"pages\":\"1492192\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12034685/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Cardiovascular Medicine\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3389/fcvm.2025.1492192\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"CARDIAC & CARDIOVASCULAR SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Cardiovascular Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fcvm.2025.1492192","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
摘要
背景:心力衰竭(HF)是一种多方面的临床疾病,我们对其遗传发病机制的理解仍然非常有限。因此,在转录组水平上识别HF的特定基因可能会提高早期发现,并允许对这些个体进行更有针对性的治疗。方法:利用生物信息学和机器学习算法,从基因表达综合数据库(GEO) (GSE57338)中获取HF数据集。我们确定了四个候选基因(FCN3、MNS1、SMOC2和FREM1),它们可能作为HF的潜在诊断基因。此外,我们在其他GEO数据集(GSE21610和GSE76701)上验证了这些基因的诊断价值。此外,我们通过无监督聚类评估了不同亚型的心力衰竭,并调查了这些亚型在免疫微环境、改善功能和途径方面的差异。最后,基于the Cancer Genome Atlas泛癌症数据库,对四个潜在诊断候选基因的表达谱、预后价值以及遗传和表观遗传改变进行了全面分析。结果:在HF数据集中共鉴定出295个差异基因,并与加权相关网络分析鉴定出与HF相关性最高的蓝色模块基因相交(r = 0.72, p = 1.3 × 10-43),共鉴定出114个HF关键基因。此外,基于随机森林、最小绝对收缩和选择算子以及支持向量机算法,我们最终确定了四个具有良好诊断HF潜力的枢纽基因(FCN3、FREM1、MNS1和SMOC2)(曲线下面积> 0.7)。同时,确定了HF患者的三个亚组(C1、C2和C3)。与C1和C2组相比,我们最终确定C3是一种免疫亚型。此外,泛癌症研究表明,这四个基因与肿瘤的发生密切相关。结论:我们的研究发现了四个独特的基因(FCN3、FREM1、MNS1和SMOC2),增强了我们对HF病因的理解。这提供了新的诊断见解,并有可能为个体化心衰治疗建立量身定制的方法。
Identification of signature genes and subtypes for heart failure diagnosis based on machine learning.
Background: Heart failure (HF) is a multifaceted clinical condition, and our comprehension of its genetic pathogenesis continues to be significantly limited. Consequently, identifying specific genes for HF at the transcriptomic level may enhance early detection and allow for more targeted therapies for these individuals.
Methods: HF datasets were acquired from the Gene Expression Omnibus (GEO) database (GSE57338), and through the application of bioinformatics and machine-learning algorithms. We identified four candidate genes (FCN3, MNS1, SMOC2, and FREM1) that may serve as potential diagnostics for HF. Furthermore, we validated the diagnostic value of these genes on additional GEO datasets (GSE21610 and GSE76701). In addition, we assessed the different subtypes of heart failure through unsupervised clustering, and investigations were conducted on the differences in the immunological microenvironment, improved functions, and pathways among these subtypes. Finally, a comprehensive analysis of the expression profile, prognostic value, and genetic and epigenetic alterations of four potential diagnostic candidate genes was performed based on The Cancer Genome Atlas pan-cancer database.
Results: A total of 295 differential genes were identified in the HF dataset, and intersected with the blue module gene with the highest correlation to HF identified by weighted correlation network analysis (r = 0.72, p = 1.3 × 10-43), resulting in a total of 114 key HF genes. Furthermore, based on random forest, least absolute shrinkage and selection operator, and support vector machine algorithms, we finally identified four hub genes (FCN3, FREM1, MNS1, and SMOC2) that had good potential for diagnosis in HF (area under the curve > 0.7). Meanwhile, three subgroups for patients with HF were identified (C1, C2, and C3). Compared with the C1 and C2 groups, we eventually identified C3 as an immune subtype. Moreover, the pan-cancer study revealed that these four genes are closely associated with tumor development.
Conclusions: Our research identified four unique genes (FCN3, FREM1, MNS1, and SMOC2), enhancing our comprehension of the causes of HF. This provides new diagnostic insights and potentially establishes a tailored approach for individualized HF treatment.
期刊介绍:
Frontiers? Which frontiers? Where exactly are the frontiers of cardiovascular medicine? And who should be defining these frontiers?
At Frontiers in Cardiovascular Medicine we believe it is worth being curious to foresee and explore beyond the current frontiers. In other words, we would like, through the articles published by our community journal Frontiers in Cardiovascular Medicine, to anticipate the future of cardiovascular medicine, and thus better prevent cardiovascular disorders and improve therapeutic options and outcomes of our patients.