Identifying core genes in keloid and investigating immune infiltration and pan-cancer associations using eQTL and machine learning.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Bioinformatics advances Pub Date : 2025-04-26 eCollection Date: 2025-01-01 DOI:10.1093/bioadv/vbaf076
Xiaoyuan He, Yang Song
{"title":"Identifying core genes in keloid and investigating immune infiltration and pan-cancer associations using eQTL and machine learning.","authors":"Xiaoyuan He, Yang Song","doi":"10.1093/bioadv/vbaf076","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Keloid is a fibroproliferative skin disorder characterized by excessive fibroblast proliferation and abnormal extracellular matrix accumulation. It manifests as continuous growth, redness, itching, and pain, with a high recurrence rate. The pathogenesis of keloid is complex, with genetics and gene mutations increasingly recognized as critical risk factors. This condition exhibits familial predisposition and clustering, with individuals of darker skin tones at greater risk. To elucidate the genetic factors underlying keloid development, this study integrates bioinformatics and Mendelian randomization (MR) approaches to identify core genes associated with keloid, providing novel insights into its pathogenesis, treatment, and prognosis.</p><p><strong>Results: </strong>Bioinformatics and Mendelian randomization analyses identified two intersecting genes, CCND2 and KLF4, as core genes associated with keloid. MR analysis revealed that CCND2 is causally associated with keloid [inverse variance weighted (IVW) odds ratio (OR): 1.410; 95% confidence interval (CI): 1.001-1.985, <i>P</i> = .049], indicating it is a risk factor, while KLF4 is inversely associated with keloid (IVW OR: 0.492; 95% CI: 0.290-0.835, <i>P</i> = .009). Both intersecting genes exhibit a causal relationship with keloid, identifying them as two core genes. Specifically, CCND2 is recognized as a risk factor for keloid, while KLF4 functions as a protective factor against keloid formation. Validation analyses were conducted on these two core genes, revealing significant differences in KLF4 expression within the validation cohort.</p><p><strong>Availability and implementation: </strong>Firstly, bioinformatics analysis identified differentially expressed genes (DEGs) from the keloid GEO datasets. Secondly, MR was applied to eQTL and keloid GWAS datasets to identify candidate genes. Overlapping genes were derived by intersecting DEGs with MR candidate genes. Causal relationships between overlapping genes and keloids were analyzed using five MR methods, identifying core genes significantly associated with keloid pathogenesis. Cochran's Q test and MR-Egger intercept analysis evaluated heterogeneity and pleiotropy in MR results. GO, KEGG, and GSEA enrichment analyses were conducted to explore core gene functions. Finally, validation and TCGA pan-cancer analyses were conducted on the core genes.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf076"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12145169/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Keloid is a fibroproliferative skin disorder characterized by excessive fibroblast proliferation and abnormal extracellular matrix accumulation. It manifests as continuous growth, redness, itching, and pain, with a high recurrence rate. The pathogenesis of keloid is complex, with genetics and gene mutations increasingly recognized as critical risk factors. This condition exhibits familial predisposition and clustering, with individuals of darker skin tones at greater risk. To elucidate the genetic factors underlying keloid development, this study integrates bioinformatics and Mendelian randomization (MR) approaches to identify core genes associated with keloid, providing novel insights into its pathogenesis, treatment, and prognosis.

Results: Bioinformatics and Mendelian randomization analyses identified two intersecting genes, CCND2 and KLF4, as core genes associated with keloid. MR analysis revealed that CCND2 is causally associated with keloid [inverse variance weighted (IVW) odds ratio (OR): 1.410; 95% confidence interval (CI): 1.001-1.985, P = .049], indicating it is a risk factor, while KLF4 is inversely associated with keloid (IVW OR: 0.492; 95% CI: 0.290-0.835, P = .009). Both intersecting genes exhibit a causal relationship with keloid, identifying them as two core genes. Specifically, CCND2 is recognized as a risk factor for keloid, while KLF4 functions as a protective factor against keloid formation. Validation analyses were conducted on these two core genes, revealing significant differences in KLF4 expression within the validation cohort.

Availability and implementation: Firstly, bioinformatics analysis identified differentially expressed genes (DEGs) from the keloid GEO datasets. Secondly, MR was applied to eQTL and keloid GWAS datasets to identify candidate genes. Overlapping genes were derived by intersecting DEGs with MR candidate genes. Causal relationships between overlapping genes and keloids were analyzed using five MR methods, identifying core genes significantly associated with keloid pathogenesis. Cochran's Q test and MR-Egger intercept analysis evaluated heterogeneity and pleiotropy in MR results. GO, KEGG, and GSEA enrichment analyses were conducted to explore core gene functions. Finally, validation and TCGA pan-cancer analyses were conducted on the core genes.

利用eQTL和机器学习识别瘢痕疙瘩的核心基因,研究免疫浸润和泛癌症关联。
动机:瘢痕疙瘩是一种纤维增生性皮肤疾病,其特征是成纤维细胞过度增殖和细胞外基质异常积累。表现为持续生长、发红、瘙痒、疼痛,复发率高。瘢痕疙瘩的发病机制是复杂的,遗传和基因突变越来越被认为是关键的危险因素。这种情况表现出家族性易感性和聚集性,肤色越深的个体风险越大。为了阐明瘢痕疙瘩形成的遗传因素,本研究结合生物信息学和孟德尔随机化(MR)方法来识别与瘢痕疙瘩相关的核心基因,为其发病机制、治疗和预后提供新的见解。结果:生物信息学和孟德尔随机化分析发现两个交叉基因CCND2和KLF4是与瘢痕疙瘩相关的核心基因。MR分析显示CCND2与瘢痕疙瘩有因果关系[逆方差加权(IVW)比值比(OR): 1.410;95%置信区间(CI): 1.001 ~ 1.985, P =。[049],提示KLF4与瘢痕疙瘩呈负相关(IVW OR: 0.492;95% ci: 0.290-0.835, p = 0.009)。这两个交叉的基因都与瘢痕疙瘩有因果关系,确定它们是两个核心基因。具体而言,CCND2被认为是瘢痕疙瘩的危险因素,而KLF4则是防止瘢痕疙瘩形成的保护因素。对这两个核心基因进行验证分析,发现验证队列中KLF4的表达存在显著差异。可用性和实施:首先,生物信息学分析从瘢痕疙瘩GEO数据集中确定了差异表达基因(DEGs)。其次,将MR应用于eQTL和瘢痕疙瘩GWAS数据集来鉴定候选基因。重叠基因是通过deg与MR候选基因相交得到的。采用5种磁共振方法分析重叠基因与瘢痕疙瘩的因果关系,确定与瘢痕疙瘩发病机制显著相关的核心基因。Cochran’s Q检验和MR- egger截距分析评估了MR结果的异质性和多效性。通过GO、KEGG和GSEA富集分析来探索核心基因功能。最后对核心基因进行验证和TCGA泛癌分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信