{"title":"Identifying core genes in keloid and investigating immune infiltration and pan-cancer associations using eQTL and machine learning.","authors":"Xiaoyuan He, Yang Song","doi":"10.1093/bioadv/vbaf076","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Keloid is a fibroproliferative skin disorder characterized by excessive fibroblast proliferation and abnormal extracellular matrix accumulation. It manifests as continuous growth, redness, itching, and pain, with a high recurrence rate. The pathogenesis of keloid is complex, with genetics and gene mutations increasingly recognized as critical risk factors. This condition exhibits familial predisposition and clustering, with individuals of darker skin tones at greater risk. To elucidate the genetic factors underlying keloid development, this study integrates bioinformatics and Mendelian randomization (MR) approaches to identify core genes associated with keloid, providing novel insights into its pathogenesis, treatment, and prognosis.</p><p><strong>Results: </strong>Bioinformatics and Mendelian randomization analyses identified two intersecting genes, CCND2 and KLF4, as core genes associated with keloid. MR analysis revealed that CCND2 is causally associated with keloid [inverse variance weighted (IVW) odds ratio (OR): 1.410; 95% confidence interval (CI): 1.001-1.985, <i>P</i> = .049], indicating it is a risk factor, while KLF4 is inversely associated with keloid (IVW OR: 0.492; 95% CI: 0.290-0.835, <i>P</i> = .009). Both intersecting genes exhibit a causal relationship with keloid, identifying them as two core genes. Specifically, CCND2 is recognized as a risk factor for keloid, while KLF4 functions as a protective factor against keloid formation. Validation analyses were conducted on these two core genes, revealing significant differences in KLF4 expression within the validation cohort.</p><p><strong>Availability and implementation: </strong>Firstly, bioinformatics analysis identified differentially expressed genes (DEGs) from the keloid GEO datasets. Secondly, MR was applied to eQTL and keloid GWAS datasets to identify candidate genes. Overlapping genes were derived by intersecting DEGs with MR candidate genes. Causal relationships between overlapping genes and keloids were analyzed using five MR methods, identifying core genes significantly associated with keloid pathogenesis. Cochran's Q test and MR-Egger intercept analysis evaluated heterogeneity and pleiotropy in MR results. GO, KEGG, and GSEA enrichment analyses were conducted to explore core gene functions. Finally, validation and TCGA pan-cancer analyses were conducted on the core genes.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf076"},"PeriodicalIF":2.8000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12145169/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: Keloid is a fibroproliferative skin disorder characterized by excessive fibroblast proliferation and abnormal extracellular matrix accumulation. It manifests as continuous growth, redness, itching, and pain, with a high recurrence rate. The pathogenesis of keloid is complex, with genetics and gene mutations increasingly recognized as critical risk factors. This condition exhibits familial predisposition and clustering, with individuals of darker skin tones at greater risk. To elucidate the genetic factors underlying keloid development, this study integrates bioinformatics and Mendelian randomization (MR) approaches to identify core genes associated with keloid, providing novel insights into its pathogenesis, treatment, and prognosis.
Results: Bioinformatics and Mendelian randomization analyses identified two intersecting genes, CCND2 and KLF4, as core genes associated with keloid. MR analysis revealed that CCND2 is causally associated with keloid [inverse variance weighted (IVW) odds ratio (OR): 1.410; 95% confidence interval (CI): 1.001-1.985, P = .049], indicating it is a risk factor, while KLF4 is inversely associated with keloid (IVW OR: 0.492; 95% CI: 0.290-0.835, P = .009). Both intersecting genes exhibit a causal relationship with keloid, identifying them as two core genes. Specifically, CCND2 is recognized as a risk factor for keloid, while KLF4 functions as a protective factor against keloid formation. Validation analyses were conducted on these two core genes, revealing significant differences in KLF4 expression within the validation cohort.
Availability and implementation: Firstly, bioinformatics analysis identified differentially expressed genes (DEGs) from the keloid GEO datasets. Secondly, MR was applied to eQTL and keloid GWAS datasets to identify candidate genes. Overlapping genes were derived by intersecting DEGs with MR candidate genes. Causal relationships between overlapping genes and keloids were analyzed using five MR methods, identifying core genes significantly associated with keloid pathogenesis. Cochran's Q test and MR-Egger intercept analysis evaluated heterogeneity and pleiotropy in MR results. GO, KEGG, and GSEA enrichment analyses were conducted to explore core gene functions. Finally, validation and TCGA pan-cancer analyses were conducted on the core genes.