基于综合生物信息学分析和机器学习算法的瘢痕疙瘩疾病潜在生物标志物和机制鉴定。

IF 2.1 4区 医学 Q3 GENETICS & HEREDITY
Bowen Zheng, Jianxiong Qiao, Xiaoping Yu, Hanghang Zhou, Anqi Wang, Xuanfen Zhang
{"title":"基于综合生物信息学分析和机器学习算法的瘢痕疙瘩疾病潜在生物标志物和机制鉴定。","authors":"Bowen Zheng, Jianxiong Qiao, Xiaoping Yu, Hanghang Zhou, Anqi Wang, Xuanfen Zhang","doi":"10.1186/s12920-025-02174-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Keloid disorder (KD) encompasses a spectrum of fibroproliferative dermal conditions, the pathogenesis remains complex and incompletely understood. This study sought to identify biomarkers and potential therapeutic targets for KD through an integrative bioinformatics approach and machine learning analysis of RNA sequencing data.</p><p><strong>Methods: </strong>RNA sequencing was performed on skin tissue samples from 13 patients with KD and 14 healthy controls. Using weighted gene co-expression network analysis and differential expression analysis revealed differentially expressed key module genes, and the CytoHubba plugin identified candidate genes. Subsequently analyzed using least absolute shrinkage and selection operator (LASSO) and support vector machine recursive feature elimination (SVM-RFE) methods to pinpoint feature genes associated with KD. Following this, biomarkers were determined through expression level validation, enrichment analysis, and immune infiltration analysis.</p><p><strong>Results: </strong>A total of 420 differentially expressed key module genes were identified, and the top 10 genes with DMNC values were selected as candidate genes. Five feature genes were selected through LASSO and SVM-RFE, with NID2, MFAP2, COL8A1, and P4HA3 showing significant expression differences between KD and control samples, along with consistent expression patterns across datasets, identified as potential biomarkers. These four biomarkers were proved to possess high diagnostic potential, and they were found to exhibit significant positive correlations with one another. Functional enrichment analysis indicated that the primary KEGG pathways associated with these biomarkers included \"steroid hormone biosynthesis\" and \"cytokine-cytokine receptor interaction.\" Moreover, immune infiltration analysis revealed that the four biomarkers were negatively correlated with type 17 T helper cells and positively correlated with 15 immune cell types, including activated B cells and central memory CD4 T cells.</p><p><strong>Conclusion: </strong>In conclusion, NID2, MFAP2, COL8A1, and P4HA3 were identified as key biomarkers for KD, offering new avenues for more targeted and effective diagnostic and therapeutic strategies for managing this condition.</p>","PeriodicalId":8915,"journal":{"name":"BMC Medical Genomics","volume":"18 1","pages":"108"},"PeriodicalIF":2.1000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220631/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identification of potential biomarkers and mechanisms for keloid disorder based on comprehensive bioinformatics analysis and machine learning algorithms.\",\"authors\":\"Bowen Zheng, Jianxiong Qiao, Xiaoping Yu, Hanghang Zhou, Anqi Wang, Xuanfen Zhang\",\"doi\":\"10.1186/s12920-025-02174-9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Keloid disorder (KD) encompasses a spectrum of fibroproliferative dermal conditions, the pathogenesis remains complex and incompletely understood. This study sought to identify biomarkers and potential therapeutic targets for KD through an integrative bioinformatics approach and machine learning analysis of RNA sequencing data.</p><p><strong>Methods: </strong>RNA sequencing was performed on skin tissue samples from 13 patients with KD and 14 healthy controls. Using weighted gene co-expression network analysis and differential expression analysis revealed differentially expressed key module genes, and the CytoHubba plugin identified candidate genes. Subsequently analyzed using least absolute shrinkage and selection operator (LASSO) and support vector machine recursive feature elimination (SVM-RFE) methods to pinpoint feature genes associated with KD. Following this, biomarkers were determined through expression level validation, enrichment analysis, and immune infiltration analysis.</p><p><strong>Results: </strong>A total of 420 differentially expressed key module genes were identified, and the top 10 genes with DMNC values were selected as candidate genes. Five feature genes were selected through LASSO and SVM-RFE, with NID2, MFAP2, COL8A1, and P4HA3 showing significant expression differences between KD and control samples, along with consistent expression patterns across datasets, identified as potential biomarkers. These four biomarkers were proved to possess high diagnostic potential, and they were found to exhibit significant positive correlations with one another. Functional enrichment analysis indicated that the primary KEGG pathways associated with these biomarkers included \\\"steroid hormone biosynthesis\\\" and \\\"cytokine-cytokine receptor interaction.\\\" Moreover, immune infiltration analysis revealed that the four biomarkers were negatively correlated with type 17 T helper cells and positively correlated with 15 immune cell types, including activated B cells and central memory CD4 T cells.</p><p><strong>Conclusion: </strong>In conclusion, NID2, MFAP2, COL8A1, and P4HA3 were identified as key biomarkers for KD, offering new avenues for more targeted and effective diagnostic and therapeutic strategies for managing this condition.</p>\",\"PeriodicalId\":8915,\"journal\":{\"name\":\"BMC Medical Genomics\",\"volume\":\"18 1\",\"pages\":\"108\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220631/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Medical Genomics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12920-025-02174-9\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Genomics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12920-025-02174-9","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

背景:瘢痕疙瘩病(KD)包括一系列纤维增生性皮肤疾病,其发病机制仍然复杂且不完全清楚。本研究试图通过综合生物信息学方法和RNA测序数据的机器学习分析来确定KD的生物标志物和潜在治疗靶点。方法:对13例KD患者和14例健康对照者的皮肤组织样本进行RNA测序。通过加权基因共表达网络分析和差异表达分析揭示了差异表达的关键模块基因,并利用CytoHubba插件对候选基因进行了鉴定。随后,采用最小绝对收缩选择算子(LASSO)和支持向量机递归特征消除(SVM-RFE)方法对KD相关特征基因进行定位分析。随后,通过表达水平验证、富集分析和免疫浸润分析确定生物标志物。结果:共鉴定出420个差异表达的关键模块基因,筛选出DMNC值最高的10个基因作为候选基因。通过LASSO和SVM-RFE选择了5个特征基因,其中NID2、MFAP2、COL8A1和P4HA3在KD和对照样品中表达差异显著,并且在数据集中表达模式一致,被确定为潜在的生物标志物。这四种生物标志物被证明具有很高的诊断潜力,并且它们之间表现出显著的正相关。功能富集分析表明,与这些生物标志物相关的主要KEGG途径包括“类固醇激素生物合成”和“细胞因子-细胞因子受体相互作用”。此外,免疫浸润分析显示,这四种生物标志物与17型T辅助细胞呈负相关,与15种免疫细胞类型呈正相关,包括活化B细胞和中枢记忆CD4 T细胞。结论:NID2、MFAP2、COL8A1和P4HA3被确定为KD的关键生物标志物,为更有针对性和更有效的诊断和治疗策略提供了新的途径。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identification of potential biomarkers and mechanisms for keloid disorder based on comprehensive bioinformatics analysis and machine learning algorithms.

Background: Keloid disorder (KD) encompasses a spectrum of fibroproliferative dermal conditions, the pathogenesis remains complex and incompletely understood. This study sought to identify biomarkers and potential therapeutic targets for KD through an integrative bioinformatics approach and machine learning analysis of RNA sequencing data.

Methods: RNA sequencing was performed on skin tissue samples from 13 patients with KD and 14 healthy controls. Using weighted gene co-expression network analysis and differential expression analysis revealed differentially expressed key module genes, and the CytoHubba plugin identified candidate genes. Subsequently analyzed using least absolute shrinkage and selection operator (LASSO) and support vector machine recursive feature elimination (SVM-RFE) methods to pinpoint feature genes associated with KD. Following this, biomarkers were determined through expression level validation, enrichment analysis, and immune infiltration analysis.

Results: A total of 420 differentially expressed key module genes were identified, and the top 10 genes with DMNC values were selected as candidate genes. Five feature genes were selected through LASSO and SVM-RFE, with NID2, MFAP2, COL8A1, and P4HA3 showing significant expression differences between KD and control samples, along with consistent expression patterns across datasets, identified as potential biomarkers. These four biomarkers were proved to possess high diagnostic potential, and they were found to exhibit significant positive correlations with one another. Functional enrichment analysis indicated that the primary KEGG pathways associated with these biomarkers included "steroid hormone biosynthesis" and "cytokine-cytokine receptor interaction." Moreover, immune infiltration analysis revealed that the four biomarkers were negatively correlated with type 17 T helper cells and positively correlated with 15 immune cell types, including activated B cells and central memory CD4 T cells.

Conclusion: In conclusion, NID2, MFAP2, COL8A1, and P4HA3 were identified as key biomarkers for KD, offering new avenues for more targeted and effective diagnostic and therapeutic strategies for managing this condition.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Medical Genomics
BMC Medical Genomics 医学-遗传学
CiteScore
3.90
自引率
0.00%
发文量
243
审稿时长
3.5 months
期刊介绍: BMC Medical Genomics is an open access journal publishing original peer-reviewed research articles in all aspects of functional genomics, genome structure, genome-scale population genetics, epigenomics, proteomics, systems analysis, and pharmacogenomics in relation to human health and disease.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信