An integrated machine learning framework for developing and validating diagnostic models and drug predictions based on ulcerative colitis genes.

IF 3.1 3区 医学 Q1 MEDICINE, GENERAL & INTERNAL
Frontiers in Medicine Pub Date : 2025-06-13 eCollection Date: 2025-01-01 DOI:10.3389/fmed.2025.1571529
Na An, Zhongwen Lu, Yang Li, Bing Yang, Shaozhen Ji, Xu Dong, Zhaoliang Ding
{"title":"An integrated machine learning framework for developing and validating diagnostic models and drug predictions based on ulcerative colitis genes.","authors":"Na An, Zhongwen Lu, Yang Li, Bing Yang, Shaozhen Ji, Xu Dong, Zhaoliang Ding","doi":"10.3389/fmed.2025.1571529","DOIUrl":null,"url":null,"abstract":"<p><p>Ulcerative colitis (UC) is a long-lasting inflammatory bowel disease that causes inflammation in the intestines and triggers autoimmune responses. This study aims to identify immune-related biomarkers for ulcerative colitis (UC) and explore potential therapeutic targets. First, we downloaded the expression profiles of datasets GSE87466, GSE87473, and GSE92415 from the GEO database. Next, we identified differentially expressed genes (DEGs) that are associated with UC. Using the WGCNA algorithm, we screened key module genes in UC and retrieved immune-related genes (IRGs) from the ImmPort database. We identified immune-related differentially expressed genes by intersecting the results from WGCNA, DEGs, and IRGs. To build a diagnostic model for UC, we applied 113 combinations of 12 machine learning algorithms. This included 10-fold cross-validation on the training set and external validation on the test set. The single-cell results presented the cellular profile of UC and indicated that the key genes were significantly associated with macrophages, epithelial cells, and fibroblasts. The single-cell results presented the cell atlas of UC and suggested that key genes were significantly associated with macrophages, epithelial cells and fibroblasts. Quantitative polymerase chain reaction (q-PCR) was used to verify the expression levels of the core biomarkers screened out by machine learning. We conducted enrichment analysis using Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and gene set enrichment analysis (GSEA), which showed biological processes and signaling pathways associated with UC. Immune cell infiltration analysis based on CIBERSORT was also performed. We also screened potential drugs from the DSigDB drug database. To evaluate their effectiveness, we performed molecular docking and dynamics simulations. The results suggested that compounds like thalidomide and troglitazone are promising candidates for new UC drug development. Our findings provide insights into the pathogenesis of UC, its clinical treatment, and potential drug development.</p>","PeriodicalId":12488,"journal":{"name":"Frontiers in Medicine","volume":"12 ","pages":"1571529"},"PeriodicalIF":3.1000,"publicationDate":"2025-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12202452/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fmed.2025.1571529","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Ulcerative colitis (UC) is a long-lasting inflammatory bowel disease that causes inflammation in the intestines and triggers autoimmune responses. This study aims to identify immune-related biomarkers for ulcerative colitis (UC) and explore potential therapeutic targets. First, we downloaded the expression profiles of datasets GSE87466, GSE87473, and GSE92415 from the GEO database. Next, we identified differentially expressed genes (DEGs) that are associated with UC. Using the WGCNA algorithm, we screened key module genes in UC and retrieved immune-related genes (IRGs) from the ImmPort database. We identified immune-related differentially expressed genes by intersecting the results from WGCNA, DEGs, and IRGs. To build a diagnostic model for UC, we applied 113 combinations of 12 machine learning algorithms. This included 10-fold cross-validation on the training set and external validation on the test set. The single-cell results presented the cellular profile of UC and indicated that the key genes were significantly associated with macrophages, epithelial cells, and fibroblasts. The single-cell results presented the cell atlas of UC and suggested that key genes were significantly associated with macrophages, epithelial cells and fibroblasts. Quantitative polymerase chain reaction (q-PCR) was used to verify the expression levels of the core biomarkers screened out by machine learning. We conducted enrichment analysis using Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG), and gene set enrichment analysis (GSEA), which showed biological processes and signaling pathways associated with UC. Immune cell infiltration analysis based on CIBERSORT was also performed. We also screened potential drugs from the DSigDB drug database. To evaluate their effectiveness, we performed molecular docking and dynamics simulations. The results suggested that compounds like thalidomide and troglitazone are promising candidates for new UC drug development. Our findings provide insights into the pathogenesis of UC, its clinical treatment, and potential drug development.

基于溃疡性结肠炎基因开发和验证诊断模型和药物预测的集成机器学习框架。
溃疡性结肠炎(UC)是一种长期的炎症性肠病,引起肠道炎症并引发自身免疫反应。本研究旨在鉴定溃疡性结肠炎(UC)的免疫相关生物标志物,并探索潜在的治疗靶点。首先,我们从GEO数据库中下载了GSE87466、GSE87473和GSE92415数据集的表达谱。接下来,我们鉴定了与UC相关的差异表达基因(DEGs)。我们使用WGCNA算法筛选UC的关键模块基因,并从import数据库中检索免疫相关基因(IRGs)。通过交叉WGCNA、deg和irg的结果,我们确定了免疫相关的差异表达基因。为了建立UC的诊断模型,我们应用了12种机器学习算法的113种组合。这包括对训练集的10倍交叉验证和对测试集的外部验证。单细胞结果显示UC的细胞图谱,并表明关键基因与巨噬细胞、上皮细胞和成纤维细胞显著相关。单细胞结果显示UC的细胞图谱,提示关键基因与巨噬细胞、上皮细胞和成纤维细胞显著相关。采用定量聚合酶链反应(q-PCR)验证机器学习筛选出的核心生物标志物的表达水平。我们使用基因本体(GO)、京都基因与基因组百科全书(KEGG)和基因集富集分析(GSEA)进行富集分析,揭示了与UC相关的生物学过程和信号通路。同时进行基于CIBERSORT的免疫细胞浸润分析。我们还从DSigDB药物数据库中筛选潜在药物。为了评估其有效性,我们进行了分子对接和动力学模拟。结果表明,像沙利度胺和曲格列酮这样的化合物是新的UC药物开发的有希望的候选者。我们的发现为UC的发病机制、临床治疗和潜在的药物开发提供了见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Medicine
Frontiers in Medicine Medicine-General Medicine
CiteScore
5.10
自引率
5.10%
发文量
3710
审稿时长
12 weeks
期刊介绍: Frontiers in Medicine publishes rigorously peer-reviewed research linking basic research to clinical practice and patient care, as well as translating scientific advances into new therapies and diagnostic tools. Led by an outstanding Editorial Board of international experts, this multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide. In addition to papers that provide a link between basic research and clinical practice, a particular emphasis is given to studies that are directly relevant to patient care. In this spirit, the journal publishes the latest research results and medical knowledge that facilitate the translation of scientific advances into new therapies or diagnostic tools. The full listing of the Specialty Sections represented by Frontiers in Medicine is as listed below. As well as the established medical disciplines, Frontiers in Medicine is launching new sections that together will facilitate - the use of patient-reported outcomes under real world conditions - the exploitation of big data and the use of novel information and communication tools in the assessment of new medicines - the scientific bases for guidelines and decisions from regulatory authorities - access to medicinal products and medical devices worldwide - addressing the grand health challenges around the world
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信