Identification of Potential Genes and Critical Pathways in Postoperative Recurrence of Crohn's Disease by Machine Learning And WGCNA Network Analysis.

IF 16.4 1区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Aruna Rajalingam, Kanagaraj Sekar, Anjali Ganjiwale
{"title":"Identification of Potential Genes and Critical Pathways in Postoperative Recurrence of Crohn's Disease by Machine Learning And WGCNA Network Analysis.","authors":"Aruna Rajalingam, Kanagaraj Sekar, Anjali Ganjiwale","doi":"10.2174/1389202924666230601122334","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Crohn's disease (CD) is a chronic idiopathic inflammatory bowel disease affecting the entire gastrointestinal tract from the mouth to the anus. These patients often experience a period of symptomatic relapse and remission. A 20 - 30% symptomatic recurrence rate is reported in the first year after surgery, with a 10% increase each subsequent year. Thus, surgery is done only to relieve symptoms and not for the complete cure of the disease. The determinants and the genetic factors of this disease recurrence are also not well-defined. Therefore, enhanced diagnostic efficiency and prognostic outcome are critical for confronting CD recurrence.</p><p><strong>Methods: </strong>We analysed ileal mucosa samples collected from neo-terminal ileum six months after surgery (M6=121 samples) from Crohn's disease dataset (GSE186582). The primary aim of this study is to identify the potential genes and critical pathways in post-operative recurrence of Crohn's disease. We combined the differential gene expression analysis with Recursive feature elimination (RFE), a machine learning approach to get five critical genes for the postoperative recurrence of Crohn's disease. The features (genes) selected by different methods were validated using five binary classifiers for recurrence and remission samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbor (KNN) with 10-fold cross-validation. We also performed weighted gene co-expression network analysis (WGCNA) to select specific modules and feature genes associated with Crohn's disease postoperative recurrence, smoking, and biological sex. Combined with other biological interpretations, including Gene Ontology (GO) analysis, pathway enrichment, and protein-protein interaction (PPI) network analysis, our current study sheds light on the in-depth research of CD diagnosis and prognosis in postoperative recurrence.</p><p><strong>Results: </strong>PLOD2, ZNF165, BOK, CX3CR1, and ARMCX4, are the important genes identified from the machine learning approach. These genes are reported to be involved in the viral protein interaction with cytokine and cytokine receptors, lysine degradation, and apoptosis. They are also linked with various cellular and molecular functions such as Peptidyl-lysine hydroxylation, Central nervous system maturation, G protein-coupled chemoattractant receptor activity, BCL-2 homology (BH) domain binding, Gliogenesis and negative regulation of mitochondrial depolarization. WGCNA identified a gene co-expression module that was primarily involved in mitochondrial translational elongation, mitochondrial translational termination, mitochondrial translation, mitochondrial respiratory chain complex, mRNA splicing <i>via</i> spliceosome pathways, <i>etc</i>.; Both the analysis result emphasizes that the mitochondrial depolarization pathway is linked with CD recurrence leading to oxidative stress in promoting inflammation in CD patients.</p><p><strong>Conclusion: </strong>These key genes serve as the novel diagnostic biomarker for the postoperative recurrence of Crohn's disease. Thus, among other treatment options present until now, these biomarkers would provide success in both diagnosis and prognosis, aiming for a long-lasting remission to prevent further complications in CD.</p>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10662376/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/1389202924666230601122334","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Crohn's disease (CD) is a chronic idiopathic inflammatory bowel disease affecting the entire gastrointestinal tract from the mouth to the anus. These patients often experience a period of symptomatic relapse and remission. A 20 - 30% symptomatic recurrence rate is reported in the first year after surgery, with a 10% increase each subsequent year. Thus, surgery is done only to relieve symptoms and not for the complete cure of the disease. The determinants and the genetic factors of this disease recurrence are also not well-defined. Therefore, enhanced diagnostic efficiency and prognostic outcome are critical for confronting CD recurrence.

Methods: We analysed ileal mucosa samples collected from neo-terminal ileum six months after surgery (M6=121 samples) from Crohn's disease dataset (GSE186582). The primary aim of this study is to identify the potential genes and critical pathways in post-operative recurrence of Crohn's disease. We combined the differential gene expression analysis with Recursive feature elimination (RFE), a machine learning approach to get five critical genes for the postoperative recurrence of Crohn's disease. The features (genes) selected by different methods were validated using five binary classifiers for recurrence and remission samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbor (KNN) with 10-fold cross-validation. We also performed weighted gene co-expression network analysis (WGCNA) to select specific modules and feature genes associated with Crohn's disease postoperative recurrence, smoking, and biological sex. Combined with other biological interpretations, including Gene Ontology (GO) analysis, pathway enrichment, and protein-protein interaction (PPI) network analysis, our current study sheds light on the in-depth research of CD diagnosis and prognosis in postoperative recurrence.

Results: PLOD2, ZNF165, BOK, CX3CR1, and ARMCX4, are the important genes identified from the machine learning approach. These genes are reported to be involved in the viral protein interaction with cytokine and cytokine receptors, lysine degradation, and apoptosis. They are also linked with various cellular and molecular functions such as Peptidyl-lysine hydroxylation, Central nervous system maturation, G protein-coupled chemoattractant receptor activity, BCL-2 homology (BH) domain binding, Gliogenesis and negative regulation of mitochondrial depolarization. WGCNA identified a gene co-expression module that was primarily involved in mitochondrial translational elongation, mitochondrial translational termination, mitochondrial translation, mitochondrial respiratory chain complex, mRNA splicing via spliceosome pathways, etc.; Both the analysis result emphasizes that the mitochondrial depolarization pathway is linked with CD recurrence leading to oxidative stress in promoting inflammation in CD patients.

Conclusion: These key genes serve as the novel diagnostic biomarker for the postoperative recurrence of Crohn's disease. Thus, among other treatment options present until now, these biomarkers would provide success in both diagnosis and prognosis, aiming for a long-lasting remission to prevent further complications in CD.

机器学习和WGCNA网络分析识别克罗恩病术后复发的潜在基因和关键途径
克罗恩病(CD)是一种慢性特发性炎症性肠病,影响从口腔到肛门的整个胃肠道。这些患者通常会经历一段时间的症状复发和缓解。据报道,术后第一年症状复发率为20-30%,随后每年增加10%。因此,手术只是为了缓解症状,而不是为了彻底治愈疾病。这种疾病复发的决定因素和遗传因素也没有明确定义。因此,提高诊断效率和预后对于对抗CD复发至关重要。我们分析了克罗恩病数据集(GSE186582)中手术后6个月从新末端回肠收集的回肠粘膜样本(M6=121个样本)。本研究的主要目的是确定克罗恩病术后复发的潜在基因和关键途径。我们将差异基因表达分析与递归特征消除(RFE)相结合,这是一种机器学习方法,可以获得克罗恩病术后复发的五个关键基因。使用五个用于复发和缓解样本的二元分类器对通过不同方法选择的特征(基因)进行验证:逻辑回归(LR)、决策树分类器(DT)、支持向量机(SVM)、随机森林分类器(RF)和K近邻(KNN),并进行10倍交叉验证。我们还进行了加权基因共表达网络分析(WGCNA),以选择与克罗恩病术后复发、吸烟和生物学性别相关的特定模块和特征基因。结合其他生物学解释,包括基因本体论(GO)分析、通路富集和蛋白质-蛋白质相互作用(PPI)网络分析,我们目前的研究为术后复发CD诊断和预后的深入研究提供了线索。PLOD2、ZNF165、BOK、CX3CR1和ARMCX4是通过机器学习方法鉴定的重要基因。据报道,这些基因参与病毒蛋白与细胞因子和细胞因子受体的相互作用、赖氨酸降解和细胞凋亡。它们还与各种细胞和分子功能有关,如肽基赖氨酸羟基化、中枢神经系统成熟、G蛋白偶联化学引诱剂受体活性、BCL-2同源性(BH)结构域结合、胶质形成和线粒体去极化的负调控。WGCNA鉴定出一个基因共表达模块,主要参与线粒体翻译延长、线粒体翻译终止、线粒体翻译、线粒体呼吸链复合体、通过剪接体途径剪接信使核糖核酸等。;这两个分析结果都强调线粒体去极化途径与CD复发有关,导致CD患者的氧化应激促进炎症。这些关键基因是克罗恩病术后复发的新的诊断生物标志物。因此,在迄今为止存在的其他治疗选择中,这些生物标志物将在诊断和预后方面取得成功,旨在获得长期缓解,以防止CD的进一步并发症。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Accounts of Chemical Research
Accounts of Chemical Research 化学-化学综合
CiteScore
31.40
自引率
1.10%
发文量
312
审稿时长
2 months
期刊介绍: Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance. Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信