{"title":"机器学习和WGCNA网络分析识别克罗恩病术后复发的潜在基因和关键途径","authors":"Aruna Rajalingam, Kanagaraj Sekar, Anjali Ganjiwale","doi":"10.2174/1389202924666230601122334","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Crohn's disease (CD) is a chronic idiopathic inflammatory bowel disease affecting the entire gastrointestinal tract from the mouth to the anus. These patients often experience a period of symptomatic relapse and remission. A 20 - 30% symptomatic recurrence rate is reported in the first year after surgery, with a 10% increase each subsequent year. Thus, surgery is done only to relieve symptoms and not for the complete cure of the disease. The determinants and the genetic factors of this disease recurrence are also not well-defined. Therefore, enhanced diagnostic efficiency and prognostic outcome are critical for confronting CD recurrence.</p><p><strong>Methods: </strong>We analysed ileal mucosa samples collected from neo-terminal ileum six months after surgery (M6=121 samples) from Crohn's disease dataset (GSE186582). The primary aim of this study is to identify the potential genes and critical pathways in post-operative recurrence of Crohn's disease. We combined the differential gene expression analysis with Recursive feature elimination (RFE), a machine learning approach to get five critical genes for the postoperative recurrence of Crohn's disease. The features (genes) selected by different methods were validated using five binary classifiers for recurrence and remission samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbor (KNN) with 10-fold cross-validation. We also performed weighted gene co-expression network analysis (WGCNA) to select specific modules and feature genes associated with Crohn's disease postoperative recurrence, smoking, and biological sex. Combined with other biological interpretations, including Gene Ontology (GO) analysis, pathway enrichment, and protein-protein interaction (PPI) network analysis, our current study sheds light on the in-depth research of CD diagnosis and prognosis in postoperative recurrence.</p><p><strong>Results: </strong>PLOD2, ZNF165, BOK, CX3CR1, and ARMCX4, are the important genes identified from the machine learning approach. These genes are reported to be involved in the viral protein interaction with cytokine and cytokine receptors, lysine degradation, and apoptosis. They are also linked with various cellular and molecular functions such as Peptidyl-lysine hydroxylation, Central nervous system maturation, G protein-coupled chemoattractant receptor activity, BCL-2 homology (BH) domain binding, Gliogenesis and negative regulation of mitochondrial depolarization. WGCNA identified a gene co-expression module that was primarily involved in mitochondrial translational elongation, mitochondrial translational termination, mitochondrial translation, mitochondrial respiratory chain complex, mRNA splicing <i>via</i> spliceosome pathways, <i>etc</i>.; Both the analysis result emphasizes that the mitochondrial depolarization pathway is linked with CD recurrence leading to oxidative stress in promoting inflammation in CD patients.</p><p><strong>Conclusion: </strong>These key genes serve as the novel diagnostic biomarker for the postoperative recurrence of Crohn's disease. Thus, among other treatment options present until now, these biomarkers would provide success in both diagnosis and prognosis, aiming for a long-lasting remission to prevent further complications in CD.</p>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2023-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10662376/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identification of Potential Genes and Critical Pathways in Postoperative Recurrence of Crohn's Disease by Machine Learning And WGCNA Network Analysis.\",\"authors\":\"Aruna Rajalingam, Kanagaraj Sekar, Anjali Ganjiwale\",\"doi\":\"10.2174/1389202924666230601122334\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Crohn's disease (CD) is a chronic idiopathic inflammatory bowel disease affecting the entire gastrointestinal tract from the mouth to the anus. These patients often experience a period of symptomatic relapse and remission. A 20 - 30% symptomatic recurrence rate is reported in the first year after surgery, with a 10% increase each subsequent year. Thus, surgery is done only to relieve symptoms and not for the complete cure of the disease. The determinants and the genetic factors of this disease recurrence are also not well-defined. Therefore, enhanced diagnostic efficiency and prognostic outcome are critical for confronting CD recurrence.</p><p><strong>Methods: </strong>We analysed ileal mucosa samples collected from neo-terminal ileum six months after surgery (M6=121 samples) from Crohn's disease dataset (GSE186582). The primary aim of this study is to identify the potential genes and critical pathways in post-operative recurrence of Crohn's disease. We combined the differential gene expression analysis with Recursive feature elimination (RFE), a machine learning approach to get five critical genes for the postoperative recurrence of Crohn's disease. The features (genes) selected by different methods were validated using five binary classifiers for recurrence and remission samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbor (KNN) with 10-fold cross-validation. We also performed weighted gene co-expression network analysis (WGCNA) to select specific modules and feature genes associated with Crohn's disease postoperative recurrence, smoking, and biological sex. Combined with other biological interpretations, including Gene Ontology (GO) analysis, pathway enrichment, and protein-protein interaction (PPI) network analysis, our current study sheds light on the in-depth research of CD diagnosis and prognosis in postoperative recurrence.</p><p><strong>Results: </strong>PLOD2, ZNF165, BOK, CX3CR1, and ARMCX4, are the important genes identified from the machine learning approach. These genes are reported to be involved in the viral protein interaction with cytokine and cytokine receptors, lysine degradation, and apoptosis. They are also linked with various cellular and molecular functions such as Peptidyl-lysine hydroxylation, Central nervous system maturation, G protein-coupled chemoattractant receptor activity, BCL-2 homology (BH) domain binding, Gliogenesis and negative regulation of mitochondrial depolarization. WGCNA identified a gene co-expression module that was primarily involved in mitochondrial translational elongation, mitochondrial translational termination, mitochondrial translation, mitochondrial respiratory chain complex, mRNA splicing <i>via</i> spliceosome pathways, <i>etc</i>.; Both the analysis result emphasizes that the mitochondrial depolarization pathway is linked with CD recurrence leading to oxidative stress in promoting inflammation in CD patients.</p><p><strong>Conclusion: </strong>These key genes serve as the novel diagnostic biomarker for the postoperative recurrence of Crohn's disease. Thus, among other treatment options present until now, these biomarkers would provide success in both diagnosis and prognosis, aiming for a long-lasting remission to prevent further complications in CD.</p>\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2023-10-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10662376/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.2174/1389202924666230601122334\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.2174/1389202924666230601122334","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
Identification of Potential Genes and Critical Pathways in Postoperative Recurrence of Crohn's Disease by Machine Learning And WGCNA Network Analysis.
Background: Crohn's disease (CD) is a chronic idiopathic inflammatory bowel disease affecting the entire gastrointestinal tract from the mouth to the anus. These patients often experience a period of symptomatic relapse and remission. A 20 - 30% symptomatic recurrence rate is reported in the first year after surgery, with a 10% increase each subsequent year. Thus, surgery is done only to relieve symptoms and not for the complete cure of the disease. The determinants and the genetic factors of this disease recurrence are also not well-defined. Therefore, enhanced diagnostic efficiency and prognostic outcome are critical for confronting CD recurrence.
Methods: We analysed ileal mucosa samples collected from neo-terminal ileum six months after surgery (M6=121 samples) from Crohn's disease dataset (GSE186582). The primary aim of this study is to identify the potential genes and critical pathways in post-operative recurrence of Crohn's disease. We combined the differential gene expression analysis with Recursive feature elimination (RFE), a machine learning approach to get five critical genes for the postoperative recurrence of Crohn's disease. The features (genes) selected by different methods were validated using five binary classifiers for recurrence and remission samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbor (KNN) with 10-fold cross-validation. We also performed weighted gene co-expression network analysis (WGCNA) to select specific modules and feature genes associated with Crohn's disease postoperative recurrence, smoking, and biological sex. Combined with other biological interpretations, including Gene Ontology (GO) analysis, pathway enrichment, and protein-protein interaction (PPI) network analysis, our current study sheds light on the in-depth research of CD diagnosis and prognosis in postoperative recurrence.
Results: PLOD2, ZNF165, BOK, CX3CR1, and ARMCX4, are the important genes identified from the machine learning approach. These genes are reported to be involved in the viral protein interaction with cytokine and cytokine receptors, lysine degradation, and apoptosis. They are also linked with various cellular and molecular functions such as Peptidyl-lysine hydroxylation, Central nervous system maturation, G protein-coupled chemoattractant receptor activity, BCL-2 homology (BH) domain binding, Gliogenesis and negative regulation of mitochondrial depolarization. WGCNA identified a gene co-expression module that was primarily involved in mitochondrial translational elongation, mitochondrial translational termination, mitochondrial translation, mitochondrial respiratory chain complex, mRNA splicing via spliceosome pathways, etc.; Both the analysis result emphasizes that the mitochondrial depolarization pathway is linked with CD recurrence leading to oxidative stress in promoting inflammation in CD patients.
Conclusion: These key genes serve as the novel diagnostic biomarker for the postoperative recurrence of Crohn's disease. Thus, among other treatment options present until now, these biomarkers would provide success in both diagnosis and prognosis, aiming for a long-lasting remission to prevent further complications in CD.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.