Yang Bai, Zequn Niu, Zhenyu Yang, Yi Sun, Weidong Yan, Anshi Wu, Changwei Wei
{"title":"综合生物信息学和机器学习算法揭示了急性心肌梗死中的未折叠蛋白反应途径和免疫浸润。","authors":"Yang Bai, Zequn Niu, Zhenyu Yang, Yi Sun, Weidong Yan, Anshi Wu, Changwei Wei","doi":"10.21037/jtd-24-622","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The unfolded protein response (UPR) is a critical biological process related to a variety of physiological functions and cardiac disease. However, the role of UPR-related genes in acute myocardial infarction (AMI) has not been well characterized. Therefore, this study aims to elucidate the mechanism and role of the UPR in the context of AMI.</p><p><strong>Methods: </strong>Gene expression profiles related to AMI and UPR pathway were downloaded from the Gene Expression Omnibus database and PathCards database, respectively. Differentially expressed genes (DEGs) were identified and then functionally annotated. The random forest (RF) and least absolute shrinkage and selection operator (LASSO) regression analysis were conducted to identify potential diagnostic UPR-AMI biomarkers. Furthermore, the results were validated by using external data sets, and discriminability was measured by the area under the curve (AUC). A nomogram based on the feature genes was developed to predict the AMI-risk rate. Then we utilized two algorithms, CIBERSORT and MCPcounter, to investigate the relationship between the key genes and immune microenvironment. Additionally, we performed uniform clustering of AMI samples based on the expression of UPR pathway-related genes. The weighted gene co-expression network analysis was conducted to identify the key modules in various clusters, enrichment analysis was performed for the genes existing in different modules.</p><p><strong>Results: </strong>A total of 14 DEGs related to the UPR pathway were identified. Among the 14 DEGs, <i>CEBPB</i>, <i>ATF3</i>, <i>EIF2S3</i>, and <i>TSPYL2</i> were subsequently identified as biomarkers by the LASSO and RF algorithms. A diagnostic model was constructed with these four genes, and the AUC was 0.939. The calibration curves, receiver operating characteristic (ROC) curves, and the decision curve analysis of the nomogram exhibited good performance. Furthermore, immune cell infiltration analysis revealed that four feature genes were linked with the infiltration of immune cells such as neutrophils. The cluster analysis of the AMI samples identified two distinct clusters, each with differential expression of genes related to the UPR pathway, immune cell infiltration, and inflammatory cytokine secretion. Weighted gene coexpression network analysis and enrichment analysis showed that both clusters were associated with the UPR.</p><p><strong>Conclusions: </strong>Our study highlights the importance of the UPR pathway in the pathogenesis of myocardial infarction, and identifies four genes <i>CEBPB</i>, <i>ATF3</i>, <i>EIF2S3</i>, and <i>TSPYL2</i> as diagnostic biomarkers for AMI, providing new ideas for the clinical diagnosis and treatment of AMI.</p>","PeriodicalId":17542,"journal":{"name":"Journal of thoracic disease","volume":"16 10","pages":"6496-6515"},"PeriodicalIF":2.1000,"publicationDate":"2024-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11565340/pdf/","citationCount":"0","resultStr":"{\"title\":\"Integrated bioinformatics and machine learning algorithms reveal the unfolded protein response pathways and immune infiltration in acute myocardial infarction.\",\"authors\":\"Yang Bai, Zequn Niu, Zhenyu Yang, Yi Sun, Weidong Yan, Anshi Wu, Changwei Wei\",\"doi\":\"10.21037/jtd-24-622\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The unfolded protein response (UPR) is a critical biological process related to a variety of physiological functions and cardiac disease. However, the role of UPR-related genes in acute myocardial infarction (AMI) has not been well characterized. Therefore, this study aims to elucidate the mechanism and role of the UPR in the context of AMI.</p><p><strong>Methods: </strong>Gene expression profiles related to AMI and UPR pathway were downloaded from the Gene Expression Omnibus database and PathCards database, respectively. Differentially expressed genes (DEGs) were identified and then functionally annotated. The random forest (RF) and least absolute shrinkage and selection operator (LASSO) regression analysis were conducted to identify potential diagnostic UPR-AMI biomarkers. Furthermore, the results were validated by using external data sets, and discriminability was measured by the area under the curve (AUC). A nomogram based on the feature genes was developed to predict the AMI-risk rate. Then we utilized two algorithms, CIBERSORT and MCPcounter, to investigate the relationship between the key genes and immune microenvironment. Additionally, we performed uniform clustering of AMI samples based on the expression of UPR pathway-related genes. The weighted gene co-expression network analysis was conducted to identify the key modules in various clusters, enrichment analysis was performed for the genes existing in different modules.</p><p><strong>Results: </strong>A total of 14 DEGs related to the UPR pathway were identified. Among the 14 DEGs, <i>CEBPB</i>, <i>ATF3</i>, <i>EIF2S3</i>, and <i>TSPYL2</i> were subsequently identified as biomarkers by the LASSO and RF algorithms. A diagnostic model was constructed with these four genes, and the AUC was 0.939. The calibration curves, receiver operating characteristic (ROC) curves, and the decision curve analysis of the nomogram exhibited good performance. Furthermore, immune cell infiltration analysis revealed that four feature genes were linked with the infiltration of immune cells such as neutrophils. The cluster analysis of the AMI samples identified two distinct clusters, each with differential expression of genes related to the UPR pathway, immune cell infiltration, and inflammatory cytokine secretion. Weighted gene coexpression network analysis and enrichment analysis showed that both clusters were associated with the UPR.</p><p><strong>Conclusions: </strong>Our study highlights the importance of the UPR pathway in the pathogenesis of myocardial infarction, and identifies four genes <i>CEBPB</i>, <i>ATF3</i>, <i>EIF2S3</i>, and <i>TSPYL2</i> as diagnostic biomarkers for AMI, providing new ideas for the clinical diagnosis and treatment of AMI.</p>\",\"PeriodicalId\":17542,\"journal\":{\"name\":\"Journal of thoracic disease\",\"volume\":\"16 10\",\"pages\":\"6496-6515\"},\"PeriodicalIF\":2.1000,\"publicationDate\":\"2024-10-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11565340/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of thoracic disease\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.21037/jtd-24-622\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/10/30 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q3\",\"JCRName\":\"RESPIRATORY SYSTEM\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of thoracic disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/jtd-24-622","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/30 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"RESPIRATORY SYSTEM","Score":null,"Total":0}
引用次数: 0
摘要
背景:未折叠蛋白反应(UPR)是一个与多种生理功能和心脏疾病相关的关键生物过程。然而,UPR 相关基因在急性心肌梗死(AMI)中的作用尚未得到很好的描述。因此,本研究旨在阐明 UPR 在 AMI 中的机制和作用:方法:分别从基因表达总库数据库和 PathCards 数据库下载与 AMI 和 UPR 通路相关的基因表达谱。确定差异表达基因(DEG),然后进行功能注释。通过随机森林(RF)和最小绝对收缩与选择算子(LASSO)回归分析,确定了潜在的UPR-AMI诊断生物标志物。此外,还利用外部数据集对结果进行了验证,并通过曲线下面积(AUC)测量了可鉴别性。根据特征基因绘制的提名图用于预测 AMI 风险率。然后,我们利用 CIBERSORT 和 MCPcounter 两种算法来研究关键基因与免疫微环境之间的关系。此外,我们还根据 UPR 通路相关基因的表达对 AMI 样本进行了统一聚类。通过加权基因共表达网络分析确定了不同聚类中的关键模块,并对不同模块中存在的基因进行了富集分析:结果:共鉴定出 14 个与 UPR 通路相关的 DEGs。在这 14 个 DEGs 中,CEBPB、ATF3、EIF2S3 和 TSPYL2 随后被 LASSO 和 RF 算法鉴定为生物标记物。利用这四个基因构建的诊断模型的AUC为0.939。标定曲线、接收者操作特征曲线(ROC)和提名图的决策曲线分析均表现出良好的性能。此外,免疫细胞浸润分析表明,四个特征基因与中性粒细胞等免疫细胞的浸润有关。对 AMI 样本的聚类分析发现了两个不同的聚类,每个聚类中与 UPR 通路、免疫细胞浸润和炎性细胞因子分泌相关的基因都有不同的表达。加权基因共表达网络分析和富集分析表明,这两个簇都与 UPR 相关:我们的研究强调了 UPR 通路在心肌梗死发病机制中的重要性,并发现了 CEBPB、ATF3、EIF2S3 和 TSPYL2 四个基因可作为 AMI 的诊断生物标志物,为 AMI 的临床诊断和治疗提供了新思路。
Integrated bioinformatics and machine learning algorithms reveal the unfolded protein response pathways and immune infiltration in acute myocardial infarction.
Background: The unfolded protein response (UPR) is a critical biological process related to a variety of physiological functions and cardiac disease. However, the role of UPR-related genes in acute myocardial infarction (AMI) has not been well characterized. Therefore, this study aims to elucidate the mechanism and role of the UPR in the context of AMI.
Methods: Gene expression profiles related to AMI and UPR pathway were downloaded from the Gene Expression Omnibus database and PathCards database, respectively. Differentially expressed genes (DEGs) were identified and then functionally annotated. The random forest (RF) and least absolute shrinkage and selection operator (LASSO) regression analysis were conducted to identify potential diagnostic UPR-AMI biomarkers. Furthermore, the results were validated by using external data sets, and discriminability was measured by the area under the curve (AUC). A nomogram based on the feature genes was developed to predict the AMI-risk rate. Then we utilized two algorithms, CIBERSORT and MCPcounter, to investigate the relationship between the key genes and immune microenvironment. Additionally, we performed uniform clustering of AMI samples based on the expression of UPR pathway-related genes. The weighted gene co-expression network analysis was conducted to identify the key modules in various clusters, enrichment analysis was performed for the genes existing in different modules.
Results: A total of 14 DEGs related to the UPR pathway were identified. Among the 14 DEGs, CEBPB, ATF3, EIF2S3, and TSPYL2 were subsequently identified as biomarkers by the LASSO and RF algorithms. A diagnostic model was constructed with these four genes, and the AUC was 0.939. The calibration curves, receiver operating characteristic (ROC) curves, and the decision curve analysis of the nomogram exhibited good performance. Furthermore, immune cell infiltration analysis revealed that four feature genes were linked with the infiltration of immune cells such as neutrophils. The cluster analysis of the AMI samples identified two distinct clusters, each with differential expression of genes related to the UPR pathway, immune cell infiltration, and inflammatory cytokine secretion. Weighted gene coexpression network analysis and enrichment analysis showed that both clusters were associated with the UPR.
Conclusions: Our study highlights the importance of the UPR pathway in the pathogenesis of myocardial infarction, and identifies four genes CEBPB, ATF3, EIF2S3, and TSPYL2 as diagnostic biomarkers for AMI, providing new ideas for the clinical diagnosis and treatment of AMI.
期刊介绍:
The Journal of Thoracic Disease (JTD, J Thorac Dis, pISSN: 2072-1439; eISSN: 2077-6624) was founded in Dec 2009, and indexed in PubMed in Dec 2011 and Science Citation Index SCI in Feb 2013. It is published quarterly (Dec 2009- Dec 2011), bimonthly (Jan 2012 - Dec 2013), monthly (Jan. 2014-) and openly distributed worldwide. JTD received its impact factor of 2.365 for the year 2016. JTD publishes manuscripts that describe new findings and provide current, practical information on the diagnosis and treatment of conditions related to thoracic disease. All the submission and reviewing are conducted electronically so that rapid review is assured.