Bioinformatics and Machine Learning-Based Identification of Critical Biomarkers and Immune Infiltration in Venous Thromboembolism.

IF 1.5 4区 化学 Q3 CHEMISTRY, ANALYTICAL
International Journal of Analytical Chemistry Pub Date : 2024-11-22 eCollection Date: 2024-01-01 DOI:10.1155/ianc/2202321
Yajing Li, Hongru Deng
{"title":"Bioinformatics and Machine Learning-Based Identification of Critical Biomarkers and Immune Infiltration in Venous Thromboembolism.","authors":"Yajing Li, Hongru Deng","doi":"10.1155/ianc/2202321","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objective:</b> This study aims to use bioinformatics and machine learning algorithms to screen and analyze the key genes involved in venous thromboembolism (VTE) and explore the relationship between these biomarkers and immune cell infiltration. <b>Methods:</b> The gene expression profile with the identifier GSE19151 was downloaded from the GEO database. Differential expression analysis using the limma package was conducted to identify genes that were differentially expressed between VTE and normal samples. Biological activities of these genes were then investigated through GO analysis utilizing the R language package. KEGG and GSEA were also performed to identify key signaling pathways. Furthermore, machine learning techniques were employed to determine hub gene signatures related to VTE, and ROC curves were used to validate the findings. To compare the immune infiltration of healthy and VTE samples, single sample gene set enrichment analysis (ssGSEA) was applied. Lastly, the Spearman correlation coefficient was used to assess the relationship between the expression of hub genes and immune cell infiltration. <b>Results:</b> A total of 628 differentially expressed genes (DEGs) were discovered between the VTE samples and normal samples. GO analysis identified protein polyubiquitination, lysosomal lumen acidification, organellar ribosome, mitochondrial ribosome, ammonium transmembrane transporter activity, and immunoglobulin binding as the processes with the highest abundance of DEGs. KEGG pathway analysis revealed that DEGs were enriched in ribosome, COVID-19, viral infection, oxidative phosphorylation, Parkinson's disease, nonalcoholic fatty liver disease, apoptosis, and cancer. The most prominent KEGG pathways associated with VTE were ribosome, Parkinson's disease, oxidative phosphorylation, Alzheimer's disease, and Huntington's disease according to GSEA findings. DLST and LSP1 were identified as hub gene signatures in VTE by machine learning integrative analysis, and ROC curves confirmed their diagnostic value. Results from ssGSEA indicated a significant difference in the degree of immune cell infiltration between VTE and normal samples, with the expression of DLST and LSP1 positively correlated with the content of some immune cells. The R package, code, and analysis results used in this paper are available on https://github.com/doctorlaby/my-project. <b>Conclusion:</b> Our research is the first to utilize machine learning techniques in identifying DLST and LSP1 as significant biomarkers of VTE. With our findings, we have uncovered new insights into the underlying causes of VTE and potential treatments for affected patients.</p>","PeriodicalId":13888,"journal":{"name":"International Journal of Analytical Chemistry","volume":"2024 ","pages":"2202321"},"PeriodicalIF":1.5000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11608302/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Analytical Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1155/ianc/2202321","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This study aims to use bioinformatics and machine learning algorithms to screen and analyze the key genes involved in venous thromboembolism (VTE) and explore the relationship between these biomarkers and immune cell infiltration. Methods: The gene expression profile with the identifier GSE19151 was downloaded from the GEO database. Differential expression analysis using the limma package was conducted to identify genes that were differentially expressed between VTE and normal samples. Biological activities of these genes were then investigated through GO analysis utilizing the R language package. KEGG and GSEA were also performed to identify key signaling pathways. Furthermore, machine learning techniques were employed to determine hub gene signatures related to VTE, and ROC curves were used to validate the findings. To compare the immune infiltration of healthy and VTE samples, single sample gene set enrichment analysis (ssGSEA) was applied. Lastly, the Spearman correlation coefficient was used to assess the relationship between the expression of hub genes and immune cell infiltration. Results: A total of 628 differentially expressed genes (DEGs) were discovered between the VTE samples and normal samples. GO analysis identified protein polyubiquitination, lysosomal lumen acidification, organellar ribosome, mitochondrial ribosome, ammonium transmembrane transporter activity, and immunoglobulin binding as the processes with the highest abundance of DEGs. KEGG pathway analysis revealed that DEGs were enriched in ribosome, COVID-19, viral infection, oxidative phosphorylation, Parkinson's disease, nonalcoholic fatty liver disease, apoptosis, and cancer. The most prominent KEGG pathways associated with VTE were ribosome, Parkinson's disease, oxidative phosphorylation, Alzheimer's disease, and Huntington's disease according to GSEA findings. DLST and LSP1 were identified as hub gene signatures in VTE by machine learning integrative analysis, and ROC curves confirmed their diagnostic value. Results from ssGSEA indicated a significant difference in the degree of immune cell infiltration between VTE and normal samples, with the expression of DLST and LSP1 positively correlated with the content of some immune cells. The R package, code, and analysis results used in this paper are available on https://github.com/doctorlaby/my-project. Conclusion: Our research is the first to utilize machine learning techniques in identifying DLST and LSP1 as significant biomarkers of VTE. With our findings, we have uncovered new insights into the underlying causes of VTE and potential treatments for affected patients.

基于生物信息学和机器学习的关键生物标志物识别和静脉血栓栓塞的免疫浸润。
目的:本研究旨在利用生物信息学和机器学习算法筛选和分析静脉血栓栓塞(venous thromboembolism, VTE)的关键基因,探讨这些生物标志物与免疫细胞浸润的关系。方法:从GEO数据库下载标识符为GSE19151的基因表达谱。使用limma包进行差异表达分析,以鉴定VTE和正常样本之间差异表达的基因。然后利用R语言包通过GO分析来研究这些基因的生物活性。KEGG和GSEA也被用来确定关键的信号通路。此外,采用机器学习技术来确定与VTE相关的枢纽基因特征,并使用ROC曲线来验证研究结果。采用单样本基因集富集分析(ssGSEA)比较正常和静脉血栓栓塞样本的免疫浸润。最后,采用Spearman相关系数评价hub基因表达与免疫细胞浸润的关系。结果:VTE标本与正常标本共发现628个差异表达基因(DEGs)。氧化石墨烯分析发现,蛋白多泛素化、溶酶体管腔酸化、细胞器核糖体、线粒体核糖体、铵跨膜转运蛋白活性和免疫球蛋白结合是DEGs丰度最高的过程。KEGG通路分析显示,deg在核糖体、COVID-19、病毒感染、氧化磷酸化、帕金森病、非酒精性脂肪性肝病、细胞凋亡和癌症中富集。根据GSEA的发现,与VTE相关的最突出的KEGG通路是核糖体、帕金森病、氧化磷酸化、阿尔茨海默病和亨廷顿病。通过机器学习综合分析,确定DLST和LSP1为VTE的枢纽基因特征,ROC曲线证实了它们的诊断价值。ssGSEA结果显示VTE与正常标本免疫细胞浸润程度有显著差异,DLST和LSP1表达与部分免疫细胞含量呈正相关。本文中使用的R包、代码和分析结果可在https://github.com/doctorlaby/my-project上获得。结论:我们的研究首次利用机器学习技术识别DLST和LSP1作为VTE的重要生物标志物。根据我们的研究结果,我们对静脉血栓栓塞的潜在原因和受影响患者的潜在治疗方法有了新的认识。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.10
自引率
5.60%
发文量
117
期刊介绍: International Journal of Analytical Chemistry publishes original research articles that report new experimental results and methods, especially in relation to important analytes, difficult matrices, and topical samples. Investigations may be fundamental, or else related to specific applications; examples being biological, environmental and food testing, and analysis in chemical synthesis and materials processing. As well as original research, the International Journal of Analytical Chemistry also publishes focused review articles that examine the state of the art, identify emerging trends, and suggest future directions for developing fields.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信