A Novel Hypoxia-Related Gene Signature with Strong Predicting Ability in Non-Small-Cell Lung Cancer Identified by Comprehensive Profiling

IF 2.6 4区 生物学 Q3 BIOCHEMISTRY & MOLECULAR BIOLOGY
Huajun Yang, Zhongan Wang, L. Gong, Guichuan Huang, Daigang Chen, Xiao-peng Li, Fei Du, Jiang Lin, Xueyi Yang
{"title":"A Novel Hypoxia-Related Gene Signature with Strong Predicting Ability in Non-Small-Cell Lung Cancer Identified by Comprehensive Profiling","authors":"Huajun Yang, Zhongan Wang, L. Gong, Guichuan Huang, Daigang Chen, Xiao-peng Li, Fei Du, Jiang Lin, Xueyi Yang","doi":"10.1155/2022/8594658","DOIUrl":null,"url":null,"abstract":"Background Non-small-cell lung cancer (NSCLC) is the most common malignant tumor among males and females worldwide. Hypoxia is a typical feature of the tumor microenvironment, and it affects cancer development. Circular RNAs (circRNAs) have been reported to sponge miRNAs to regulate target gene expression and play an essential role in tumorigenesis and progression. This study is aimed at identifying whether circRNAs could be used as the diagnostic biomarkers for NSCLC. Methods The heterogeneity of samples in this study was assessed by principal component analysis (PCA). Furthermore, the Gene Expression Omnibus (GEO) database was normalized by the affy R package. We further screened the differentially expressed genes (DEGs) and differentially expressed circular RNAs (DEcircRNAs) using the DEseq2 R package. Moreover, we analyzed the Gene Ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment of DEGs using the cluster profile R package. Besides, the Gene Set Enrichment Analysis (GSEA) was used to identify the biological function of DEGs. The interaction between DEGs and the competing endogenous RNAs (ceRNA) network was detected using STRING and visualized using Cytoscape. Starbase predicted the miRNAs of target hub genes, and miRanda predicted the target miRNAs of circRNAs. The RNA-seq profiler and clinical information were downloaded from The Cancer Genome Atlas (TCGA) database. Then, the variables were assessed by the univariate and multivariate Cox proportional hazard regression models. Significant variables in the univariate Cox proportional hazard regression model were included in the multivariate Cox proportional hazard regression model to analyze the association between the variables of clinical features. Furthermore, the overall survival of variables was determined by the Kaplan-Meier survival curve, and the time-dependent receiver operating characteristic (ROC) curve analysis was used to calculate and validate the risk score in NSCLC patients. Moreover, predictive nomograms were constructed and used to predict the prognostic features between the high-risk and low-risk score groups. Results We screened a total of 2039 DEGs, including 1293 upregulated DEGs and 746 downregulated DEGs in hypoxia-treated A549 cells. A549 cells treated with hypoxia had a total of 70 DEcircRNAs, including 21 upregulated and 49 downregulated DEcircRNAs, compared to A549 cells treated with normoxia. The upregulated genes were significantly enriched in 284 GO terms and 42 KEGG pathways, while the downregulated genes were significantly enriched in 184 GO terms and 25 KEGG pathways. Moreover, the function analysis by GSEA showed enrichment in the enzyme-linked receptor protein signaling pathway, hypoxia-inducible factor- (HIF-) 1 signaling pathway, and G protein-coupled receptor (GPCR) downstream signaling. Furthermore, six hub modules and 10 hub genes, CDC45, EXO1, PLK1, RFC4, CCNB1, CDC6, MCM10, DLGAP5, AURKA, and POLE2, were identified. The ceRNA network was constructed, and it consisted of 4 circRNAs, 14 miRNAs, and 38 mRNAs. The ROC curve was constructed and calculated. The area under the curve (AUC) value was 0.62, and the optimal threshold was 0.28. Based on the optimal threshold, the patients were divided into the high-risk score and low-risk score groups. The survival rate in the high-risk score group was lower than that in the low-risk score group. The expression of SERPINE1, STC2, and LPCAT1; clinical stage; and age of the patient were significantly correlated with the high-risk score. Moreover, nomograms were established based on the risk factors in multivariate analysis, and the median survival time, 3-year survival probability, and 5-year survival were possibly predicted according to nomograms. Conclusion The ceRNA network associated with NSCLC was identified, and the hub genes, circRNAs, might act as the potential biomarkers for NSCLC.","PeriodicalId":13988,"journal":{"name":"International Journal of Genomics","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2022-05-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Genomics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1155/2022/8594658","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 1

Abstract

Background Non-small-cell lung cancer (NSCLC) is the most common malignant tumor among males and females worldwide. Hypoxia is a typical feature of the tumor microenvironment, and it affects cancer development. Circular RNAs (circRNAs) have been reported to sponge miRNAs to regulate target gene expression and play an essential role in tumorigenesis and progression. This study is aimed at identifying whether circRNAs could be used as the diagnostic biomarkers for NSCLC. Methods The heterogeneity of samples in this study was assessed by principal component analysis (PCA). Furthermore, the Gene Expression Omnibus (GEO) database was normalized by the affy R package. We further screened the differentially expressed genes (DEGs) and differentially expressed circular RNAs (DEcircRNAs) using the DEseq2 R package. Moreover, we analyzed the Gene Ontology (GO) annotation and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment of DEGs using the cluster profile R package. Besides, the Gene Set Enrichment Analysis (GSEA) was used to identify the biological function of DEGs. The interaction between DEGs and the competing endogenous RNAs (ceRNA) network was detected using STRING and visualized using Cytoscape. Starbase predicted the miRNAs of target hub genes, and miRanda predicted the target miRNAs of circRNAs. The RNA-seq profiler and clinical information were downloaded from The Cancer Genome Atlas (TCGA) database. Then, the variables were assessed by the univariate and multivariate Cox proportional hazard regression models. Significant variables in the univariate Cox proportional hazard regression model were included in the multivariate Cox proportional hazard regression model to analyze the association between the variables of clinical features. Furthermore, the overall survival of variables was determined by the Kaplan-Meier survival curve, and the time-dependent receiver operating characteristic (ROC) curve analysis was used to calculate and validate the risk score in NSCLC patients. Moreover, predictive nomograms were constructed and used to predict the prognostic features between the high-risk and low-risk score groups. Results We screened a total of 2039 DEGs, including 1293 upregulated DEGs and 746 downregulated DEGs in hypoxia-treated A549 cells. A549 cells treated with hypoxia had a total of 70 DEcircRNAs, including 21 upregulated and 49 downregulated DEcircRNAs, compared to A549 cells treated with normoxia. The upregulated genes were significantly enriched in 284 GO terms and 42 KEGG pathways, while the downregulated genes were significantly enriched in 184 GO terms and 25 KEGG pathways. Moreover, the function analysis by GSEA showed enrichment in the enzyme-linked receptor protein signaling pathway, hypoxia-inducible factor- (HIF-) 1 signaling pathway, and G protein-coupled receptor (GPCR) downstream signaling. Furthermore, six hub modules and 10 hub genes, CDC45, EXO1, PLK1, RFC4, CCNB1, CDC6, MCM10, DLGAP5, AURKA, and POLE2, were identified. The ceRNA network was constructed, and it consisted of 4 circRNAs, 14 miRNAs, and 38 mRNAs. The ROC curve was constructed and calculated. The area under the curve (AUC) value was 0.62, and the optimal threshold was 0.28. Based on the optimal threshold, the patients were divided into the high-risk score and low-risk score groups. The survival rate in the high-risk score group was lower than that in the low-risk score group. The expression of SERPINE1, STC2, and LPCAT1; clinical stage; and age of the patient were significantly correlated with the high-risk score. Moreover, nomograms were established based on the risk factors in multivariate analysis, and the median survival time, 3-year survival probability, and 5-year survival were possibly predicted according to nomograms. Conclusion The ceRNA network associated with NSCLC was identified, and the hub genes, circRNAs, might act as the potential biomarkers for NSCLC.
综合图谱鉴定非小细胞肺癌癌症中一种新的具有强预测能力的缺氧相关基因特征
背景非小细胞性癌症(NSCLC)是世界范围内男性和女性最常见的恶性肿瘤。缺氧是肿瘤微环境的一个典型特征,它影响癌症的发展。环状RNA(circRNAs)已被报道为海绵状miRNA,以调节靶基因表达,并在肿瘤发生和发展中发挥重要作用。本研究旨在确定circRNA是否可以用作NSCLC的诊断生物标志物。方法采用主成分分析法对本研究样本的异质性进行评价。此外,基因表达综合数据库(GEO)通过affy-R包进行了规范化。我们使用DEseq2R包进一步筛选了差异表达基因(DEGs)和差异表达环状RNA(DEcircRNAs)。此外,我们使用聚类图谱R包分析了基因本体论(GO)注释和京都基因和基因组百科全书(KEGG)对DEG的富集。此外,利用基因集富集分析(GSEA)对DEG的生物学功能进行了鉴定。使用STRING检测DEG和竞争内源性RNA(ceRNA)网络之间的相互作用,并使用Cytoscape进行可视化。Starbase预测了靶枢纽基因的miRNA,miRanda预测了circRNA的靶miRNA。RNA-seq剖析器和临床信息从癌症基因组图谱(TCGA)数据库下载。然后,通过单变量和多变量Cox比例风险回归模型对变量进行评估。将单变量Cox比例风险回归模型中的显著变量纳入多变量Cox比率风险回归模型,以分析临床特征变量之间的相关性。此外,通过Kaplan-Meier生存曲线确定变量的总生存率,并使用时间依赖性受试者操作特征(ROC)曲线分析来计算和验证NSCLC患者的风险评分。此外,构建了预测列线图,并用于预测高危和低危评分组之间的预后特征。结果我们在缺氧处理的A549细胞中共筛选出2039个DEG,包括1293个上调的DEG和746个下调的DEG。与用常氧处理的A549细胞相比,用缺氧处理的A549%细胞共有70个DEcircRNA,包括21个上调的和49个下调的DEcircRNAs。上调的基因在284个GO术语和42个KEGG途径中显著富集,而下调的基因在184个GO术语中和25个KEGG途径中显著丰富。此外,GSEA的功能分析显示,酶联受体蛋白信号通路、缺氧诱导因子-(HIF-)1信号通路和G蛋白偶联受体(GPCR)下游信号通路富集。此外,还鉴定了6个枢纽模块和10个枢纽基因,CDC45、EXO1、PLK1、RFC4、CCNB1、CDC6、MCM10、DLGAP5、AURKA和POLE2。构建了ceRNA网络,它由4个circRNA、14个miRNA和38个mRNA组成。构建并计算ROC曲线。曲线下面积(AUC)值为0.62,最佳阈值为0.28。根据最佳阈值,将患者分为高风险评分组和低风险评分组。高风险评分组的生存率低于低风险评分组。SERPINE1、STC2和LPCAT1的表达;临床分期;患者的年龄和高危评分显著相关。此外,根据多变量分析中的风险因素建立列线图,并根据列线图预测中位生存时间、3年生存概率和5年生存率。结论已鉴定出与NSCLC相关的ceRNA网络,其枢纽基因circRNAs可能是NSCLC的潜在生物标志物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Genomics
International Journal of Genomics BIOCHEMISTRY & MOLECULAR BIOLOGY-BIOTECHNOLOGY & APPLIED MICROBIOLOGY
CiteScore
5.40
自引率
0.00%
发文量
33
审稿时长
17 weeks
期刊介绍: International Journal of Genomics is a peer-reviewed, Open Access journal that publishes research articles as well as review articles in all areas of genome-scale analysis. Topics covered by the journal include, but are not limited to: bioinformatics, clinical genomics, disease genomics, epigenomics, evolutionary genomics, functional genomics, genome engineering, and synthetic genomics.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信