A Machine Learning-Based Hypoxia-Related Gene Signatures to Facilitate Prediction of Cetuximab Response in Patients with Colorectal Cancer.

IF 3.2 3区 医学 Q1 MEDICINE, GENERAL & INTERNAL
International Journal of Medical Sciences Pub Date : 2025-08-11 eCollection Date: 2025-01-01 DOI:10.7150/ijms.114833
Cuizhen Zhang, Wanjie Niu, Jiangtao Zhang, Yingyi Zheng, Zhiru Chen, Fali Zhang, Xiaoyan Qiu
{"title":"A Machine Learning-Based Hypoxia-Related Gene Signatures to Facilitate Prediction of Cetuximab Response in Patients with Colorectal Cancer.","authors":"Cuizhen Zhang, Wanjie Niu, Jiangtao Zhang, Yingyi Zheng, Zhiru Chen, Fali Zhang, Xiaoyan Qiu","doi":"10.7150/ijms.114833","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background</b> There is significant individual variation in the efficacy of cetuximab for the treatment of colorectal cancer (CRC). However, effective models to predict treatment outcomes are still lacking in clinical practice. <b>Methods</b> Datasets (GSE106582 and GSE83889) were used to identify differentially expressed genes (DEGs) in CRC by the 'Limma' package in R software. Hypoxia-related genes were retrieved from the Molecular Signatures Database and cross-referenced with CRC DEGs. Protein expression levels were verified using immunohistochemistry (IHC) data from the Human Protein Atlas (HPA), and prognostic significance was assessed through the Kaplan-Meier plotter platform. Additionally, pathway and immune infiltration analyses were performed using the GSCA platform. We also successfully constructed a prediction model for cetuximab treatment response using the K-nearest neighbors (KNN) algorithm in GSE108277 dataset, in which the feature selection was performed through the permutation importance method. <b>Results</b> Analysis of GSE106582 and GSE83889 identified 417 overlapping DEGs by comparing cancer tissues with normal controls, including 16 hypoxia-related genes. 6 genes (<i>BGN</i>, <i>DDIT4</i>, <i>MIF</i>, <i>SLC2A1</i>, <i>STC2</i>, and <i>TGFBI</i>) were upregulated, and 10 genes (<i>CA12</i>, <i>CITED2</i>, <i>MT1E</i>, <i>MT2A</i>, <i>NEDD4L</i>, <i>PCK1</i>, <i>PLAC8</i>, <i>PPARGC1A</i>, <i>SELENBP1</i>, and <i>SRPX</i>) were downregulated in CRC. Survival analysis revealed that the 16 hypoxia-related DEGs were linked to the survival outcomes of CRC patients. Pathway analysis indicated that these genes were almost involved in EMT, cell cycle, and RTK pathways. Furthermore, these genes play a role in the infiltration of immune cells and may regulate the immune microenvironment. A prediction model for cetuximab response was developed, based on 10 key genes (<i>CA12</i>, <i>DDIT4</i>, <i>MIF</i>, <i>MT2A</i>, <i>NEDD4L</i>, <i>PLAC8</i>, <i>SELENBP1</i>, <i>SLC2A1</i>, <i>SRPX</i>, and <i>TGFBI</i>) and dataset from GSE108277. The model demonstrated robust performance with an accuracy of 0.9500, precision of 0.8378, recall of 1.0000, F1-score of 0.9118, and a receiver operating characteristic-area under the curve (ROC-AUC) of 0.9663. <b>Conclusion</b> Our study identifies 10 hypoxia-related DEGs as key players in CRC progression and cetuximab response. And we successfully developed a predictive model to forecast the response of CRC patients to cetuximab treatment. This study will provide valuable biomarkers for CRC prognosis and help guide more effective therapeutic strategies.</p>","PeriodicalId":14031,"journal":{"name":"International Journal of Medical Sciences","volume":"22 14","pages":"3749-3762"},"PeriodicalIF":3.2000,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12434828/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.7150/ijms.114833","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background There is significant individual variation in the efficacy of cetuximab for the treatment of colorectal cancer (CRC). However, effective models to predict treatment outcomes are still lacking in clinical practice. Methods Datasets (GSE106582 and GSE83889) were used to identify differentially expressed genes (DEGs) in CRC by the 'Limma' package in R software. Hypoxia-related genes were retrieved from the Molecular Signatures Database and cross-referenced with CRC DEGs. Protein expression levels were verified using immunohistochemistry (IHC) data from the Human Protein Atlas (HPA), and prognostic significance was assessed through the Kaplan-Meier plotter platform. Additionally, pathway and immune infiltration analyses were performed using the GSCA platform. We also successfully constructed a prediction model for cetuximab treatment response using the K-nearest neighbors (KNN) algorithm in GSE108277 dataset, in which the feature selection was performed through the permutation importance method. Results Analysis of GSE106582 and GSE83889 identified 417 overlapping DEGs by comparing cancer tissues with normal controls, including 16 hypoxia-related genes. 6 genes (BGN, DDIT4, MIF, SLC2A1, STC2, and TGFBI) were upregulated, and 10 genes (CA12, CITED2, MT1E, MT2A, NEDD4L, PCK1, PLAC8, PPARGC1A, SELENBP1, and SRPX) were downregulated in CRC. Survival analysis revealed that the 16 hypoxia-related DEGs were linked to the survival outcomes of CRC patients. Pathway analysis indicated that these genes were almost involved in EMT, cell cycle, and RTK pathways. Furthermore, these genes play a role in the infiltration of immune cells and may regulate the immune microenvironment. A prediction model for cetuximab response was developed, based on 10 key genes (CA12, DDIT4, MIF, MT2A, NEDD4L, PLAC8, SELENBP1, SLC2A1, SRPX, and TGFBI) and dataset from GSE108277. The model demonstrated robust performance with an accuracy of 0.9500, precision of 0.8378, recall of 1.0000, F1-score of 0.9118, and a receiver operating characteristic-area under the curve (ROC-AUC) of 0.9663. Conclusion Our study identifies 10 hypoxia-related DEGs as key players in CRC progression and cetuximab response. And we successfully developed a predictive model to forecast the response of CRC patients to cetuximab treatment. This study will provide valuable biomarkers for CRC prognosis and help guide more effective therapeutic strategies.

基于机器学习的低氧相关基因标记有助于预测结直肠癌患者的西妥昔单抗反应。
西妥昔单抗治疗结直肠癌(CRC)的疗效存在显著的个体差异。然而,在临床实践中仍然缺乏有效的预测治疗结果的模型。方法采用R软件中的Limma软件包,利用GSE106582和GSE83889数据集对结直肠癌中的差异表达基因(deg)进行鉴定。从分子特征数据库中检索缺氧相关基因,并与CRC基因进行交叉比对。利用人类蛋白图谱(HPA)的免疫组织化学(IHC)数据验证蛋白表达水平,并通过Kaplan-Meier绘图仪平台评估预后意义。此外,使用GSCA平台进行通路和免疫浸润分析。我们还在GSE108277数据集上利用k近邻(KNN)算法成功构建了西妥昔单抗治疗反应的预测模型,其中通过排列重要性法进行特征选择。结果GSE106582和GSE83889通过对比癌组织与正常对照,鉴定出417个重叠的deg,其中包括16个缺氧相关基因。6个基因(BGN、DDIT4、MIF、SLC2A1、STC2、TGFBI)在结直肠癌中上调,10个基因(CA12、CITED2、MT1E、MT2A、NEDD4L、PCK1、PLAC8、PPARGC1A、SELENBP1、SRPX)在结直肠癌中下调。生存分析显示,16种与缺氧相关的deg与结直肠癌患者的生存结果有关。通路分析表明,这些基因几乎参与了EMT、细胞周期和RTK通路。此外,这些基因在免疫细胞的浸润中发挥作用,并可能调节免疫微环境。基于10个关键基因(CA12、DDIT4、MIF、MT2A、NEDD4L、PLAC8、SELENBP1、SLC2A1、SRPX和TGFBI)和GSE108277的数据集,建立了西妥昔单抗应答预测模型。模型的准确率为0.9500,精密度为0.8378,召回率为1.000,f1得分为0.9118,受试者工作特征曲线下面积(ROC-AUC)为0.9663,具有良好的鲁棒性。结论:我们的研究确定了10个与缺氧相关的deg是CRC进展和西妥昔单抗反应的关键参与者。我们成功地开发了一个预测模型来预测CRC患者对西妥昔单抗治疗的反应。该研究将为结直肠癌的预后提供有价值的生物标志物,并有助于指导更有效的治疗策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Medical Sciences
International Journal of Medical Sciences MEDICINE, GENERAL & INTERNAL-
CiteScore
7.20
自引率
0.00%
发文量
185
审稿时长
2.7 months
期刊介绍: Original research papers, reviews, and short research communications in any medical related area can be submitted to the Journal on the understanding that the work has not been published previously in whole or part and is not under consideration for publication elsewhere. Manuscripts in basic science and clinical medicine are both considered. There is no restriction on the length of research papers and reviews, although authors are encouraged to be concise. Short research communication is limited to be under 2500 words.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信