Junxu Chen , Dongwook Kim , Jae Young Kim , Hyung Jun Kim
{"title":"Development and multi-cohort validation of a prognostic risk score model for oral squamous cell carcinoma based on a three-gene signature","authors":"Junxu Chen , Dongwook Kim , Jae Young Kim , Hyung Jun Kim","doi":"10.1016/j.cancergen.2025.09.005","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Oral squamous cell carcinoma (OSCC) carries substantial mortality despite surgery-based management. Reliable biomarkers and practical prognostic tools are needed to guide individualized care.</div></div><div><h3>Methods</h3><div>Transcriptomic and clinical data from TCGA and multiple GEO cohorts were integrated. Candidate genes identified by differential expression analysis, WGCNA, and PPI were refined using LASSO and Random Forest, then entered a multivariable Cox model to derive a three-gene risk score. Performance was assessed with Kaplan–Meier curves, time-dependent ROC, and calibration, and validated across internal, external, and combined datasets. Expression was examined by RT-qPCR and Western blot in OSCC and normal oral cell lines. Immune infiltration and pathway enrichment analyses were conducted to contextualize biology.</div></div><div><h3>Results</h3><div>The three-gene signature (<em>CXCL12, PLAU, PXDN</em>) separated risk groups in the training cohort with 1/3/5-year AUCs of 0.767/0.625/0.714. In three independent external cohorts, high-risk patients consistently had worse overall survival (log-rank <em>p</em> = 0.0092, ≤0.0001, ≤0.0001), with time-AUC ranges of 0.581–0.747 (1-year), 0.555–0.795 (3-year), and 0.603–0.812 (5-year). In TCGA, the score remained prognostic across sex, age, smoking, drinking, and stage III/IV subgroups (all <em>p</em> ≤ 0.05), with a consistent trend in stage I/II (<em>p</em> = 0.09). A nomogram integrating clinical variables with the risk score achieved a C-index of 0.63 with good 1–5-year calibration and outperformed TNM staging alone (C-index 0.63 vs 0.58; 95 % CI 0.58–0.70 vs 0.54–0.61). RT-qPCR/Western blot confirmed consistent differential expression of all three biomarkers. Immune-infiltration and pathway analyses revealed distinct microenvironmental and molecular features across risk strata.</div></div><div><h3>Conclusion</h3><div>We present a robust, externally validated three-gene prognostic model for OSCC, supported by experimental evidence and superior to TNM staging for discrimination, offering a practical nomogram for individualized risk estimation from 1 to 5 years.</div></div>","PeriodicalId":49225,"journal":{"name":"Cancer Genetics","volume":"298 ","pages":"Pages 88-98"},"PeriodicalIF":2.1000,"publicationDate":"2025-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Genetics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2210776225001152","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Oral squamous cell carcinoma (OSCC) carries substantial mortality despite surgery-based management. Reliable biomarkers and practical prognostic tools are needed to guide individualized care.
Methods
Transcriptomic and clinical data from TCGA and multiple GEO cohorts were integrated. Candidate genes identified by differential expression analysis, WGCNA, and PPI were refined using LASSO and Random Forest, then entered a multivariable Cox model to derive a three-gene risk score. Performance was assessed with Kaplan–Meier curves, time-dependent ROC, and calibration, and validated across internal, external, and combined datasets. Expression was examined by RT-qPCR and Western blot in OSCC and normal oral cell lines. Immune infiltration and pathway enrichment analyses were conducted to contextualize biology.
Results
The three-gene signature (CXCL12, PLAU, PXDN) separated risk groups in the training cohort with 1/3/5-year AUCs of 0.767/0.625/0.714. In three independent external cohorts, high-risk patients consistently had worse overall survival (log-rank p = 0.0092, ≤0.0001, ≤0.0001), with time-AUC ranges of 0.581–0.747 (1-year), 0.555–0.795 (3-year), and 0.603–0.812 (5-year). In TCGA, the score remained prognostic across sex, age, smoking, drinking, and stage III/IV subgroups (all p ≤ 0.05), with a consistent trend in stage I/II (p = 0.09). A nomogram integrating clinical variables with the risk score achieved a C-index of 0.63 with good 1–5-year calibration and outperformed TNM staging alone (C-index 0.63 vs 0.58; 95 % CI 0.58–0.70 vs 0.54–0.61). RT-qPCR/Western blot confirmed consistent differential expression of all three biomarkers. Immune-infiltration and pathway analyses revealed distinct microenvironmental and molecular features across risk strata.
Conclusion
We present a robust, externally validated three-gene prognostic model for OSCC, supported by experimental evidence and superior to TNM staging for discrimination, offering a practical nomogram for individualized risk estimation from 1 to 5 years.
期刊介绍:
The aim of Cancer Genetics is to publish high quality scientific papers on the cellular, genetic and molecular aspects of cancer, including cancer predisposition and clinical diagnostic applications. Specific areas of interest include descriptions of new chromosomal, molecular or epigenetic alterations in benign and malignant diseases; novel laboratory approaches for identification and characterization of chromosomal rearrangements or genomic alterations in cancer cells; correlation of genetic changes with pathology and clinical presentation; and the molecular genetics of cancer predisposition. To reach a basic science and clinical multidisciplinary audience, we welcome original full-length articles, reviews, meeting summaries, brief reports, and letters to the editor.