Screening, Validation, and Machine Learning-Based Evaluation of Serum Protein Biomarkers for Esophageal Squamous Cell Carcinoma Based on Single-Cell Subtype-Specific Genes
Xiuzhi Zhang, Zhefeng Xiao, Fengqi Chen, Wenke Sun, Tiandong Li, Hua Ye, Peng Wang*, Liping Dai* and Xiaoli Liu*,
{"title":"Screening, Validation, and Machine Learning-Based Evaluation of Serum Protein Biomarkers for Esophageal Squamous Cell Carcinoma Based on Single-Cell Subtype-Specific Genes","authors":"Xiuzhi Zhang, Zhefeng Xiao, Fengqi Chen, Wenke Sun, Tiandong Li, Hua Ye, Peng Wang*, Liping Dai* and Xiaoli Liu*, ","doi":"10.1021/acs.jproteome.5c00475","DOIUrl":null,"url":null,"abstract":"<p >Cellular heterogeneity of epithelial cells and fibroblasts is critical in esophageal squamous cell carcinoma development (ESCC). Identifying dysregulated subtype-specific genes in these cells is essential for early diagnosis and treatment. In this study, our pipeline integrated scRNA-seq, proteomics, and ELISA to screen biomarkers: scRNA-seq defined epithelial and fibroblast subtypes and their markers, while proteomics and secretory profiling identified dysregulated secretory proteins. Serum levels of five selected proteins were measured in 344 ESCC patients, 46 HGIN cases, and 390 normal controls. Machine learning was employed to construct diagnostic models. An interactive web tool was implemented in R Shiny. Six epithelial and four fibroblast subtypes, proportionally distinct between ESCC and normal tissues, were identified. Four validated dysregulated proteins were used to build diagnostic models; among 12 algorithms, the Support Vector Machine (SVM) achieved the best performance with AUCs of 0.829 and 0.767 in the training and validation sets, respectively (<i>p</i> > 0.05). The model effectively distinguished early- and late-stage ESCC and HGIN from normal controls. The web-based diagnostic tool is publicly available at https://zhangxz.shinyapps.io/P4_Pred/. The identified serum biomarkers may enhance early ESCC detection and diagnosis. Our pipeline, leveraging heterogeneity-related genes in fibroblasts and epithelial cells, is readily adaptable to other tumors.</p>","PeriodicalId":48,"journal":{"name":"Journal of Proteome Research","volume":"24 9","pages":"4719–4733"},"PeriodicalIF":3.6000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Proteome Research","FirstCategoryId":"99","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jproteome.5c00475","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Cellular heterogeneity of epithelial cells and fibroblasts is critical in esophageal squamous cell carcinoma development (ESCC). Identifying dysregulated subtype-specific genes in these cells is essential for early diagnosis and treatment. In this study, our pipeline integrated scRNA-seq, proteomics, and ELISA to screen biomarkers: scRNA-seq defined epithelial and fibroblast subtypes and their markers, while proteomics and secretory profiling identified dysregulated secretory proteins. Serum levels of five selected proteins were measured in 344 ESCC patients, 46 HGIN cases, and 390 normal controls. Machine learning was employed to construct diagnostic models. An interactive web tool was implemented in R Shiny. Six epithelial and four fibroblast subtypes, proportionally distinct between ESCC and normal tissues, were identified. Four validated dysregulated proteins were used to build diagnostic models; among 12 algorithms, the Support Vector Machine (SVM) achieved the best performance with AUCs of 0.829 and 0.767 in the training and validation sets, respectively (p > 0.05). The model effectively distinguished early- and late-stage ESCC and HGIN from normal controls. The web-based diagnostic tool is publicly available at https://zhangxz.shinyapps.io/P4_Pred/. The identified serum biomarkers may enhance early ESCC detection and diagnosis. Our pipeline, leveraging heterogeneity-related genes in fibroblasts and epithelial cells, is readily adaptable to other tumors.
期刊介绍:
Journal of Proteome Research publishes content encompassing all aspects of global protein analysis and function, including the dynamic aspects of genomics, spatio-temporal proteomics, metabonomics and metabolomics, clinical and agricultural proteomics, as well as advances in methodology including bioinformatics. The theme and emphasis is on a multidisciplinary approach to the life sciences through the synergy between the different types of "omics".