Screening, Validation, and Machine Learning-Based Evaluation of Serum Protein Biomarkers for Esophageal Squamous Cell Carcinoma Based on Single-Cell Subtype-Specific Genes

IF 3.6 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Xiuzhi Zhang, Zhefeng Xiao, Fengqi Chen, Wenke Sun, Tiandong Li, Hua Ye, Peng Wang*, Liping Dai* and Xiaoli Liu*, 
{"title":"Screening, Validation, and Machine Learning-Based Evaluation of Serum Protein Biomarkers for Esophageal Squamous Cell Carcinoma Based on Single-Cell Subtype-Specific Genes","authors":"Xiuzhi Zhang,&nbsp;Zhefeng Xiao,&nbsp;Fengqi Chen,&nbsp;Wenke Sun,&nbsp;Tiandong Li,&nbsp;Hua Ye,&nbsp;Peng Wang*,&nbsp;Liping Dai* and Xiaoli Liu*,&nbsp;","doi":"10.1021/acs.jproteome.5c00475","DOIUrl":null,"url":null,"abstract":"<p >Cellular heterogeneity of epithelial cells and fibroblasts is critical in esophageal squamous cell carcinoma development (ESCC). Identifying dysregulated subtype-specific genes in these cells is essential for early diagnosis and treatment. In this study, our pipeline integrated scRNA-seq, proteomics, and ELISA to screen biomarkers: scRNA-seq defined epithelial and fibroblast subtypes and their markers, while proteomics and secretory profiling identified dysregulated secretory proteins. Serum levels of five selected proteins were measured in 344 ESCC patients, 46 HGIN cases, and 390 normal controls. Machine learning was employed to construct diagnostic models. An interactive web tool was implemented in R Shiny. Six epithelial and four fibroblast subtypes, proportionally distinct between ESCC and normal tissues, were identified. Four validated dysregulated proteins were used to build diagnostic models; among 12 algorithms, the Support Vector Machine (SVM) achieved the best performance with AUCs of 0.829 and 0.767 in the training and validation sets, respectively (<i>p</i> &gt; 0.05). The model effectively distinguished early- and late-stage ESCC and HGIN from normal controls. The web-based diagnostic tool is publicly available at https://zhangxz.shinyapps.io/P4_Pred/. The identified serum biomarkers may enhance early ESCC detection and diagnosis. Our pipeline, leveraging heterogeneity-related genes in fibroblasts and epithelial cells, is readily adaptable to other tumors.</p>","PeriodicalId":48,"journal":{"name":"Journal of Proteome Research","volume":"24 9","pages":"4719–4733"},"PeriodicalIF":3.6000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Proteome Research","FirstCategoryId":"99","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.jproteome.5c00475","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Cellular heterogeneity of epithelial cells and fibroblasts is critical in esophageal squamous cell carcinoma development (ESCC). Identifying dysregulated subtype-specific genes in these cells is essential for early diagnosis and treatment. In this study, our pipeline integrated scRNA-seq, proteomics, and ELISA to screen biomarkers: scRNA-seq defined epithelial and fibroblast subtypes and their markers, while proteomics and secretory profiling identified dysregulated secretory proteins. Serum levels of five selected proteins were measured in 344 ESCC patients, 46 HGIN cases, and 390 normal controls. Machine learning was employed to construct diagnostic models. An interactive web tool was implemented in R Shiny. Six epithelial and four fibroblast subtypes, proportionally distinct between ESCC and normal tissues, were identified. Four validated dysregulated proteins were used to build diagnostic models; among 12 algorithms, the Support Vector Machine (SVM) achieved the best performance with AUCs of 0.829 and 0.767 in the training and validation sets, respectively (p > 0.05). The model effectively distinguished early- and late-stage ESCC and HGIN from normal controls. The web-based diagnostic tool is publicly available at https://zhangxz.shinyapps.io/P4_Pred/. The identified serum biomarkers may enhance early ESCC detection and diagnosis. Our pipeline, leveraging heterogeneity-related genes in fibroblasts and epithelial cells, is readily adaptable to other tumors.

Abstract Image

基于单细胞亚型特异性基因的食管鳞状细胞癌血清蛋白生物标志物的筛选、验证和机器学习评估
上皮细胞和成纤维细胞的细胞异质性是食管鳞状细胞癌(ESCC)发展的关键。识别这些细胞中失调的亚型特异性基因对于早期诊断和治疗至关重要。在这项研究中,我们的产品线整合了scRNA-seq,蛋白质组学和ELISA来筛选生物标志物:scRNA-seq定义上皮和成纤维细胞亚型及其标记,而蛋白质组学和分泌谱鉴定了失调的分泌蛋白。在344例ESCC患者、46例HGIN患者和390例正常对照中测定了5种选定蛋白的血清水平。采用机器学习构建诊断模型。在R Shiny中实现了一个交互式web工具。6种上皮细胞亚型和4种成纤维细胞亚型,在ESCC和正常组织中比例不同。使用四种验证过的失调蛋白建立诊断模型;在12种算法中,支持向量机(SVM)在训练集和验证集上的auc分别为0.829和0.767,表现最佳(p < 0.05)。该模型有效地区分了早期和晚期ESCC和HGIN与正常对照。这个基于网络的诊断工具可在https://zhangxz.shinyapps.io/P4_Pred/上公开获得。确定的血清生物标志物可提高ESCC的早期发现和诊断。我们的管道利用成纤维细胞和上皮细胞中的异质性相关基因,很容易适应其他肿瘤。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Proteome Research
Journal of Proteome Research 生物-生化研究方法
CiteScore
9.00
自引率
4.50%
发文量
251
审稿时长
3 months
期刊介绍: Journal of Proteome Research publishes content encompassing all aspects of global protein analysis and function, including the dynamic aspects of genomics, spatio-temporal proteomics, metabonomics and metabolomics, clinical and agricultural proteomics, as well as advances in methodology including bioinformatics. The theme and emphasis is on a multidisciplinary approach to the life sciences through the synergy between the different types of "omics".
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信