{"title":"Machine learning for identifying liver and pancreas cancers through comprehensive serum glycopeptide spectra analysis: a case-control study.","authors":"Motoyuki Kohjima, Yuko Takami, Ken Kawabe, Kazuhiro Tanabe, Chihiro Hayashi, Mikio Mikami, Tetsuya Kusumoto","doi":"10.1002/1878-0261.70084","DOIUrl":null,"url":null,"abstract":"<p><p>Liver and pancreatic cancers are difficult to detect early, leading to high mortality rates. Blood-based diagnostics present a viable alternative for earlier detection, potentially improving survival rates. The comprehensive serum glycopeptide spectra analysis (CSGSA) method combines enriched glycopeptides (EGPs) with conventional tumor markers through machine learning to accurately identify early stage cancers. Here, we analyzed nine tumor markers (CA19-9, AFP, PSA, CEA, CA125, CYFRA, CA15-3, SCC antigen, and NCC-ST439) in 119 patients with pancreatic cancer and 49 with hepatocellular carcinoma, alongside 590 healthy controls. We also analyzed EGPs using liquid chromatography-mass spectrometry. We found that α1-antitrypsin with a fully sialylated biantennary glycan at asparagine 271 and α2-macroglobulin with a fully sialylated biantennary glycan at asparagine 70 effectively distinguished liver and pancreatic cancers. The integration of these two glycopeptides, along with the nine tumor markers and 1688 EGPs using a machine learning model enhanced diagnostic accuracy, achieving a receiver operating characteristic-area under curve (ROC-AUC) score of 0.996. CSGSA has the potential to minimize the need for invasive diagnostic procedures and serves as a promising tool for widespread screening.</p>","PeriodicalId":18764,"journal":{"name":"Molecular Oncology","volume":" ","pages":""},"PeriodicalIF":4.5000,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/1878-0261.70084","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Biochemistry, Genetics and Molecular Biology","Score":null,"Total":0}
引用次数: 0
Abstract
Liver and pancreatic cancers are difficult to detect early, leading to high mortality rates. Blood-based diagnostics present a viable alternative for earlier detection, potentially improving survival rates. The comprehensive serum glycopeptide spectra analysis (CSGSA) method combines enriched glycopeptides (EGPs) with conventional tumor markers through machine learning to accurately identify early stage cancers. Here, we analyzed nine tumor markers (CA19-9, AFP, PSA, CEA, CA125, CYFRA, CA15-3, SCC antigen, and NCC-ST439) in 119 patients with pancreatic cancer and 49 with hepatocellular carcinoma, alongside 590 healthy controls. We also analyzed EGPs using liquid chromatography-mass spectrometry. We found that α1-antitrypsin with a fully sialylated biantennary glycan at asparagine 271 and α2-macroglobulin with a fully sialylated biantennary glycan at asparagine 70 effectively distinguished liver and pancreatic cancers. The integration of these two glycopeptides, along with the nine tumor markers and 1688 EGPs using a machine learning model enhanced diagnostic accuracy, achieving a receiver operating characteristic-area under curve (ROC-AUC) score of 0.996. CSGSA has the potential to minimize the need for invasive diagnostic procedures and serves as a promising tool for widespread screening.
Molecular OncologyBiochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
11.80
自引率
1.50%
发文量
203
审稿时长
10 weeks
期刊介绍:
Molecular Oncology highlights new discoveries, approaches, and technical developments, in basic, clinical and discovery-driven translational cancer research. It publishes research articles, reviews (by invitation only), and timely science policy articles.
The journal is now fully Open Access with all articles published over the past 10 years freely available.