{"title":"SRPS:多中心蛋白质组学亚型和生物标志物发现的生存强化迁移学习。","authors":"Linhai Xie, Pei Jiang, Cheng Chang","doi":"10.1093/gpbjnl/qzaf052","DOIUrl":null,"url":null,"abstract":"<p><p>Omics-based molecular subtyping in large-scale and multicentric cohort studies is a prerequisite for proteomics-driven precision medicine (PDPM). However, keeping the subtypes with robust molecular features and significant associations with prognosis across different cohorts is challenging due to the biological heterogeneity and technical inconsistency. Herein, we propose a subtyping algorithm, named Survival Reinforced Patient Stratification (SRPS), to adapt the known subtypes from a discovery cohort to another by simultaneously preserving the distinct prognosis and molecular characteristics of each subtype. SRPS has been benchmarked on simulated and real-world datasets, where it shows a 12% increase in classification accuracy and possesses the best prognostic discrimination. Moreover, based on the calculated subtype significance score, an \"unpopular\" protein, Peptidylprolyl Isomerase C (PPIC), was identified as the top-1 remarkable protein for subtyping the hepatocellular carcinoma (HCC) patients with the worst prognosis. Eventually, PPIC was experimentally proved to be a pro-cancer protein in HCC, confirming our work as a demonstration of interpretable machine learning guided biological discovery in PDPM research. SRPS is publicly available at https://github.com/PHOENIXcenter/SRPS and https://ngdc.cncb.ac.cn/biocode/tool/BT007770.</p>","PeriodicalId":94020,"journal":{"name":"Genomics, proteomics & bioinformatics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SRPS: Survival Reinforced Transfer Learning for Multicentric Proteomic Subtyping and Biomarker Discovery.\",\"authors\":\"Linhai Xie, Pei Jiang, Cheng Chang\",\"doi\":\"10.1093/gpbjnl/qzaf052\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Omics-based molecular subtyping in large-scale and multicentric cohort studies is a prerequisite for proteomics-driven precision medicine (PDPM). However, keeping the subtypes with robust molecular features and significant associations with prognosis across different cohorts is challenging due to the biological heterogeneity and technical inconsistency. Herein, we propose a subtyping algorithm, named Survival Reinforced Patient Stratification (SRPS), to adapt the known subtypes from a discovery cohort to another by simultaneously preserving the distinct prognosis and molecular characteristics of each subtype. SRPS has been benchmarked on simulated and real-world datasets, where it shows a 12% increase in classification accuracy and possesses the best prognostic discrimination. Moreover, based on the calculated subtype significance score, an \\\"unpopular\\\" protein, Peptidylprolyl Isomerase C (PPIC), was identified as the top-1 remarkable protein for subtyping the hepatocellular carcinoma (HCC) patients with the worst prognosis. Eventually, PPIC was experimentally proved to be a pro-cancer protein in HCC, confirming our work as a demonstration of interpretable machine learning guided biological discovery in PDPM research. SRPS is publicly available at https://github.com/PHOENIXcenter/SRPS and https://ngdc.cncb.ac.cn/biocode/tool/BT007770.</p>\",\"PeriodicalId\":94020,\"journal\":{\"name\":\"Genomics, proteomics & bioinformatics\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Genomics, proteomics & bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1093/gpbjnl/qzaf052\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Genomics, proteomics & bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/gpbjnl/qzaf052","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SRPS: Survival Reinforced Transfer Learning for Multicentric Proteomic Subtyping and Biomarker Discovery.
Omics-based molecular subtyping in large-scale and multicentric cohort studies is a prerequisite for proteomics-driven precision medicine (PDPM). However, keeping the subtypes with robust molecular features and significant associations with prognosis across different cohorts is challenging due to the biological heterogeneity and technical inconsistency. Herein, we propose a subtyping algorithm, named Survival Reinforced Patient Stratification (SRPS), to adapt the known subtypes from a discovery cohort to another by simultaneously preserving the distinct prognosis and molecular characteristics of each subtype. SRPS has been benchmarked on simulated and real-world datasets, where it shows a 12% increase in classification accuracy and possesses the best prognostic discrimination. Moreover, based on the calculated subtype significance score, an "unpopular" protein, Peptidylprolyl Isomerase C (PPIC), was identified as the top-1 remarkable protein for subtyping the hepatocellular carcinoma (HCC) patients with the worst prognosis. Eventually, PPIC was experimentally proved to be a pro-cancer protein in HCC, confirming our work as a demonstration of interpretable machine learning guided biological discovery in PDPM research. SRPS is publicly available at https://github.com/PHOENIXcenter/SRPS and https://ngdc.cncb.ac.cn/biocode/tool/BT007770.