Bayesian Optimization–Enhanced Machine Learning for Osteosarcoma Risk Stratification Based on Sphingolipid Metabolism

IF 3.7 2区 医学 Q2 GENETICS & HEREDITY
Yujian Zhong, Ruyuan He, Zewen Jiang, Queran Lin, Fei Peng, Wenyi Jin
{"title":"Bayesian Optimization–Enhanced Machine Learning for Osteosarcoma Risk Stratification Based on Sphingolipid Metabolism","authors":"Yujian Zhong,&nbsp;Ruyuan He,&nbsp;Zewen Jiang,&nbsp;Queran Lin,&nbsp;Fei Peng,&nbsp;Wenyi Jin","doi":"10.1155/humu/2904964","DOIUrl":null,"url":null,"abstract":"<p><b>Background:</b> Heterogenized sphingolipid metabolism (SM) drives osteosarcoma tumorigenesis and its tumor-promoting microenvironment. State-of-the-art bioinformatic tools, such as machine learning, are essential for dissecting the prognostic value of SM by investigating its molecular and cellular mechanisms.</p><p><b>Methods:</b> A tailored machine learning pipeline was established by integrating Cox regression, 5-fold cross-validation, Elastic Net, eXtreme Gradient Boosting (XGBoost), and Bayesian optimization (for hyperparameters tuning) to foster an SM Elastic Net-XGBoost (SNEX) prognostic model, interpreted by the Shapley additive explanations (SHAP) algorithm. The alterations in molecular pathways and immune microenvironment–driven unfavorable prognosis of SNEX-identified high-risk osteosarcoma were further investigated. The SNEX predicted results have also been clinically and experimentally validated.</p><p><b>Results:</b> We identified 22 critical SM prognostic genes for Bayesian-optimized SNEX. This model provided outstanding estimates of the prognoses of osteosarcoma patients (C-index of 1.000). Its robustness was confirmed in the independent test set with a high area under the curve (AUC) of 0.875 at 1 year, 0.930 at 3 years, and 0.930 at 5 years. SNEX also significantly outperformed all previous genetic prognostic signatures with a significantly higher net benefit of decision curves and higher AUCs. ACTA2 was the most pivotal gene critical to the negative prediction of SNEX, while BNIP3 was for positive prediction. Mechanistically, SNEX-identified high-risk osteosarcoma suffered unfavorable prognoses due to dysregulation of many critical metabolic/inflammatory/immune biologic processes and immunosuppressive microenvironment, with reduced infiltration of 14 types of immune cells (macrophages, CD8+ T cells, NK cells, etc.). Notably, SNEX highlighted TERT as the most remarkable SM prognostic gene. Clinical osteosarcomas with high expression of TERT exhibited more significant malignant characteristics than others, as evidenced by their higher proliferation efficiency. In addition, all the experiments in vitro and in vivo validated that inhibiting TERT abundance reduces the proliferation, invasion, and migration capabilities of osteosarcoma cells.</p><p><b>Conclusions:</b> This study is a first-hand report employing a tailored machine-learning pipeline for dissecting the prognostic value and roles of SM in osteosarcoma. The present study fostered a SNEX for risk-stratification with outstanding accuracy and offered deep insights into SM-mediated pathways and microenvironment dysregulation in osteosarcoma.</p>","PeriodicalId":13061,"journal":{"name":"Human Mutation","volume":"2025 1","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/humu/2904964","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Human Mutation","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1155/humu/2904964","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Heterogenized sphingolipid metabolism (SM) drives osteosarcoma tumorigenesis and its tumor-promoting microenvironment. State-of-the-art bioinformatic tools, such as machine learning, are essential for dissecting the prognostic value of SM by investigating its molecular and cellular mechanisms.

Methods: A tailored machine learning pipeline was established by integrating Cox regression, 5-fold cross-validation, Elastic Net, eXtreme Gradient Boosting (XGBoost), and Bayesian optimization (for hyperparameters tuning) to foster an SM Elastic Net-XGBoost (SNEX) prognostic model, interpreted by the Shapley additive explanations (SHAP) algorithm. The alterations in molecular pathways and immune microenvironment–driven unfavorable prognosis of SNEX-identified high-risk osteosarcoma were further investigated. The SNEX predicted results have also been clinically and experimentally validated.

Results: We identified 22 critical SM prognostic genes for Bayesian-optimized SNEX. This model provided outstanding estimates of the prognoses of osteosarcoma patients (C-index of 1.000). Its robustness was confirmed in the independent test set with a high area under the curve (AUC) of 0.875 at 1 year, 0.930 at 3 years, and 0.930 at 5 years. SNEX also significantly outperformed all previous genetic prognostic signatures with a significantly higher net benefit of decision curves and higher AUCs. ACTA2 was the most pivotal gene critical to the negative prediction of SNEX, while BNIP3 was for positive prediction. Mechanistically, SNEX-identified high-risk osteosarcoma suffered unfavorable prognoses due to dysregulation of many critical metabolic/inflammatory/immune biologic processes and immunosuppressive microenvironment, with reduced infiltration of 14 types of immune cells (macrophages, CD8+ T cells, NK cells, etc.). Notably, SNEX highlighted TERT as the most remarkable SM prognostic gene. Clinical osteosarcomas with high expression of TERT exhibited more significant malignant characteristics than others, as evidenced by their higher proliferation efficiency. In addition, all the experiments in vitro and in vivo validated that inhibiting TERT abundance reduces the proliferation, invasion, and migration capabilities of osteosarcoma cells.

Conclusions: This study is a first-hand report employing a tailored machine-learning pipeline for dissecting the prognostic value and roles of SM in osteosarcoma. The present study fostered a SNEX for risk-stratification with outstanding accuracy and offered deep insights into SM-mediated pathways and microenvironment dysregulation in osteosarcoma.

Abstract Image

基于鞘脂代谢的骨肉瘤风险分层贝叶斯优化增强机器学习
背景:异质鞘脂代谢(SM)驱动骨肉瘤肿瘤发生及其促瘤微环境。最先进的生物信息学工具,如机器学习,对于通过研究其分子和细胞机制来剖析SM的预后价值至关重要。方法:通过整合Cox回归、5重交叉验证、Elastic Net、eXtreme Gradient Boosting (XGBoost)和贝叶斯优化(用于超参数调优),建立定制的机器学习管道,构建SM Elastic Net-XGBoost (SNEX)预测模型,并采用Shapley加性解释(SHAP)算法进行解释。进一步研究snex鉴定的高危骨肉瘤分子通路的改变和免疫微环境驱动的不良预后。SNEX预测结果也得到了临床和实验验证。结果:我们鉴定了22个关键的SM预后基因,用于贝叶斯优化的SNEX。该模型对骨肉瘤患者的预后提供了出色的估计(c指数为1000)。其稳健性在独立检验集中得到证实,1年曲线下面积(AUC)为0.875,3年为0.930,5年为0.930。SNEX还显著优于所有以前的遗传预后特征,具有更高的决策曲线净收益和更高的auc。ACTA2是SNEX阴性预测最关键的基因,而BNIP3是阳性预测最关键的基因。机制上,snex鉴定的高危骨肉瘤由于许多关键的代谢/炎症/免疫生物过程和免疫抑制微环境的失调,14种免疫细胞(巨噬细胞、CD8+ T细胞、NK细胞等)的浸润减少,预后不良。值得注意的是,SNEX强调TERT是最显著的SM预后基因。TERT高表达的临床骨肉瘤比其他骨肉瘤具有更显著的恶性特征,其增殖效率更高。此外,所有体外和体内实验均证实,抑制TERT丰度可降低骨肉瘤细胞的增殖、侵袭和迁移能力。结论:本研究是一份使用量身定制的机器学习管道来剖析SM在骨肉瘤中的预后价值和作用的第一手报告。本研究培养了一种非常准确的风险分层snx,并为骨肉瘤中sm介导的途径和微环境失调提供了深入的见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Human Mutation
Human Mutation 医学-遗传学
CiteScore
8.40
自引率
5.10%
发文量
190
审稿时长
2 months
期刊介绍: Human Mutation is a peer-reviewed journal that offers publication of original Research Articles, Methods, Mutation Updates, Reviews, Database Articles, Rapid Communications, and Letters on broad aspects of mutation research in humans. Reports of novel DNA variations and their phenotypic consequences, reports of SNPs demonstrated as valuable for genomic analysis, descriptions of new molecular detection methods, and novel approaches to clinical diagnosis are welcomed. Novel reports of gene organization at the genomic level, reported in the context of mutation investigation, may be considered. The journal provides a unique forum for the exchange of ideas, methods, and applications of interest to molecular, human, and medical geneticists in academic, industrial, and clinical research settings worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信