使用18F-FDG PET/CT参数和机器学习对抗her2治疗的乳腺癌候选人进行个性化预测：一项双中心研究

IF 3.5 3区医学 Q2 ONCOLOGY

Frontiers in Oncology Pub Date : 2025-05-14 eCollection Date: 2025-01-01 DOI:10.3389/fonc.2025.1590769

Zhenguo Sun, Jianxiong Gao, Wenji Yu, Xiaoshuai Yuan, Peng Du, Peng Chen, Yuetao Wang

{"title":"使用18F-FDG PET/CT参数和机器学习对抗her2治疗的乳腺癌候选人进行个性化预测：一项双中心研究","authors":"Zhenguo Sun, Jianxiong Gao, Wenji Yu, Xiaoshuai Yuan, Peng Du, Peng Chen, Yuetao Wang","doi":"10.3389/fonc.2025.1590769","DOIUrl":null,"url":null,"abstract":"Background: Accurately evaluating human epidermal growth factor receptor (HER2) expression status in breast cancer enables clinicians to develop individualized treatment plans and improve patient prognosis. The purpose of this study was to assess the performance of a machine learning (ML) model that was developed using 18F-FDG PET/CT parameters and clinicopathological features in distinguishing different levels of HER2 expression in breast cancer.Methods: This retrospective study enrolled breast cancer patients who underwent 18F-FDG PET/CT scans prior to treatment at Lianyungang First People's Hospital (centre 1, n=157) and the Third Affiliated Hospital of Soochow University (centre 2, n=84). Two classification tasks were analysed: distinguishing HER2-zero expression from HER2-low/positive expression (Task 1) and distinguishing HER2-low expression from HER2-positive expression (Task 2). For each task, patients from Centre 1 were randomly divided into training and internal test sets at a 7:3 ratio, whereas patients from Centre 2 served as an external test set. The prediction models included logistic regression (LR), support vector machine (SVM), extreme gradient boosting (XGBoost) and multilayer perceptron (MLP), and SHAP analysis provided model interpretability. Model performance was evaluated via the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV).Results: XGBoost models exhibited the best predictive performance in both tasks. For Task 1, recursive feature elimination (RFE) was used to select 8 features, excluding pathological features, and the XGBoost model achieved AUCs of 0.888, 0.844 and 0.759 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the tumour minimum diameter, mean standardized uptake value (SUVmean) and CTmean. For Task 2, 9 features were selected, including progesterone receptor (PR) status as a pathological feature. The XGBoost model achieved AUCs of 0.920, 0.814 and 0.693 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the PR status, maximum tumour diameter and metabolic tumour volume (MTV).Conclusions: ML models that incorporate 18F-FDG PET/CT parameters and clinicopathological features can aid in the prediction of different HER2 expression statuses in breast cancer.","PeriodicalId":12482,"journal":{"name":"Frontiers in Oncology","volume":"15 ","pages":"1590769"},"PeriodicalIF":3.5000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12116446/pdf/","citationCount":"0","resultStr":"{\"title\":\"Personalized prediction of breast cancer candidates for Anti-HER2 therapy using 18F-FDG PET/CT parameters and machine learning: a dual-center study.\",\"authors\":\"Zhenguo Sun, Jianxiong Gao, Wenji Yu, Xiaoshuai Yuan, Peng Du, Peng Chen, Yuetao Wang\",\"doi\":\"10.3389/fonc.2025.1590769\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Background: Accurately evaluating human epidermal growth factor receptor (HER2) expression status in breast cancer enables clinicians to develop individualized treatment plans and improve patient prognosis. The purpose of this study was to assess the performance of a machine learning (ML) model that was developed using 18F-FDG PET/CT parameters and clinicopathological features in distinguishing different levels of HER2 expression in breast cancer.Methods: This retrospective study enrolled breast cancer patients who underwent 18F-FDG PET/CT scans prior to treatment at Lianyungang First People's Hospital (centre 1, n=157) and the Third Affiliated Hospital of Soochow University (centre 2, n=84). Two classification tasks were analysed: distinguishing HER2-zero expression from HER2-low/positive expression (Task 1) and distinguishing HER2-low expression from HER2-positive expression (Task 2). For each task, patients from Centre 1 were randomly divided into training and internal test sets at a 7:3 ratio, whereas patients from Centre 2 served as an external test set. The prediction models included logistic regression (LR), support vector machine (SVM), extreme gradient boosting (XGBoost) and multilayer perceptron (MLP), and SHAP analysis provided model interpretability. Model performance was evaluated via the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV).Results: XGBoost models exhibited the best predictive performance in both tasks. For Task 1, recursive feature elimination (RFE) was used to select 8 features, excluding pathological features, and the XGBoost model achieved AUCs of 0.888, 0.844 and 0.759 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the tumour minimum diameter, mean standardized uptake value (SUVmean) and CTmean. For Task 2, 9 features were selected, including progesterone receptor (PR) status as a pathological feature. The XGBoost model achieved AUCs of 0.920, 0.814 and 0.693 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the PR status, maximum tumour diameter and metabolic tumour volume (MTV).Conclusions: ML models that incorporate 18F-FDG PET/CT parameters and clinicopathological features can aid in the prediction of different HER2 expression statuses in breast cancer.\",\"PeriodicalId\":12482,\"journal\":{\"name\":\"Frontiers in Oncology\",\"volume\":\"15 \",\"pages\":\"1590769\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-05-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12116446/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Oncology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3389/fonc.2025.1590769\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fonc.2025.1590769","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

背景：准确评估人表皮生长因子受体（HER2）在乳腺癌中的表达状况，有助于临床医生制定个体化治疗方案，改善患者预后。本研究的目的是评估机器学习（ML）模型的性能，该模型使用18F-FDG PET/CT参数和临床病理特征来区分乳腺癌中不同水平的HER2表达。方法：本回顾性研究纳入了连云港市第一人民医院（中心1，n=157）和东吴大学第三附属医院（中心2，n=84）治疗前行18F-FDG PET/CT扫描的乳腺癌患者。分析了两项分类任务：区分her2 -零表达与her2 -低表达/阳性表达（任务1）和区分her2 -低表达与her2 -阳性表达（任务2）。对于每个任务，中心1的患者以7:3的比例随机分为训练组和内部测试组，而中心2的患者作为外部测试组。预测模型包括逻辑回归（LR）、支持向量机（SVM）、极端梯度增强（XGBoost）和多层感知器（MLP）， SHAP分析提供了模型的可解释性。通过受试者工作特征曲线下面积（AUC）、准确性、敏感性、特异性、阳性预测值（PPV）和阴性预测值（NPV）评价模型的性能。结果：XGBoost模型在两个任务中都表现出最好的预测性能。对于Task 1，使用递归特征消去（RFE）剔除病理特征，选择8个特征，XGBoost模型在训练集、内部集和外部集上的auc分别为0.888、0.844和0.759。根据SHAP值排名前三位的特征是肿瘤最小直径、平均标准化摄取值（SUVmean）和CTmean。在任务2中，选择了9个特征，包括孕激素受体（PR）状态作为病理特征。XGBoost模型在训练集、内部测试集和外部测试集的auc分别为0.920、0.814和0.693。根据SHAP值，前三位特征是PR状态、最大肿瘤直径和代谢肿瘤体积（MTV）。结论：结合18F-FDG PET/CT参数和临床病理特征的ML模型有助于预测乳腺癌中不同的HER2表达状态。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Personalized prediction of breast cancer candidates for Anti-HER2 therapy using ¹⁸F-FDG PET/CT parameters and machine learning: a dual-center study.

Background: Accurately evaluating human epidermal growth factor receptor (HER2) expression status in breast cancer enables clinicians to develop individualized treatment plans and improve patient prognosis. The purpose of this study was to assess the performance of a machine learning (ML) model that was developed using ¹⁸F-FDG PET/CT parameters and clinicopathological features in distinguishing different levels of HER2 expression in breast cancer.

Methods: This retrospective study enrolled breast cancer patients who underwent ¹⁸F-FDG PET/CT scans prior to treatment at Lianyungang First People's Hospital (centre 1, n=157) and the Third Affiliated Hospital of Soochow University (centre 2, n=84). Two classification tasks were analysed: distinguishing HER2-zero expression from HER2-low/positive expression (Task 1) and distinguishing HER2-low expression from HER2-positive expression (Task 2). For each task, patients from Centre 1 were randomly divided into training and internal test sets at a 7:3 ratio, whereas patients from Centre 2 served as an external test set. The prediction models included logistic regression (LR), support vector machine (SVM), extreme gradient boosting (XGBoost) and multilayer perceptron (MLP), and SHAP analysis provided model interpretability. Model performance was evaluated via the area under the receiver operating characteristic curve (AUC), accuracy, sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV).

Results: XGBoost models exhibited the best predictive performance in both tasks. For Task 1, recursive feature elimination (RFE) was used to select 8 features, excluding pathological features, and the XGBoost model achieved AUCs of 0.888, 0.844 and 0.759 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the tumour minimum diameter, mean standardized uptake value (SUVmean) and CTmean. For Task 2, 9 features were selected, including progesterone receptor (PR) status as a pathological feature. The XGBoost model achieved AUCs of 0.920, 0.814 and 0.693 for the training, internal and external testing sets, respectively. The top three features according to the SHAP values were the PR status, maximum tumour diameter and metabolic tumour volume (MTV).

Conclusions: ML models that incorporate ¹⁸F-FDG PET/CT parameters and clinicopathological features can aid in the prediction of different HER2 expression statuses in breast cancer.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Frontiers in Oncology Biochemistry, Genetics and Molecular Biology-Cancer Research

CiteScore

6.20

自引率

10.60%

发文量

6641

审稿时长

14 weeks

期刊介绍： Cancer Imaging and Diagnosis is dedicated to the publication of results from clinical and research studies applied to cancer diagnosis and treatment. The section aims to publish studies from the entire field of cancer imaging: results from routine use of clinical imaging in both radiology and nuclear medicine, results from clinical trials, experimental molecular imaging in humans and small animals, research on new contrast agents in CT, MRI, ultrasound, publication of new technical applications and processing algorithms to improve the standardization of quantitative imaging and image guided interventions for the diagnosis and treatment of cancer.