Development and validation of machine learning models for predicting HER2-zero and HER2-low breast cancers.

IF 1.8 4区医学 Q3 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

British Journal of Radiology Pub Date : 2024-09-01 DOI:10.1093/bjr/tqae124

Xu Huang, Lei Wu, Yu Liu, Zeyan Xu, Chunling Liu, Zaiyi Liu, Changhong Liang

{"title":"Development and validation of machine learning models for predicting HER2-zero and HER2-low breast cancers.","authors":"Xu Huang, Lei Wu, Yu Liu, Zeyan Xu, Chunling Liu, Zaiyi Liu, Changhong Liang","doi":"10.1093/bjr/tqae124","DOIUrl":null,"url":null,"abstract":"Objectives: To develop and validate machine learning models for human epidermal growth factor receptor 2 (HER2)-zero and HER2-low using MRI features pre-neoadjuvant therapy (NAT).Methods: Five hundred and sixteen breast cancer patients post-NAT surgery were randomly divided into training (n = 362) and internal validation sets (n = 154) for model building and evaluation. MRI features (tumour diameter, enhancement type, background parenchymal enhancement, enhancement pattern, percentage of enhancement, signal enhancement ratio, breast oedema, and apparent diffusion coefficient) were reviewed. Logistic regression (LR), support vector machine (SVM), k-nearest neighbour (KNN), and extreme gradient boosting (XGBoost) models utilized MRI characteristics for HER2 status assessment in training and validation datasets. The best-performing model generated a HER2 score, which was subsequently correlated with pathological complete response (pCR) and disease-free survival (DFS).Results: The XGBoost model outperformed LR, SVM, and KNN, achieving an area under the receiver operating characteristic curve (AUC) of 0.783 (95% CI, 0.733-0.833) and 0.787 (95% CI, 0.709-0.865) in the validation dataset. Its HER2 score for predicting pCR had an AUC of 0.708 in the training datasets and 0.695 in the validation dataset. Additionally, the low HER2 score was significantly associated with shorter DFS in the validation dataset (hazard ratio: 2.748, 95% CI, 1.016-7.432, P = .037).Conclusions: The XGBoost model could help distinguish HER2-zero and HER2-low breast cancers and has the potential to predict pCR and prognosis in breast cancer patients undergoing NAT.Advances in knowledge: HER2-low-expressing breast cancer can benefit from the HER2-targeted therapy. Prediction of HER2-low expression is crucial for appropriate management. MRI features offer a solution to this clinical issue.","PeriodicalId":9306,"journal":{"name":"British Journal of Radiology","volume":" ","pages":"1568-1576"},"PeriodicalIF":1.8000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11332671/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"British Journal of Radiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/bjr/tqae124","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Objectives: To develop and validate machine learning models for human epidermal growth factor receptor 2 (HER2)-zero and HER2-low using MRI features pre-neoadjuvant therapy (NAT).

Methods: Five hundred and sixteen breast cancer patients post-NAT surgery were randomly divided into training (n = 362) and internal validation sets (n = 154) for model building and evaluation. MRI features (tumour diameter, enhancement type, background parenchymal enhancement, enhancement pattern, percentage of enhancement, signal enhancement ratio, breast oedema, and apparent diffusion coefficient) were reviewed. Logistic regression (LR), support vector machine (SVM), k-nearest neighbour (KNN), and extreme gradient boosting (XGBoost) models utilized MRI characteristics for HER2 status assessment in training and validation datasets. The best-performing model generated a HER2 score, which was subsequently correlated with pathological complete response (pCR) and disease-free survival (DFS).

Results: The XGBoost model outperformed LR, SVM, and KNN, achieving an area under the receiver operating characteristic curve (AUC) of 0.783 (95% CI, 0.733-0.833) and 0.787 (95% CI, 0.709-0.865) in the validation dataset. Its HER2 score for predicting pCR had an AUC of 0.708 in the training datasets and 0.695 in the validation dataset. Additionally, the low HER2 score was significantly associated with shorter DFS in the validation dataset (hazard ratio: 2.748, 95% CI, 1.016-7.432, P = .037).

Conclusions: The XGBoost model could help distinguish HER2-zero and HER2-low breast cancers and has the potential to predict pCR and prognosis in breast cancer patients undergoing NAT.

Advances in knowledge: HER2-low-expressing breast cancer can benefit from the HER2-targeted therapy. Prediction of HER2-low expression is crucial for appropriate management. MRI features offer a solution to this clinical issue.

查看原文本刊更多论文

开发和验证用于预测 HER2 为零和 HER2 为低的乳腺癌的机器学习模型。

目的方法：将 516 名接受新辅助治疗（NAT）手术后的乳腺癌患者随机分为训练集（n = 362）和内部验证集（n = 154），以建立和评估模型。回顾核磁共振成像特征（肿瘤直径、增强类型、背景实质增强、增强模式、增强百分比、信号增强比、乳腺水肿和 ADC）。逻辑回归（LR）、支持向量机（SVM）、k-近邻（KNN）和极梯度提升（XGBoost）模型在训练和验证数据集中利用 MRI 特征进行 HER2 状态评估。表现最好的模型生成了 HER2 评分，随后与病理完全反应（pCR）和无病生存期（DFS）相关联：XGBoost模型的表现优于LR、SVM和KNN，其ROC曲线下面积（AUC）分别为0.783（95% CI：0.733-0.833）和0.787（95% CI：0.709-0.865）。其预测 pCR 的 HER2 评分在训练数据集中的 AUC 为 0.708，在验证数据集中的 AUC 为 0.695。此外，在验证数据集中，低 HER2 评分与较短的 DFS 显著相关（HR：2.748,95% CI：1.016-7.432, P = 0.037）：XGBoost模型有助于区分HER2-0和HER2-低乳腺癌，并有可能预测接受NAT治疗的乳腺癌患者的pCR和预后：人表皮生长因子受体2（HER2）低表达乳腺癌可从HER2靶向治疗中获益。预测 HER2 低表达对于适当的治疗至关重要。磁共振成像特征为这一临床问题提供了解决方案。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

British Journal of Radiology 医学-核医学

CiteScore

5.30

自引率

3.80%

发文量

330

审稿时长

2-4 weeks

期刊介绍： BJR is the international research journal of the British Institute of Radiology and is the oldest scientific journal in the field of radiology and related sciences. Dating back to 1896, BJR’s history is radiology’s history, and the journal has featured some landmark papers such as the first description of Computed Tomography "Computerized transverse axial tomography" by Godfrey Hounsfield in 1973. A valuable historical resource, the complete BJR archive has been digitized from 1896. Quick Facts: - 2015 Impact Factor – 1.840 - Receipt to first decision – average of 6 weeks - Acceptance to online publication – average of 3 weeks - ISSN: 0007-1285 - eISSN: 1748-880X Open Access option