Development and validation of machine learning models for predicting HER2-zero and HER2-low breast cancers.
Authors: Xu Huang, Lei Wu, Yu Liu, Zeyan Xu, Chunling Liu, Zaiyi Liu, Changhong Liang
Journal: British Journal of Radiology, pages 1568-1576
Publication date: 2024-09-01
DOI: 10.1093/bjr/tqae124 (https://doi.org/10.1093/bjr/tqae124)
PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11332671/pdf/
Citations: 0
Abstract
Objectives: To develop and validate machine learning models that distinguish human epidermal growth factor receptor 2 (HER2)-zero from HER2-low breast cancers using MRI features acquired before neoadjuvant therapy (NAT).
Methods: Five hundred and sixteen breast cancer patients who underwent surgery after NAT were randomly divided into a training set (n = 362) and an internal validation set (n = 154) for model building and evaluation. MRI features (tumour diameter, enhancement type, background parenchymal enhancement, enhancement pattern, percentage of enhancement, signal enhancement ratio, breast oedema, and apparent diffusion coefficient) were reviewed. Logistic regression (LR), support vector machine (SVM), k-nearest neighbour (KNN), and extreme gradient boosting (XGBoost) models used these MRI characteristics to assess HER2 status in the training and validation datasets. The best-performing model generated a HER2 score, which was subsequently correlated with pathological complete response (pCR) and disease-free survival (DFS).
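The random division described in the Methods can be sketched as follows. This is a minimal illustration rather than the authors' code: the abstract does not state how the split was performed, so the shuffle-based procedure, the seed, and the helper name `split_patients` are assumptions. Only the set sizes (362 and 154 out of 516) come from the text.

```python
import random


def split_patients(patient_ids, n_train, seed=0):
    """Randomly partition patient IDs into training and validation sets.

    Hypothetical helper: the seed and the shuffle-based procedure are
    assumptions, not details reported in the abstract.
    """
    ids = list(patient_ids)
    if not 0 <= n_train <= len(ids):
        raise ValueError("n_train must be between 0 and the number of patients")
    rng = random.Random(seed)  # local RNG so the split is reproducible
    rng.shuffle(ids)
    return ids[:n_train], ids[n_train:]


# Sizes reported in the abstract: 516 patients, 362 training, 154 validation.
train_ids, valid_ids = split_patients(range(516), n_train=362)
print(len(train_ids), len(valid_ids))  # 362 154
```

Fixing the seed keeps the partition reproducible, so every candidate model is trained and evaluated on the same patients.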
Results: The XGBoost model outperformed LR, SVM, and KNN, achieving an area under the receiver operating characteristic curve (AUC) of 0.783 (95% CI, 0.733-0.833) in the training dataset and 0.787 (95% CI, 0.709-0.865) in the validation dataset. Its HER2 score for predicting pCR had an AUC of 0.708 in the training dataset and 0.695 in the validation dataset. Additionally, a low HER2 score was significantly associated with shorter DFS in the validation dataset (hazard ratio: 2.748, 95% CI, 1.016-7.432, P = .037).
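For readers unfamiliar with the metric, the AUC and a 95% confidence interval can be computed as in the sketch below. The abstract does not say how its intervals were derived, so the percentile bootstrap used here is an assumption, as are the function names `auc` and `bootstrap_ci` and the toy labels and scores, which are illustrative values and not study data.

```python
import random


def auc(labels, scores):
    """AUC as the probability that a positive case outscores a negative one.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def bootstrap_ci(labels, scores, n_boot=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the AUC (assumed method)."""
    if len(set(labels)) < 2:
        raise ValueError("need both classes to bootstrap the AUC")
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]  # resample with replacement
        ys = [labels[i] for i in idx]
        if len(set(ys)) < 2:
            continue  # skip resamples that contain only one class
        stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    lo_idx = max(0, int((alpha / 2) * n_boot))
    hi_idx = min(n_boot - 1, int((1 - alpha / 2) * n_boot))
    return stats[lo_idx], stats[hi_idx]


# Toy example: 1 = positive class, 0 = negative class (illustrative values only).
labels = [0, 0, 1, 1]
scores = [0.1, 0.4, 0.35, 0.8]
print(auc(labels, scores))  # 0.75
print(bootstrap_ci(labels, scores, n_boot=200))
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which puts the reported values of roughly 0.70 to 0.79 in context.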
Conclusions: The XGBoost model could help distinguish HER2-zero and HER2-low breast cancers and has the potential to predict pCR and prognosis in breast cancer patients undergoing NAT.
Advances in knowledge: HER2-low-expressing breast cancer can benefit from HER2-targeted therapy. Prediction of HER2-low expression is crucial for appropriate management. MRI features offer a solution to this clinical issue.
About the journal:
BJR is the international research journal of the British Institute of Radiology and is the oldest scientific journal in the field of radiology and related sciences.
Dating back to 1896, BJR’s history is radiology’s history, and the journal has featured landmark papers such as the first description of computed tomography, "Computerized transverse axial tomography" by Godfrey Hounsfield, in 1973. A valuable historical resource, the complete BJR archive has been digitized from 1896.
Quick Facts:
- 2015 Impact Factor – 1.840
- Receipt to first decision – average of 6 weeks
- Acceptance to online publication – average of 3 weeks
- ISSN: 0007-1285
- eISSN: 1748-880X
- Open Access option