Machine learning prediction of HER2-low expression in breast cancers based on hematoxylin-eosin-stained slides.

IF 7.4 1区 医学 Q1 Medicine
Jun Du, Jun Shi, Dongdong Sun, Yifei Wang, Guanfeng Liu, Jingru Chen, Wei Wang, Wenchao Zhou, Yushan Zheng, Haibo Wu
{"title":"Machine learning prediction of HER2-low expression in breast cancers based on hematoxylin-eosin-stained slides.","authors":"Jun Du, Jun Shi, Dongdong Sun, Yifei Wang, Guanfeng Liu, Jingru Chen, Wei Wang, Wenchao Zhou, Yushan Zheng, Haibo Wu","doi":"10.1186/s13058-025-01998-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Treatment with HER2-targeted therapies is recommended for HER2-positive breast cancer patients with HER2 gene amplification or protein overexpression. Interestingly, recent clinical trials of novel HER2-targeted therapies demonstrated promising efficacy in HER2-low breast cancers, raising the prospect of including a HER2-low category (immunohistochemistry, IHC) score of 1 + or 2 + with non-amplified in-situ hybridization for HER2-targeted treatments, which necessitated the accurate detection and evaluation of HER2 expression in tumors. Traditionally, HER2 protein levels are routinely assessed by IHC in clinical practice, which not only requires significant time consumption and financial investment but is also technically challenging for many basic hospitals in developing countries. Therefore, directly predicting HER2 expression by hematoxylin-eosin (HE) staining should be of significant clinical values, and machine learning may be a potent technology to achieve this goal.</p><p><strong>Methods: </strong>In this study, we developed an artificial intelligence (AI) classification model using whole slide image of HE-stained slides to automatically assess HER2 status.</p><p><strong>Results: </strong>A publicly available TCGA-BRCA dataset and an in-house USTC-BC dataset were applied to evaluate our AI model and the state-of-the-art method SlideGraph + in terms of accuracy (ACC), the area under the receiver operating characteristic curve (AUC), and F1 score. Overall, our AI model achieved the superior performance in HER2 scoring in both datasets with AUC of 0.795 ± 0.028 and 0.688 ± 0.008 on the USCT-BC and TCGA-BRCA datasets, respectively. In addition, we visualized the results generated from our AI model by attention heatmaps, which proved that our AI model had strong interpretability.</p><p><strong>Conclusion: </strong>Our AI model is able to directly predict HER2 expression through HE images with strong interpretability, and has a better ACC particularly in HER2-low breast cancers, which provides a method for AI evaluation of HER2 status and helps to perform HER2 evaluation economically and efficiently. It has the potential to assist pathologists to improve diagnosis and assess biomarkers for companion diagnostics.</p>","PeriodicalId":49227,"journal":{"name":"Breast Cancer Research","volume":"27 1","pages":"57"},"PeriodicalIF":7.4000,"publicationDate":"2025-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12008878/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Breast Cancer Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s13058-025-01998-8","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Treatment with HER2-targeted therapies is recommended for HER2-positive breast cancer patients with HER2 gene amplification or protein overexpression. Interestingly, recent clinical trials of novel HER2-targeted therapies demonstrated promising efficacy in HER2-low breast cancers, raising the prospect of including a HER2-low category (immunohistochemistry, IHC) score of 1 + or 2 + with non-amplified in-situ hybridization for HER2-targeted treatments, which necessitated the accurate detection and evaluation of HER2 expression in tumors. Traditionally, HER2 protein levels are routinely assessed by IHC in clinical practice, which not only requires significant time consumption and financial investment but is also technically challenging for many basic hospitals in developing countries. Therefore, directly predicting HER2 expression by hematoxylin-eosin (HE) staining should be of significant clinical values, and machine learning may be a potent technology to achieve this goal.

Methods: In this study, we developed an artificial intelligence (AI) classification model using whole slide image of HE-stained slides to automatically assess HER2 status.

Results: A publicly available TCGA-BRCA dataset and an in-house USTC-BC dataset were applied to evaluate our AI model and the state-of-the-art method SlideGraph + in terms of accuracy (ACC), the area under the receiver operating characteristic curve (AUC), and F1 score. Overall, our AI model achieved the superior performance in HER2 scoring in both datasets with AUC of 0.795 ± 0.028 and 0.688 ± 0.008 on the USCT-BC and TCGA-BRCA datasets, respectively. In addition, we visualized the results generated from our AI model by attention heatmaps, which proved that our AI model had strong interpretability.

Conclusion: Our AI model is able to directly predict HER2 expression through HE images with strong interpretability, and has a better ACC particularly in HER2-low breast cancers, which provides a method for AI evaluation of HER2 status and helps to perform HER2 evaluation economically and efficiently. It has the potential to assist pathologists to improve diagnosis and assess biomarkers for companion diagnostics.

基于苏木精-伊红染色切片的机器学习预测乳腺癌中her2低表达。
背景:对于HER2基因扩增或蛋白过表达的HER2阳性乳腺癌患者,推荐采用HER2靶向治疗。有趣的是,最近新型HER2靶向治疗的临床试验显示,在低HER2乳腺癌中有很好的疗效,这增加了HER2靶向治疗的前景,包括HER2低类别(免疫组织化学,IHC)评分为1 +或2 +,非扩增原位杂交,这需要准确检测和评估肿瘤中HER2的表达。传统上,在临床实践中,HER2蛋白水平是通过免疫健康法常规评估的,这不仅需要大量的时间和资金投入,而且对发展中国家的许多基础医院在技术上也具有挑战性。因此,通过苏木精-伊红(HE)染色直接预测HER2的表达应该具有重要的临床价值,而机器学习可能是实现这一目标的有力技术。方法:在本研究中,我们建立了一个人工智能(AI)分类模型,利用he染色的全切片图像自动评估HER2状态。结果:使用公开的TCGA-BRCA数据集和内部的USTC-BC数据集来评估我们的人工智能模型和最先进的方法SlideGraph +在准确性(ACC)、接收者工作特征曲线下面积(AUC)和F1分数方面的效果。总体而言,我们的人工智能模型在两个数据集上的HER2评分都取得了优异的表现,在USCT-BC和TCGA-BRCA数据集上的AUC分别为0.795±0.028和0.688±0.008。此外,我们通过注意力热图将我们的AI模型生成的结果可视化,证明了我们的AI模型具有很强的可解释性。结论:我们的AI模型能够通过HE图像直接预测HER2的表达,具有较强的可解释性,特别是在HER2低的乳腺癌中具有较好的ACC,为HER2状态的AI评估提供了一种方法,有助于经济高效地进行HER2评估。它有可能帮助病理学家提高诊断和评估伴随诊断的生物标志物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
12.00
自引率
0.00%
发文量
76
审稿时长
12 weeks
期刊介绍: Breast Cancer Research, an international, peer-reviewed online journal, publishes original research, reviews, editorials, and reports. It features open-access research articles of exceptional interest across all areas of biology and medicine relevant to breast cancer. This includes normal mammary gland biology, with a special emphasis on the genetic, biochemical, and cellular basis of breast cancer. In addition to basic research, the journal covers preclinical, translational, and clinical studies with a biological basis, including Phase I and Phase II trials.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信