Human epidermal growth factor receptor 2 (HER2) expression dynamics between diagnosis and recurrence in patients with breast cancer using artificial intelligence and electronic health records: the RosHER study
E. López-Miranda , P. Tolosa-Ortega , M.A. Perelló-Martorell , L. Sánchez-Lorenzo , C. Hinojo-González , S. Servitja , S. Recalde-Penabad , C. Olier-Gárate , J.A. Guerrero , S. García-Vicente , L. Mina , D. Alcalá-López , L. López-Montero , C. Jiménez-Cortegana , M. Sampayo-Cordero , G. Antonarelli , J.M. Pérez-García , J. Cortés , A. Llombart-Cussac
{"title":"Human epidermal growth factor receptor 2 (HER2) expression dynamics between diagnosis and recurrence in patients with breast cancer using artificial intelligence and electronic health records: the RosHER study","authors":"E. López-Miranda , P. Tolosa-Ortega , M.A. Perelló-Martorell , L. Sánchez-Lorenzo , C. Hinojo-González , S. Servitja , S. Recalde-Penabad , C. Olier-Gárate , J.A. Guerrero , S. García-Vicente , L. Mina , D. Alcalá-López , L. López-Montero , C. Jiménez-Cortegana , M. Sampayo-Cordero , G. Antonarelli , J.M. Pérez-García , J. Cortés , A. Llombart-Cussac","doi":"10.1016/j.esmorw.2024.100107","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Human epidermal growth factor receptor 2 (HER2) is a treatment target in breast cancer (BC), driving therapeutic strategies. Changes over time in HER2 expression have been described and understanding of these fluctuations is crucial for personalized medicine. We aimed to assess HER2 expression dynamics using real-world data and natural language processing (NLP) from electronic health records (EHRs).</div></div><div><h3>Material and methods</h3><div>RosHER is a retrospective, observational, longitudinal, population-based, multicenter study (NCT05217381). An NLP tool extracted HER2 information from EHRs of adult patients with early, locally advanced, or <em>de novo</em> metastatic BC, who were initially diagnosed between 2005 and 2021. The primary endpoint was to evaluate HER2 dynamics in HER2 status and expression between initial diagnosis and recurrence or progression using NLP. The secondary endpoints were description of baseline clinicopathological characteristics and treatment patterns.</div></div><div><h3>Results</h3><div>Between January 2022 and November 2023, 18 533 patients were selected from seven Spanish sites. A cut-off of ≥6 months was established between initial determination and relapse or progression. The artificial intelligence (AI)-based tool identified 510 patients with two documented HER2 determinations and 209 with HER2 expression by immunohistochemistry/<em>in situ</em> hybridization. Overall discordances were 10.6% in HER2 status and 34.0% in HER2 expression. HER2-zero expression switched to HER2-low (23.2%), but not HER2-positive (0%); HER2-low expression converted to HER2-zero (32.0%) and HER2-positive (7.0%); finally HER2-positive expression switched to HER2-low (20.8%) and HER2-zero (15.1%).</div></div><div><h3>Conclusions</h3><div>This is the first study using NLP to evaluate HER2 discordances, which need to be further investigated. Improving AI methods and implementing similar EHR structures among hospitals would increase the success in clinical data extraction.</div></div>","PeriodicalId":100491,"journal":{"name":"ESMO Real World Data and Digital Oncology","volume":"7 ","pages":"Article 100107"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ESMO Real World Data and Digital Oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949820124000857","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Human epidermal growth factor receptor 2 (HER2) is a treatment target in breast cancer (BC), driving therapeutic strategies. Changes over time in HER2 expression have been described and understanding of these fluctuations is crucial for personalized medicine. We aimed to assess HER2 expression dynamics using real-world data and natural language processing (NLP) from electronic health records (EHRs).
Material and methods
RosHER is a retrospective, observational, longitudinal, population-based, multicenter study (NCT05217381). An NLP tool extracted HER2 information from EHRs of adult patients with early, locally advanced, or de novo metastatic BC, who were initially diagnosed between 2005 and 2021. The primary endpoint was to evaluate HER2 dynamics in HER2 status and expression between initial diagnosis and recurrence or progression using NLP. The secondary endpoints were description of baseline clinicopathological characteristics and treatment patterns.
Results
Between January 2022 and November 2023, 18 533 patients were selected from seven Spanish sites. A cut-off of ≥6 months was established between initial determination and relapse or progression. The artificial intelligence (AI)-based tool identified 510 patients with two documented HER2 determinations and 209 with HER2 expression by immunohistochemistry/in situ hybridization. Overall discordances were 10.6% in HER2 status and 34.0% in HER2 expression. HER2-zero expression switched to HER2-low (23.2%), but not HER2-positive (0%); HER2-low expression converted to HER2-zero (32.0%) and HER2-positive (7.0%); finally HER2-positive expression switched to HER2-low (20.8%) and HER2-zero (15.1%).
Conclusions
This is the first study using NLP to evaluate HER2 discordances, which need to be further investigated. Improving AI methods and implementing similar EHR structures among hospitals would increase the success in clinical data extraction.