Predicción de la sobrevida global y libre de recaída en pacientes con cáncer de mama triple negativo a través de agrupaciones basadas en el aprendizaje automático sobre datos clínicos
{"title":"Predicción de la sobrevida global y libre de recaída en pacientes con cáncer de mama triple negativo a través de agrupaciones basadas en el aprendizaje automático sobre datos clínicos","authors":"Juan Pablo Alzate-Granados, Luis Fernando Niño","doi":"10.1016/j.senol.2025.100686","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Triple-negative breast cancer (TNBC) accounts for 15%–20% of all breast cancers and is characterised by the absence of hormone and HER2 receptors, limiting therapeutic options to cytotoxic chemotherapy. This subtype is highly aggressive, with frequent relapses and a poor prognosis. The integration of clinical and molecular data using machine learning algorithms offers an opportunity to improve the prediction of clinical outcomes and personalise the management of this disease.</div></div><div><h3>Objective</h3><div>To develop predictive models to estimate overall survival (OS) and relapse-free survival (RFS) in patients with TNBC.</div></div><div><h3>Methods</h3><div>A retrospective cohort study was conducted with 4,808 patients diagnosed between 2012 and 2024. Patients were grouped using the k-prototypes algorithm for mixed data. Demographic and clinical variables, biomarkers, and clinical outcomes were analysed. Multivariate analyses and Cox proportional hazards models were used to assess the association between clusters and OS and RFS outcomes.</div></div><div><h3>Results</h3><div>Four clusters of patients were identified. The highest risk group (cluster 3) had the highest mortality (42.3%; HR: 1.94; 95% CI: 1.63–2.30) and relapse (54.25%; HR: 1.68; 95% CI: 1.45–1.95), whereas cluster 0 had the best outcomes (mortality of 22.51%). Factors such as mutations in PIK3CA (HR: 1.535; <em>p</em> <!-->=<!--> <!-->0.001) and TP53 (HR: 1.180; <em>p</em> <!-->=<!--> <!-->0.023) were associated with adverse outcomes.</div></div><div><h3>Discussion</h3><div>The results support the utility of cluster-based stratification to predict outcomes and guide personalised interventions in TNBC.</div></div>","PeriodicalId":38058,"journal":{"name":"Revista de Senologia y Patologia Mamaria","volume":"38 4","pages":"Article 100686"},"PeriodicalIF":0.2000,"publicationDate":"2025-04-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Revista de Senologia y Patologia Mamaria","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0214158225000222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"OBSTETRICS & GYNECOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
Triple-negative breast cancer (TNBC) accounts for 15%–20% of all breast cancers and is characterised by the absence of hormone and HER2 receptors, limiting therapeutic options to cytotoxic chemotherapy. This subtype is highly aggressive, with frequent relapses and a poor prognosis. The integration of clinical and molecular data using machine learning algorithms offers an opportunity to improve the prediction of clinical outcomes and personalise the management of this disease.
Objective
To develop predictive models to estimate overall survival (OS) and relapse-free survival (RFS) in patients with TNBC.
Methods
A retrospective cohort study was conducted with 4,808 patients diagnosed between 2012 and 2024. Patients were grouped using the k-prototypes algorithm for mixed data. Demographic and clinical variables, biomarkers, and clinical outcomes were analysed. Multivariate analyses and Cox proportional hazards models were used to assess the association between clusters and OS and RFS outcomes.
Results
Four clusters of patients were identified. The highest risk group (cluster 3) had the highest mortality (42.3%; HR: 1.94; 95% CI: 1.63–2.30) and relapse (54.25%; HR: 1.68; 95% CI: 1.45–1.95), whereas cluster 0 had the best outcomes (mortality of 22.51%). Factors such as mutations in PIK3CA (HR: 1.535; p = 0.001) and TP53 (HR: 1.180; p = 0.023) were associated with adverse outcomes.
Discussion
The results support the utility of cluster-based stratification to predict outcomes and guide personalised interventions in TNBC.