Daniella Castro Araújo, Ricardo Simões, Adriano de Paula Sabino, Angélica Navarro de Oliveira, Camila Maciel de Oliveira, Adriano Alonso Veloso, Karina Braga Gomes
{"title":"预测乳腺癌中阿霉素引起的心脏毒性:利用机器学习与合成数据。","authors":"Daniella Castro Araújo, Ricardo Simões, Adriano de Paula Sabino, Angélica Navarro de Oliveira, Camila Maciel de Oliveira, Adriano Alonso Veloso, Karina Braga Gomes","doi":"10.1007/s11517-025-03289-y","DOIUrl":null,"url":null,"abstract":"<p><p>Doxorubicin (DOXO) is a primary treatment for breast cancer but can cause cardiotoxicity in over 25% of patients within the first year post-chemotherapy. Recognizing at-risk patients before DOXO initiation offers pathways for alternative treatments or early protective actions. We analyzed data from 78 Brazilian breast cancer patients, with 34.6% developing cardiotoxicity within a year of their final DOXO dose. To address the limited sample size, we utilized the DAS (Data Augmentation and Smoothing) method, creating 4892 synthetic samples that exhibited high statistics fidelity to the original data. By integrating routine blood biomarkers (C-Reactive protein, total cholesterol, LDL-c, HDL-c, hematocrit, and hemoglobin) and two clinical measures (weighted smoking status and body mass index), our model achieved an AUROC of 0.85±0.10, a sensitivity of 0.89, and a specificity of 0.69, positioning it as a potential screening instrument. Notably, DAS outperformed the established methods, Adaptive Synthetic Sampling (ADASYN), Synthetic Minority Over-Sampling Technique (SMOTE), and Synthetic Data Vault (SDV), underscoring its promise for medical synthetic data generation and pioneering a cardiotoxicity prediction model specifically for DOXO.</p>","PeriodicalId":49840,"journal":{"name":"Medical & Biological Engineering & Computing","volume":" ","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Predicting doxorubicin-induced cardiotoxicity in breast cancer: leveraging machine learning with synthetic data.\",\"authors\":\"Daniella Castro Araújo, Ricardo Simões, Adriano de Paula Sabino, Angélica Navarro de Oliveira, Camila Maciel de Oliveira, Adriano Alonso Veloso, Karina Braga Gomes\",\"doi\":\"10.1007/s11517-025-03289-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Doxorubicin (DOXO) is a primary treatment for breast cancer but can cause cardiotoxicity in over 25% of patients within the first year post-chemotherapy. Recognizing at-risk patients before DOXO initiation offers pathways for alternative treatments or early protective actions. We analyzed data from 78 Brazilian breast cancer patients, with 34.6% developing cardiotoxicity within a year of their final DOXO dose. To address the limited sample size, we utilized the DAS (Data Augmentation and Smoothing) method, creating 4892 synthetic samples that exhibited high statistics fidelity to the original data. By integrating routine blood biomarkers (C-Reactive protein, total cholesterol, LDL-c, HDL-c, hematocrit, and hemoglobin) and two clinical measures (weighted smoking status and body mass index), our model achieved an AUROC of 0.85±0.10, a sensitivity of 0.89, and a specificity of 0.69, positioning it as a potential screening instrument. Notably, DAS outperformed the established methods, Adaptive Synthetic Sampling (ADASYN), Synthetic Minority Over-Sampling Technique (SMOTE), and Synthetic Data Vault (SDV), underscoring its promise for medical synthetic data generation and pioneering a cardiotoxicity prediction model specifically for DOXO.</p>\",\"PeriodicalId\":49840,\"journal\":{\"name\":\"Medical & Biological Engineering & Computing\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-01-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical & Biological Engineering & Computing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s11517-025-03289-y\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical & Biological Engineering & Computing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s11517-025-03289-y","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Predicting doxorubicin-induced cardiotoxicity in breast cancer: leveraging machine learning with synthetic data.
Doxorubicin (DOXO) is a primary treatment for breast cancer but can cause cardiotoxicity in over 25% of patients within the first year post-chemotherapy. Recognizing at-risk patients before DOXO initiation offers pathways for alternative treatments or early protective actions. We analyzed data from 78 Brazilian breast cancer patients, with 34.6% developing cardiotoxicity within a year of their final DOXO dose. To address the limited sample size, we utilized the DAS (Data Augmentation and Smoothing) method, creating 4892 synthetic samples that exhibited high statistics fidelity to the original data. By integrating routine blood biomarkers (C-Reactive protein, total cholesterol, LDL-c, HDL-c, hematocrit, and hemoglobin) and two clinical measures (weighted smoking status and body mass index), our model achieved an AUROC of 0.85±0.10, a sensitivity of 0.89, and a specificity of 0.69, positioning it as a potential screening instrument. Notably, DAS outperformed the established methods, Adaptive Synthetic Sampling (ADASYN), Synthetic Minority Over-Sampling Technique (SMOTE), and Synthetic Data Vault (SDV), underscoring its promise for medical synthetic data generation and pioneering a cardiotoxicity prediction model specifically for DOXO.
期刊介绍:
Founded in 1963, Medical & Biological Engineering & Computing (MBEC) continues to serve the biomedical engineering community, covering the entire spectrum of biomedical and clinical engineering. The journal presents exciting and vital experimental and theoretical developments in biomedical science and technology, and reports on advances in computer-based methodologies in these multidisciplinary subjects. The journal also incorporates new and evolving technologies including cellular engineering and molecular imaging.
MBEC publishes original research articles as well as reviews and technical notes. Its Rapid Communications category focuses on material of immediate value to the readership, while the Controversies section provides a forum to exchange views on selected issues, stimulating a vigorous and informed debate in this exciting and high profile field.
MBEC is an official journal of the International Federation of Medical and Biological Engineering (IFMBE).