[Construction of a risk prediction model for chemotherapy-induced cardio-toxicity in breast cancer patients based on machine learning algorithm].
Authors: X N Yue, C Yan, X Y Liu
Journal: 中华心血管病杂志, vol. 53, no. 8, pp. 898-905
Published: 2025-08-24
DOI: https://doi.org/10.3760/cma.j.cn112148-20250112-00035
Objective: To evaluate machine learning algorithms for constructing a model that predicts cardiovascular toxicity in breast cancer patients receiving anthracycline-based chemotherapy.
Methods: This retrospective cohort study enrolled female patients with breast cancer who received anthracyclines at the Affiliated Cancer Hospital of Xinjiang Medical University between January 2020 and December 2023. The endpoint event was an abnormal electrocardiogram (ECG). Patients were assigned to the ECG abnormal group or the ECG normal group according to whether ECG abnormalities occurred during chemotherapy. The dataset was split into a training set and a test set at a ratio of 8:2. Logistic regression, random forest, extreme gradient boosting (XGBoost), support vector machine (SVM), and multilayer perceptron (MLP) were used to construct risk prediction models for cardiovascular toxicity, and the models were evaluated with receiver operating characteristic curves, calibration curves, and clinical decision curves.
Results: A total of 731 female patients with breast cancer, aged (51.6±9.4) years, were enrolled, with a follow-up time of (130.3±37.1) days. There were 333 patients in the ECG abnormal group and 398 in the ECG normal group. Seven factors influencing cardiovascular toxicity were identified: age, menstrual history, diabetes, combination therapy with trastuzumab, combination therapy with dexrazoxane, creatine kinase isoenzymes, and α-hydroxybutyrate dehydrogenase. In the training set, the area under the curve (AUC) for the logistic regression, random forest, XGBoost, SVM, and MLP models was 0.712, 0.863, 0.774, 0.813, and 0.733, respectively. In the test set, the corresponding AUCs were 0.671, 0.778, 0.746, 0.771, and 0.705. Calibration curves and clinical decision curves indicated that the random forest model performed best.
Conclusion: Models built with machine learning algorithms show promise for predicting cardiovascular toxicity in breast cancer patients receiving anthracycline-based chemotherapy, and the random forest model performed best among those compared.
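The AUC figures reported in the abstract can be tabulated to compare the five models side by side. The sketch below is illustrative only and is not the authors' code: the dictionary holds the values quoted above, while the helper names `auc_gaps` and `rank_by_test_auc` are hypothetical. The training-minus-test gap is used here as a rough indicator of overfitting, which is an assumption of this example rather than an analysis described in the abstract.

```python
# AUC values as reported in the abstract: (training set, test set).
reported_auc = {
    "logistic regression": (0.712, 0.671),
    "random forest": (0.863, 0.778),
    "XGBoost": (0.774, 0.746),
    "SVM": (0.813, 0.771),
    "MLP": (0.733, 0.705),
}


def auc_gaps(auc_table):
    """Return {model: training AUC minus test AUC}, rounded to 3 decimals.

    Hypothetical helper: a larger gap suggests the model fits the training
    data more closely than it generalizes to the held-out test set.
    """
    return {
        name: round(train - test, 3)
        for name, (train, test) in auc_table.items()
    }


def rank_by_test_auc(auc_table):
    """Return model names sorted from highest to lowest test-set AUC."""
    return sorted(auc_table, key=lambda name: auc_table[name][1], reverse=True)


gaps = auc_gaps(reported_auc)
ranking = rank_by_test_auc(reported_auc)

# Print one row per model, best test-set AUC first.
for name in ranking:
    train, test = reported_auc[name]
    print(f"{name:20s} train={train:.3f} test={test:.3f} gap={gaps[name]:.3f}")
```

On these figures, random forest has the highest test-set AUC (0.778) and also the largest training-to-test drop (0.085). Discrimination on the test set alone therefore does not settle the comparison, which is consistent with the abstract also relying on calibration curves and clinical decision curves.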
中华心血管病杂志 (Chinese Journal of Cardiology)
Subject area: Medicine, Cardiology and Cardiovascular Medicine
CiteScore: 1.40
Self-citation rate: 0.00%
Articles published: 10577
About the journal:
The Chinese Journal of Cardiology, established in February 1973, is one of the major academic medical journals sponsored by the Chinese Medical Association and a leading periodical in the field of cardiology in China. It specializes in cardiology and related disciplines and has a readership of more than 25,000. The journal publishes editorials and guidelines as well as important original articles on clinical and experimental investigations, reflecting achievements made in China and promoting academic exchange between cardiologists in China and abroad. Its columns include Editorials, Strategies, Comments, Clinical Investigations, Experimental Investigations, Epidemiology and Prevention, Lectures, Comprehensive Reviews, and Continuing Medical Education.