İsmail Toygar, Su Özgür, Gülcan Bağçivan, Ezgi Karaçam, Hilal Benzer, Ferda Akyüz Özdemir, Halise Taşkın Duman, Özlem Ovayolu
{"title":"A machine learning approach to predict self-efficacy in breast cancer survivors.","authors":"İsmail Toygar, Su Özgür, Gülcan Bağçivan, Ezgi Karaçam, Hilal Benzer, Ferda Akyüz Özdemir, Halise Taşkın Duman, Özlem Ovayolu","doi":"10.1186/s12911-025-03155-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>To determine predictors of self-efficacy in breast cancer survivors and identify vulnerable groups.</p><p><strong>Methods: </strong>This descriptive study was conducted between November 2023 and April 2024 at three hospitals in Türkiye and involved 430 breast cancer survivors. Data were collected through face-to-face surveys using a patient identification form and the Breast Cancer Survivor Self-Efficacy Scale. This study identified patient characteristics that indicate a tendency towards higher self-efficacy using four machine learning models; Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), XGBoost (XGB).</p><p><strong>Results: </strong>The mean age of participants was 50.7 ± 11.5 years. Majority of the participants (n = 425) were female. AUC values were used as ranker for the machine learning models. The ranks of the models were as follows; logistic regression model (0.715), RF (0.710), SVM (0.704), and XGBoost (0.694). Education level ranked first in the LR (0.3874), RF (0.3290), and SVM (0.1250) models, and was the second most important variable in the XGB (0.2327) model. Conversely, the cancer stage stood out in the LR (0.2466) and RF (0.1935) models, ranking third and fourth, respectively, while it ranked third in SVM (0.0683) and fourth in XGB (0.1872). Additionally, comorbidity ranked third in importance in the LR (0.2213) and RF (0.1681) models, but second in SVM (0.0705) and seventh in XGB (0.1393).</p><p><strong>Conclusion: </strong>The study demonstrated that the self-efficacy of breast cancer survivors was associated with their sociodemographic and medical characteristics. These characteristics may assist healthcare professionals in enhancing the care provided to breast cancer survivors. It is of the utmost importance to consider the aforementioned patient group as being vulnerable with regard to breast cancer survivor self-efficacy. There is a clear need for a focus on this vulnerable cohort.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"313"},"PeriodicalIF":3.8000,"publicationDate":"2025-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12366306/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03155-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: To determine predictors of self-efficacy in breast cancer survivors and identify vulnerable groups.
Methods: This descriptive study was conducted between November 2023 and April 2024 at three hospitals in Türkiye and involved 430 breast cancer survivors. Data were collected through face-to-face surveys using a patient identification form and the Breast Cancer Survivor Self-Efficacy Scale. This study identified patient characteristics that indicate a tendency towards higher self-efficacy using four machine learning models; Logistic Regression (LR), Random Forest (RF), Support Vector Machine (SVM), XGBoost (XGB).
Results: The mean age of participants was 50.7 ± 11.5 years. Majority of the participants (n = 425) were female. AUC values were used as ranker for the machine learning models. The ranks of the models were as follows; logistic regression model (0.715), RF (0.710), SVM (0.704), and XGBoost (0.694). Education level ranked first in the LR (0.3874), RF (0.3290), and SVM (0.1250) models, and was the second most important variable in the XGB (0.2327) model. Conversely, the cancer stage stood out in the LR (0.2466) and RF (0.1935) models, ranking third and fourth, respectively, while it ranked third in SVM (0.0683) and fourth in XGB (0.1872). Additionally, comorbidity ranked third in importance in the LR (0.2213) and RF (0.1681) models, but second in SVM (0.0705) and seventh in XGB (0.1393).
Conclusion: The study demonstrated that the self-efficacy of breast cancer survivors was associated with their sociodemographic and medical characteristics. These characteristics may assist healthcare professionals in enhancing the care provided to breast cancer survivors. It is of the utmost importance to consider the aforementioned patient group as being vulnerable with regard to breast cancer survivor self-efficacy. There is a clear need for a focus on this vulnerable cohort.
期刊介绍:
BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.