Veysel Karani Baris, Yubo Fu, Brad Gilbreath, Jessica Rainbow, Luke A. Fiorini, Pamela Love
{"title":"Harnessing Machine Learning to Predict Nurse Turnover Intention and Uncover Key Predictors: A Multinational Investigation","authors":"Veysel Karani Baris, Yubo Fu, Brad Gilbreath, Jessica Rainbow, Luke A. Fiorini, Pamela Love","doi":"10.1111/jan.70260","DOIUrl":null,"url":null,"abstract":"AimsTo predict nurses' turnover intention using machine learning techniques and identify the most influential psychosocial, organisational and demographic predictors across three countries.DesignA cross‐sectional, multinational survey design.MethodsData were collected from 1625 nurses in the United States, Türkiye and Malta between June and September 2023 via an online survey. Twenty variables were assessed, including job satisfaction, psychological safety, depression, presenteeism, person‐group fit and work engagement. Turnover intention was transformed into a binary variable using unsupervised machine learning (k‐means clustering). Six supervised algorithms—logistic regression, random forest, XGBoost, decision tree, support vector machine and artificial neural networks—were employed. Model performance was evaluated using accuracy, precision, recall, F1 score and Area Under the Curve (AUC). Feature importance was examined using logistic regression (coefficients), XGBoost (gain) and random forest (mean decrease accuracy).ResultsLogistic regression achieved the best predictive performance (accuracy = 0.829, f1 = 0.851, AUC = 0.890) followed closely by support vector machine (polynomial kernel) (accuracy = 0.805, f1 0.830, AUC = 0.864) and random forest (accuracy = 0.791, f1 = 0.820, AUC = 0.859). In the feature importance analysis, job satisfaction consistently emerged as the most influential predictor across all models. Other key predictors identified in the logistic regression model included country (USA), work experience (6–10 years), depression and psychological safety. XGBoost and random forest additionally emphasised the roles of work engagement, group‐level authenticity and person–group fit. Job‐stress‐related presenteeism was uniquely significant in XGBoost, while depression ranked among the top predictors in both logistic regression and random forest models.ConclusionMachine learning can effectively predict turnover intention using multidimensional predictors. This methodology can support data‐driven decision‐making in clinical retention strategies.ImpactThis study provides a data‐driven framework to identify nurses at risk of turnover. By integrating machine learning into workforce planning, healthcare leaders can develop targeted, evidence‐based strategies to enhance retention and improve organisational stability.Reporting MethodThis study adhered to STROBE reporting guideline.Patient and Public ContributionThis study did not include patient or public involvement in its design, conduct or reporting.","PeriodicalId":54897,"journal":{"name":"Journal of Advanced Nursing","volume":"9 1","pages":""},"PeriodicalIF":3.4000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advanced Nursing","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/jan.70260","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NURSING","Score":null,"Total":0}
引用次数: 0
Abstract
AimsTo predict nurses' turnover intention using machine learning techniques and identify the most influential psychosocial, organisational and demographic predictors across three countries.DesignA cross‐sectional, multinational survey design.MethodsData were collected from 1625 nurses in the United States, Türkiye and Malta between June and September 2023 via an online survey. Twenty variables were assessed, including job satisfaction, psychological safety, depression, presenteeism, person‐group fit and work engagement. Turnover intention was transformed into a binary variable using unsupervised machine learning (k‐means clustering). Six supervised algorithms—logistic regression, random forest, XGBoost, decision tree, support vector machine and artificial neural networks—were employed. Model performance was evaluated using accuracy, precision, recall, F1 score and Area Under the Curve (AUC). Feature importance was examined using logistic regression (coefficients), XGBoost (gain) and random forest (mean decrease accuracy).ResultsLogistic regression achieved the best predictive performance (accuracy = 0.829, f1 = 0.851, AUC = 0.890) followed closely by support vector machine (polynomial kernel) (accuracy = 0.805, f1 0.830, AUC = 0.864) and random forest (accuracy = 0.791, f1 = 0.820, AUC = 0.859). In the feature importance analysis, job satisfaction consistently emerged as the most influential predictor across all models. Other key predictors identified in the logistic regression model included country (USA), work experience (6–10 years), depression and psychological safety. XGBoost and random forest additionally emphasised the roles of work engagement, group‐level authenticity and person–group fit. Job‐stress‐related presenteeism was uniquely significant in XGBoost, while depression ranked among the top predictors in both logistic regression and random forest models.ConclusionMachine learning can effectively predict turnover intention using multidimensional predictors. This methodology can support data‐driven decision‐making in clinical retention strategies.ImpactThis study provides a data‐driven framework to identify nurses at risk of turnover. By integrating machine learning into workforce planning, healthcare leaders can develop targeted, evidence‐based strategies to enhance retention and improve organisational stability.Reporting MethodThis study adhered to STROBE reporting guideline.Patient and Public ContributionThis study did not include patient or public involvement in its design, conduct or reporting.
期刊介绍:
The Journal of Advanced Nursing (JAN) contributes to the advancement of evidence-based nursing, midwifery and healthcare by disseminating high quality research and scholarship of contemporary relevance and with potential to advance knowledge for practice, education, management or policy.
All JAN papers are required to have a sound scientific, evidential, theoretical or philosophical base and to be critical, questioning and scholarly in approach. As an international journal, JAN promotes diversity of research and scholarship in terms of culture, paradigm and healthcare context. For JAN’s worldwide readership, authors are expected to make clear the wider international relevance of their work and to demonstrate sensitivity to cultural considerations and differences.