Predicting all-cause mortality with machine learning among Brazilians aged 50 and over: results from The Brazilian Longitudinal Study of Ageing (ELSI-Brazil).
Felipe Mendes Delpino, Alexandre Dias Porto Chiavegatto Filho, Juliana Lustosa Torres, Fabíola Bof de Andrade, Maria Fernanda Lima-Costa, Bruno Pereira Nunes
{"title":"Predicting all-cause mortality with machine learning among Brazilians aged 50 and over: results from The Brazilian Longitudinal Study of Ageing (ELSI-Brazil).","authors":"Felipe Mendes Delpino, Alexandre Dias Porto Chiavegatto Filho, Juliana Lustosa Torres, Fabíola Bof de Andrade, Maria Fernanda Lima-Costa, Bruno Pereira Nunes","doi":"10.1038/s41514-025-00210-7","DOIUrl":null,"url":null,"abstract":"<p><p>We aimed to develop a machine-learning model to predict all-cause mortality among Brazilians aged 50 and over, incorporating demographic, health, and lifestyle characteristics as predictors. We analyzed data from the Brazilian Longitudinal Study of Aging (ELSI-Brazil), waves 1 and 2 (2015-2021), a nationally representative sample from 70 municipalities across Brazil's five regions. Nine algorithms, including Random Forest, Gradient Boosting, XGBOOST, and Logistic Regression, were tested on 9412 participants (54.6% female), with 970 deaths recorded over approximately five years. Using 59 predictor variables, we assessed performance with metrics like AUC, accuracy, precision, and F1-Score. Random Forest excelled with an AUC of 0.92 (95% CI: 0.90-0.94). SHAP analysis highlighted age, sex, BMI, medication use, and physical activity as top predictors. Integrating these models into healthcare systems can improve policy planning and enable targeted interventions, ultimately fostering better health outcomes for aging populations.</p>","PeriodicalId":94160,"journal":{"name":"npj aging","volume":"11 1","pages":"22"},"PeriodicalIF":4.1000,"publicationDate":"2025-03-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11953239/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"npj aging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1038/s41514-025-00210-7","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
We aimed to develop a machine-learning model to predict all-cause mortality among Brazilians aged 50 and over, incorporating demographic, health, and lifestyle characteristics as predictors. We analyzed data from the Brazilian Longitudinal Study of Aging (ELSI-Brazil), waves 1 and 2 (2015-2021), a nationally representative sample from 70 municipalities across Brazil's five regions. Nine algorithms, including Random Forest, Gradient Boosting, XGBOOST, and Logistic Regression, were tested on 9412 participants (54.6% female), with 970 deaths recorded over approximately five years. Using 59 predictor variables, we assessed performance with metrics like AUC, accuracy, precision, and F1-Score. Random Forest excelled with an AUC of 0.92 (95% CI: 0.90-0.94). SHAP analysis highlighted age, sex, BMI, medication use, and physical activity as top predictors. Integrating these models into healthcare systems can improve policy planning and enable targeted interventions, ultimately fostering better health outcomes for aging populations.