Shreya Verma, Tori A Holthaus, Shelby Martell, Hannah D Holscher, Ruoqing Zhu, Naiman A Khan
{"title":"Predicting Cognitive Outcome Through Nutrition and Health Markers Using Supervised Machine Learning.","authors":"Shreya Verma, Tori A Holthaus, Shelby Martell, Hannah D Holscher, Ruoqing Zhu, Naiman A Khan","doi":"10.1016/j.tjnut.2025.05.003","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Machine learning (ML) use in health research is growing, yet its application to predict cognitive outcomes using diverse health indicators is underinvestigated.</p><p><strong>Objectives: </strong>We used ML models to predict cognitive performance based on a set of health and behavioral factors, aiming to identify key contributors to cognitive function for insights into potential personalized interventions.</p><p><strong>Methods: </strong>Data from 374 adults aged 19-82 y (227 females) were used to develop ML models predicting cognitive performance (reaction time in milliseconds) on a modified Eriksen flanker task. Features included demographics, anthropometric measures, dietary indices (Healthy Eating Index, Dietary Approaches to Stop Hypertension, Mediterranean, and Mediterranean-Dietary Approaches to Stop Hypertension Intervention for Neurodegenerative Delay), self-reported physical activity, and systolic and diastolic blood pressures. The data set was split (80:20) for training and testing. Predictive models (decision trees, random forest, AdaBoost, XGBoost, gradient boosting, linear, ridge, and lasso regression) were used with hyperparameter tuning and crossvalidation. Feature importance was calculated using permutation importance, whereas performance using mean absolute error (MAE) and mean squared error.</p><p><strong>Results: </strong>Random forest regressor exhibited the best performance, with the lowest MAE (training: 0.66 ms; testing: 0.78 ms) and mean squared error (training: 0.70 ms<sup>2</sup>; testing: 1.05 ms<sup>2</sup>). Age was the most significant feature (score: 0.208), followed by diastolic blood pressure (0.169), BMI (0.079), systolic blood pressure (0.069), and Healthy Eating Index (0.048). Ethnicity (0.005) and sex (0.003) had minimal predictive effect.</p><p><strong>Conclusions: </strong>Age, blood pressure, and BMI show strong associations with cognitive performance, whereas diet quality has a subtler effect. These findings highlight the potential of ML models for developing personalized interventions and preventive strategies for cognitive decline.</p>","PeriodicalId":16620,"journal":{"name":"Journal of Nutrition","volume":" ","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Nutrition","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.tjnut.2025.05.003","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"NUTRITION & DIETETICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Machine learning (ML) use in health research is growing, yet its application to predict cognitive outcomes using diverse health indicators is underinvestigated.
Objectives: We used ML models to predict cognitive performance based on a set of health and behavioral factors, aiming to identify key contributors to cognitive function for insights into potential personalized interventions.
Methods: Data from 374 adults aged 19-82 y (227 females) were used to develop ML models predicting cognitive performance (reaction time in milliseconds) on a modified Eriksen flanker task. Features included demographics, anthropometric measures, dietary indices (Healthy Eating Index, Dietary Approaches to Stop Hypertension, Mediterranean, and Mediterranean-Dietary Approaches to Stop Hypertension Intervention for Neurodegenerative Delay), self-reported physical activity, and systolic and diastolic blood pressures. The data set was split (80:20) for training and testing. Predictive models (decision trees, random forest, AdaBoost, XGBoost, gradient boosting, linear, ridge, and lasso regression) were used with hyperparameter tuning and crossvalidation. Feature importance was calculated using permutation importance, whereas performance using mean absolute error (MAE) and mean squared error.
Results: Random forest regressor exhibited the best performance, with the lowest MAE (training: 0.66 ms; testing: 0.78 ms) and mean squared error (training: 0.70 ms2; testing: 1.05 ms2). Age was the most significant feature (score: 0.208), followed by diastolic blood pressure (0.169), BMI (0.079), systolic blood pressure (0.069), and Healthy Eating Index (0.048). Ethnicity (0.005) and sex (0.003) had minimal predictive effect.
Conclusions: Age, blood pressure, and BMI show strong associations with cognitive performance, whereas diet quality has a subtler effect. These findings highlight the potential of ML models for developing personalized interventions and preventive strategies for cognitive decline.
期刊介绍:
The Journal of Nutrition (JN/J Nutr) publishes peer-reviewed original research papers covering all aspects of experimental nutrition in humans and other animal species; special articles such as reviews and biographies of prominent nutrition scientists; and issues, opinions, and commentaries on controversial issues in nutrition. Supplements are frequently published to provide extended discussion of topics of special interest.