Ali Akbar Jamali, Corinne Berger, Raymond J. Spiteri
{"title":"Identification of depression predictors from standard health surveys using machine learning","authors":"Ali Akbar Jamali, Corinne Berger, Raymond J. Spiteri","doi":"10.1016/j.crbeha.2024.100157","DOIUrl":null,"url":null,"abstract":"<div><p>Depression has profound personal, societal, and economic impacts. Leveraging advances in technology can help identify predictors of depression. In this study, we compared seven machine learning (ML) algorithms to identify depression predictors using publicly available datasets from standard health surveys. We obtained data from the National Health and Nutrition Examination Survey (NHANES) 2017–2020, including medical, mental, demographic, and lifestyle information from 8965 individuals aged 18 to 80 years. Our study identified strongly correlated features of depression and demonstrated that ML algorithms can accurately identify depression predictors. The performance of the algorithms was evaluated using standard metrics. Among the algorithms tested, the Neural Network algorithm showed the highest overall performance, with an area under the curve of 91.34 %, which significantly outperformed results obtained with traditional statistical methods such as logistic regression and nomograms. This study demonstrates how ML can mine standard health surveys and identify depression predictors in a more accurate and nuanced fashion than other approaches. The findings of this study further suggest that incorporating heterogeneous data can enhance the performance of ML algorithms. These algorithms can be a valuable complementary tool for clinicians, particularly in remote settings, facilitating data analysis, and accelerating knowledge discovery in mental health studies.</p></div>","PeriodicalId":72746,"journal":{"name":"Current research in behavioral sciences","volume":"7 ","pages":"Article 100157"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666518224000111/pdfft?md5=ab3610e3792e96dccdc900dd473925fc&pid=1-s2.0-S2666518224000111-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current research in behavioral sciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666518224000111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Psychology","Score":null,"Total":0}
引用次数: 0
Abstract
Depression has profound personal, societal, and economic impacts. Leveraging advances in technology can help identify predictors of depression. In this study, we compared seven machine learning (ML) algorithms to identify depression predictors using publicly available datasets from standard health surveys. We obtained data from the National Health and Nutrition Examination Survey (NHANES) 2017–2020, including medical, mental, demographic, and lifestyle information from 8965 individuals aged 18 to 80 years. Our study identified strongly correlated features of depression and demonstrated that ML algorithms can accurately identify depression predictors. The performance of the algorithms was evaluated using standard metrics. Among the algorithms tested, the Neural Network algorithm showed the highest overall performance, with an area under the curve of 91.34 %, which significantly outperformed results obtained with traditional statistical methods such as logistic regression and nomograms. This study demonstrates how ML can mine standard health surveys and identify depression predictors in a more accurate and nuanced fashion than other approaches. The findings of this study further suggest that incorporating heterogeneous data can enhance the performance of ML algorithms. These algorithms can be a valuable complementary tool for clinicians, particularly in remote settings, facilitating data analysis, and accelerating knowledge discovery in mental health studies.