Ali Akbar Jamali, Corinne Berger, Raymond J. Spiteri
{"title":"利用机器学习从标准健康调查中识别抑郁症预测因子","authors":"Ali Akbar Jamali, Corinne Berger, Raymond J. Spiteri","doi":"10.1016/j.crbeha.2024.100157","DOIUrl":null,"url":null,"abstract":"<div><p>Depression has profound personal, societal, and economic impacts. Leveraging advances in technology can help identify predictors of depression. In this study, we compared seven machine learning (ML) algorithms to identify depression predictors using publicly available datasets from standard health surveys. We obtained data from the National Health and Nutrition Examination Survey (NHANES) 2017–2020, including medical, mental, demographic, and lifestyle information from 8965 individuals aged 18 to 80 years. Our study identified strongly correlated features of depression and demonstrated that ML algorithms can accurately identify depression predictors. The performance of the algorithms was evaluated using standard metrics. Among the algorithms tested, the Neural Network algorithm showed the highest overall performance, with an area under the curve of 91.34 %, which significantly outperformed results obtained with traditional statistical methods such as logistic regression and nomograms. This study demonstrates how ML can mine standard health surveys and identify depression predictors in a more accurate and nuanced fashion than other approaches. The findings of this study further suggest that incorporating heterogeneous data can enhance the performance of ML algorithms. These algorithms can be a valuable complementary tool for clinicians, particularly in remote settings, facilitating data analysis, and accelerating knowledge discovery in mental health studies.</p></div>","PeriodicalId":72746,"journal":{"name":"Current research in behavioral sciences","volume":"7 ","pages":"Article 100157"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666518224000111/pdfft?md5=ab3610e3792e96dccdc900dd473925fc&pid=1-s2.0-S2666518224000111-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Identification of depression predictors from standard health surveys using machine learning\",\"authors\":\"Ali Akbar Jamali, Corinne Berger, Raymond J. Spiteri\",\"doi\":\"10.1016/j.crbeha.2024.100157\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Depression has profound personal, societal, and economic impacts. Leveraging advances in technology can help identify predictors of depression. In this study, we compared seven machine learning (ML) algorithms to identify depression predictors using publicly available datasets from standard health surveys. We obtained data from the National Health and Nutrition Examination Survey (NHANES) 2017–2020, including medical, mental, demographic, and lifestyle information from 8965 individuals aged 18 to 80 years. Our study identified strongly correlated features of depression and demonstrated that ML algorithms can accurately identify depression predictors. The performance of the algorithms was evaluated using standard metrics. Among the algorithms tested, the Neural Network algorithm showed the highest overall performance, with an area under the curve of 91.34 %, which significantly outperformed results obtained with traditional statistical methods such as logistic regression and nomograms. This study demonstrates how ML can mine standard health surveys and identify depression predictors in a more accurate and nuanced fashion than other approaches. The findings of this study further suggest that incorporating heterogeneous data can enhance the performance of ML algorithms. These algorithms can be a valuable complementary tool for clinicians, particularly in remote settings, facilitating data analysis, and accelerating knowledge discovery in mental health studies.</p></div>\",\"PeriodicalId\":72746,\"journal\":{\"name\":\"Current research in behavioral sciences\",\"volume\":\"7 \",\"pages\":\"Article 100157\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666518224000111/pdfft?md5=ab3610e3792e96dccdc900dd473925fc&pid=1-s2.0-S2666518224000111-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Current research in behavioral sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666518224000111\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Psychology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current research in behavioral sciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666518224000111","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Psychology","Score":null,"Total":0}
引用次数: 0
摘要
抑郁症对个人、社会和经济有着深远的影响。利用技术进步可以帮助识别抑郁症的预测因素。在本研究中,我们比较了七种机器学习(ML)算法,以利用标准健康调查的公开数据集识别抑郁症预测因子。我们从 2017-2020 年全国健康与营养调查(NHANES)中获得了数据,包括来自 8965 名 18 至 80 岁个体的医疗、精神、人口统计学和生活方式信息。我们的研究发现了抑郁症的强相关特征,并证明了 ML 算法可以准确识别抑郁症预测因子。我们使用标准指标对算法的性能进行了评估。在所测试的算法中,神经网络算法的整体性能最高,曲线下面积为 91.34%,明显优于逻辑回归和提名图等传统统计方法的结果。这项研究表明,与其他方法相比,ML 可以更准确、更细致地挖掘标准健康调查并识别抑郁预测因素。本研究的结果进一步表明,结合异构数据可以提高 ML 算法的性能。这些算法可以成为临床医生的宝贵补充工具,尤其是在偏远地区,可以促进数据分析,加快心理健康研究的知识发现。
Identification of depression predictors from standard health surveys using machine learning
Depression has profound personal, societal, and economic impacts. Leveraging advances in technology can help identify predictors of depression. In this study, we compared seven machine learning (ML) algorithms to identify depression predictors using publicly available datasets from standard health surveys. We obtained data from the National Health and Nutrition Examination Survey (NHANES) 2017–2020, including medical, mental, demographic, and lifestyle information from 8965 individuals aged 18 to 80 years. Our study identified strongly correlated features of depression and demonstrated that ML algorithms can accurately identify depression predictors. The performance of the algorithms was evaluated using standard metrics. Among the algorithms tested, the Neural Network algorithm showed the highest overall performance, with an area under the curve of 91.34 %, which significantly outperformed results obtained with traditional statistical methods such as logistic regression and nomograms. This study demonstrates how ML can mine standard health surveys and identify depression predictors in a more accurate and nuanced fashion than other approaches. The findings of this study further suggest that incorporating heterogeneous data can enhance the performance of ML algorithms. These algorithms can be a valuable complementary tool for clinicians, particularly in remote settings, facilitating data analysis, and accelerating knowledge discovery in mental health studies.