Using natural language processing to identify patterns associated with depression, anxiety, and stress symptoms during the COVID-19 pandemic
Abigail Beech, Haoxue Fan, Jocelyn Shu, Javiera Oyarzun, Peter Nadel, Olivia T. Karaman, Sophia Vranos, Elizabeth A. Phelps, M. Alexandra Kredlow
Journal of Affective Disorders, Volume 376, Pages 113-121. Published 2025-01-31. DOI: 10.1016/j.jad.2025.01.139
https://www.sciencedirect.com/science/article/pii/S0165032725001594
Abstract
Background
Combining data-driven natural language processing techniques with traditional methods using predefined word lists may offer greater insights into the connections between language patterns and depression and anxiety symptoms, particularly within specific stressful contexts.
Methods
Between 2020 and 2021, 1106 participants wrote narrative responses describing their experiences during the COVID-19 pandemic and completed the Depression Anxiety Stress Scale-21 (DASS). We investigated language patterns associated with DASS symptoms using established categories from Linguistic Inquiry and Word Count (LIWC) and sentiment analysis, as well as exploratory natural language processing techniques. Finally, we constructed machine learning regression models in order to assess how much of the variance in DASS symptoms is related to language use.
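As a rough illustration of the word-list approach described above, the following minimal sketch computes the proportion of tokens in each narrative that fall into a predefined category and correlates that proportion with a symptom score. Everything in it is an assumption made for illustration: the `ABSOLUTE_WORDS` list is a tiny hypothetical stand-in (the actual LIWC dictionaries are proprietary and much larger), the function names are invented here, and the narratives and scores are toy values, not study data.

```python
import math
import re

# Hypothetical mini word list; the real LIWC dictionaries are proprietary and far larger.
ABSOLUTE_WORDS = {"always", "never", "completely", "nothing", "everything"}


def category_proportion(text, words):
    """Return the share of tokens in `text` that belong to `words`."""
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    return sum(token in words for token in tokens) / len(tokens)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Toy narratives and made-up symptom scores, for illustration only (not study data).
narratives = [
    "I always feel like nothing will ever change",
    "We went for walks and cooked together most days",
    "Everything is completely different and I never see friends",
    "Work moved online but the routine stayed manageable",
]
scores = [30, 8, 35, 12]

# One category proportion per narrative, then its bivariate correlation with the scores.
proportions = [category_proportion(text, ABSOLUTE_WORDS) for text in narratives]
r = pearson_r(proportions, scores)
print(f"Proportions: {proportions}")
print(f"Pearson r between absolute-language use and toy scores: {r:.3f}")
```

Normalizing by token count keeps longer narratives from scoring higher simply because they contain more words.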
Results
We found significant positive bivariate correlations between total DASS symptoms and hypothesized LIWC categories: first-person singular pronouns, absolute language, and negative emotion words. These results remained largely similar when using negative sentiment scores and when statistically controlling for gender, age, and education. Exploratory n-gram analyses also revealed new individual words and phrases correlated with total DASS symptoms. Lastly, our regression models demonstrated a significant association between language use and total DASS symptoms (R² = 0.36–0.62).
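To make two of the quantities reported above concrete, here is a minimal sketch of a first-order partial correlation (the association between a language feature and symptoms after controlling for a single covariate) and of the coefficient of determination R². It is not the authors' analysis: the abstract reports controlling for three covariates and fitting machine learning regression models, neither of which is reproduced here. The function names and all numbers are hypothetical toy values.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def partial_r(xs, ys, zs):
    """First-order partial correlation of xs and ys, controlling for zs."""
    r_xy = pearson_r(xs, ys)
    r_xz = pearson_r(xs, zs)
    r_yz = pearson_r(ys, zs)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1 - ss_res / ss_tot


# Toy values for illustration only (not study data).
language_feature = [0.25, 0.00, 0.33, 0.00, 0.10, 0.20]
symptom_score = [30, 8, 35, 12, 15, 22]
age = [24, 51, 33, 45, 38, 29]

# Association between the language feature and symptoms, with and without the covariate.
print(f"Bivariate r: {pearson_r(language_feature, symptom_score):.3f}")
print(f"Partial r controlling for age: {partial_r(language_feature, symptom_score, age):.3f}")

# Hypothetical model predictions, used only to show how R² is computed.
toy_predictions = [28, 10, 33, 11, 18, 21]
print(f"R² of toy predictions: {r_squared(symptom_score, toy_predictions):.3f}")
```

With several covariates, the usual route is to regress both variables on all covariates and correlate the residuals; the closed-form expression used here covers only the single-covariate case.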
Conclusions
The current study is one of the first to examine associations between language use and DASS symptoms during the pandemic using both traditional and data-driven techniques. These results replicate and extend prior findings regarding negative emotion and absolute language and identify unique correlates of DASS symptoms during pandemic-related stress, contributing to the literature on language and mental health more broadly.
About the journal:
The Journal of Affective Disorders publishes papers concerned with affective disorders in the widest sense: depression, mania, mood spectrum, emotions and personality, anxiety and stress. It is interdisciplinary and aims to bring together different approaches for a diverse readership. Top quality papers will be accepted dealing with any aspect of affective disorders, including neuroimaging, cognitive neurosciences, genetics, molecular biology, experimental and clinical neurosciences, pharmacology, neuroimmunoendocrinology, intervention and treatment trials.