An explainable ensemble model for revealing the level of depression in social media by considering personality traits and sentiment polarity pattern

Q1 Social Sciences

Online Social Networks and Media Pub Date : 2025-03-08 DOI:10.1016/j.osnem.2025.100307

Gede Aditra Pradnyana , Wiwik Anggraeni , Eko Mulyanto Yuniarno , Mauridhi Hery Purnomo

{"title":"An explainable ensemble model for revealing the level of depression in social media by considering personality traits and sentiment polarity pattern","authors":"Gede Aditra Pradnyana , Wiwik Anggraeni , Eko Mulyanto Yuniarno , Mauridhi Hery Purnomo","doi":"10.1016/j.osnem.2025.100307","DOIUrl":null,"url":null,"abstract":"<div><div>Early detection of depression in mental health is crucial for better intervention. Social media has been extensively used to examine users’ behavior, motivating researchers to develop an automatic depression detection model. However, the accuracy and clarity of the reasons behind the detection results still need to be improved. Current research focuses primarily on syntactic and semantic information in user-posted texts, while other aspects of users’ psychological characteristics are often overlooked. Therefore, this study addresses the gap by proposing a novel model integrating personality traits and sentiment polarity patterns into an explainable ensemble model. Specifically, we developed two base learners for the averaged and meta-ensemble learning strategy. The first learner employed the Robustly Optimized BERT Pre-training Approach (RoBERTa). For the second learner, we combined the Random Forest and Bidirectional Long Short-Term Memory (RF-BiLSTM) methods to effectively handle the combination of personality traits and sequential information in sentiment polarity patterns. These additional features are obtained by performing domain adaptation for personality prediction and sentiment analysis using a lexicon-based model. Based on the experimental results, our ensemble model improved depression detection results by leveraging the strengths of each base learner. Our model advanced the state-of-the-art, outperforming existing models with an increase in accuracy and F1-score of 4.14% and 2.99%, respectively. The model successfully enhanced the interpretability of detection results, providing a more comprehensive understanding of the factors underlying depressive symptoms. This research highlights the potential of considering alternative additional features as a promising avenue for enhancing depression detection in social media.</div></div>","PeriodicalId":52228,"journal":{"name":"Online Social Networks and Media","volume":"46 ","pages":"Article 100307"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online Social Networks and Media","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468696425000084","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 0

Abstract

Early detection of depression in mental health is crucial for better intervention. Social media has been extensively used to examine users’ behavior, motivating researchers to develop an automatic depression detection model. However, the accuracy and clarity of the reasons behind the detection results still need to be improved. Current research focuses primarily on syntactic and semantic information in user-posted texts, while other aspects of users’ psychological characteristics are often overlooked. Therefore, this study addresses the gap by proposing a novel model integrating personality traits and sentiment polarity patterns into an explainable ensemble model. Specifically, we developed two base learners for the averaged and meta-ensemble learning strategy. The first learner employed the Robustly Optimized BERT Pre-training Approach (RoBERTa). For the second learner, we combined the Random Forest and Bidirectional Long Short-Term Memory (RF-BiLSTM) methods to effectively handle the combination of personality traits and sequential information in sentiment polarity patterns. These additional features are obtained by performing domain adaptation for personality prediction and sentiment analysis using a lexicon-based model. Based on the experimental results, our ensemble model improved depression detection results by leveraging the strengths of each base learner. Our model advanced the state-of-the-art, outperforming existing models with an increase in accuracy and F1-score of 4.14% and 2.99%, respectively. The model successfully enhanced the interpretability of detection results, providing a more comprehensive understanding of the factors underlying depressive symptoms. This research highlights the potential of considering alternative additional features as a promising avenue for enhancing depression detection in social media.

Abstract Image

查看原文本刊更多论文

考虑人格特质和情绪极性模式的社交媒体抑郁水平可解释的集合模型

在心理健康方面早期发现抑郁症对于更好的干预至关重要。社交媒体被广泛用于检查用户的行为，这促使研究人员开发了一种自动抑郁检测模型。然而，检测结果背后原因的准确性和清晰度仍有待提高。目前的研究主要集中在用户发布文本中的句法和语义信息，而用户心理特征的其他方面往往被忽视。因此，本研究提出了一种将人格特质和情感极性模式整合到一个可解释的集成模型中的新模型来解决这一空白。具体来说，我们为平均学习策略和元集成学习策略开发了两个基本学习器。第一个学习者采用鲁棒优化BERT预训练方法（RoBERTa）。对于第二个学习者，我们将随机森林和双向长短期记忆（RF-BiLSTM）方法结合起来，有效地处理了情感极性模式下人格特征和顺序信息的组合。这些额外的特征是通过使用基于词典的模型进行人格预测和情感分析的领域适应来获得的。在实验结果的基础上，我们的集成模型通过利用每个基学习器的优势来改善抑郁检测结果。我们的模型比现有的模型更先进，准确率和f1分数分别提高了4.14%和2.99%。该模型成功地提高了检测结果的可解释性，为抑郁症状的潜在因素提供了更全面的理解。这项研究强调了考虑替代附加功能作为增强社交媒体抑郁症检测的有希望的途径的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊