A Multimodal Feature Fusion-Based Method for Individual Depression Detection on Sina Weibo

Yiding Wang, Zhenyi Wang, Chenghao Li, Yilin Zhang, Haizhou Wang
{"title":"A Multimodal Feature Fusion-Based Method for Individual Depression Detection on Sina Weibo","authors":"Yiding Wang, Zhenyi Wang, Chenghao Li, Yilin Zhang, Haizhou Wang","doi":"10.1109/IPCCC50635.2020.9391501","DOIUrl":null,"url":null,"abstract":"Existing studies have shown that various types of information on the online social network (OSN) can help predict the early stage of depression. However, studies using machine learning methods to accomplish depression detection tasks still do not have high classification performance, suggesting that there is much potential for improvement in their feature engineering. In this paper, we first construct a dataset on Sina Weibo (a leading OSN with the largest number of active users in the Chinese community), namely the Weibo User Depression Detection Dataset (WU3D). It includes more than 10,000 depressed users and 20,000 normal users, both of which are manually labeled and rechecked by specialists. Then, we extract text-based word features using the popular pretrained model XLNet and summarize nine statistical features related to user text, social behavior, and pictures. Moreover, we construct a deep neural network classification model, i.e. Multimodal Feature Fusion Network (MFFN), to fuse the above-extracted features from different information sources and further accomplish the classification task. The experimental results show that our approach achieves an F1-Score of 0.9685 on the test dataset, which has a good performance improvement compared to the existing works. In addition, we verify that our multimodal detecting approach is more robust than multimodel ensemble ones. Our work could also provide new research methods for depression detection on other OSN platforms.","PeriodicalId":226034,"journal":{"name":"2020 IEEE 39th International Performance Computing and Communications Conference (IPCCC)","volume":"146 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 39th International Performance Computing and Communications Conference (IPCCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IPCCC50635.2020.9391501","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13

Abstract

Existing studies have shown that various types of information on the online social network (OSN) can help predict the early stage of depression. However, studies using machine learning methods to accomplish depression detection tasks still do not have high classification performance, suggesting that there is much potential for improvement in their feature engineering. In this paper, we first construct a dataset on Sina Weibo (a leading OSN with the largest number of active users in the Chinese community), namely the Weibo User Depression Detection Dataset (WU3D). It includes more than 10,000 depressed users and 20,000 normal users, both of which are manually labeled and rechecked by specialists. Then, we extract text-based word features using the popular pretrained model XLNet and summarize nine statistical features related to user text, social behavior, and pictures. Moreover, we construct a deep neural network classification model, i.e. Multimodal Feature Fusion Network (MFFN), to fuse the above-extracted features from different information sources and further accomplish the classification task. The experimental results show that our approach achieves an F1-Score of 0.9685 on the test dataset, which has a good performance improvement compared to the existing works. In addition, we verify that our multimodal detecting approach is more robust than multimodel ensemble ones. Our work could also provide new research methods for depression detection on other OSN platforms.
基于多模态特征融合的新浪微博个体抑郁检测方法
现有研究表明,在线社交网络(OSN)上的各种类型的信息可以帮助预测抑郁症的早期阶段。然而,使用机器学习方法来完成抑郁症检测任务的研究仍然没有很高的分类性能,这表明它们在特征工程方面有很大的改进潜力。在本文中,我们首先在新浪微博(中国社区活跃用户数量最多的领先OSN)上构建了一个数据集,即微博用户抑郁检测数据集(WU3D)。它包括1万多名抑郁用户和2万多名正常用户,这两种用户都由专家手工标记和重新检查。然后,我们使用流行的预训练模型XLNet提取基于文本的单词特征,并总结了与用户文本、社会行为和图片相关的9个统计特征。此外,我们构建了一个深度神经网络分类模型,即多模态特征融合网络(Multimodal Feature Fusion network, MFFN),将上述从不同信息源提取的特征融合在一起,进一步完成分类任务。实验结果表明,我们的方法在测试数据集上达到了0.9685的F1-Score,与现有的工作相比,有了很好的性能提升。此外,我们还验证了我们的多模态检测方法比多模集成检测方法具有更强的鲁棒性。本研究也可为其他OSN平台的抑郁检测提供新的研究方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信