使用机器学习方法在具有用户网络和参与特征的社交媒体上检测抑郁症

Aik Seng Liaw, Hui Na Chua
{"title":"使用机器学习方法在具有用户网络和参与特征的社交媒体上检测抑郁症","authors":"Aik Seng Liaw, Hui Na Chua","doi":"10.1109/IICAIET55139.2022.9936814","DOIUrl":null,"url":null,"abstract":"Depression is a complicated mental health disorder with many different forms and symptoms. Traditional methods face barriers when detecting and diagnosing depression, including social stigma and societal labeling. As social media platforms became commonplace for information sharing, their anonymity meant that the barriers had considerably lessened. An alternative method to depression detection being researched is using social media data to build machine learning models for depression detection. To that end, this research uses machine learning models to incorporate new user networks and user engagement features into depression detection on Twitter users. These two features provide an additional understanding of users and may significantly affect depression detection. A Twitter dataset is constructed to include additional data on users' following list and the history of liked tweets not examined in prior studies. Ten machine learning models are constructed using five different machine learning algorithms tested on two sets of features. Models with proposed features outperformed other machine learning models without proposed features, with the best model yielding 82.05% performance for both accuracy and F1 score. This study discovered that the most important feature is the number of depression keywords in liked tweets, with at least twice the gain compared to 88% of other features used. Topic modelling features for liked tweets also have high gain and are important in detecting depression. Additionally, features derived from original tweets, replies, and liked tweets have higher gain and are more important than retweets and quote tweets in detecting depression.","PeriodicalId":142482,"journal":{"name":"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Depression Detection on Social Media With User Network and Engagement Features Using Machine Learning Methods\",\"authors\":\"Aik Seng Liaw, Hui Na Chua\",\"doi\":\"10.1109/IICAIET55139.2022.9936814\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Depression is a complicated mental health disorder with many different forms and symptoms. Traditional methods face barriers when detecting and diagnosing depression, including social stigma and societal labeling. As social media platforms became commonplace for information sharing, their anonymity meant that the barriers had considerably lessened. An alternative method to depression detection being researched is using social media data to build machine learning models for depression detection. To that end, this research uses machine learning models to incorporate new user networks and user engagement features into depression detection on Twitter users. These two features provide an additional understanding of users and may significantly affect depression detection. A Twitter dataset is constructed to include additional data on users' following list and the history of liked tweets not examined in prior studies. Ten machine learning models are constructed using five different machine learning algorithms tested on two sets of features. Models with proposed features outperformed other machine learning models without proposed features, with the best model yielding 82.05% performance for both accuracy and F1 score. This study discovered that the most important feature is the number of depression keywords in liked tweets, with at least twice the gain compared to 88% of other features used. Topic modelling features for liked tweets also have high gain and are important in detecting depression. Additionally, features derived from original tweets, replies, and liked tweets have higher gain and are more important than retweets and quote tweets in detecting depression.\",\"PeriodicalId\":142482,\"journal\":{\"name\":\"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IICAIET55139.2022.9936814\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IICAIET55139.2022.9936814","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

抑郁症是一种复杂的精神健康障碍,有许多不同的形式和症状。传统方法在检测和诊断抑郁症时面临障碍,包括社会耻辱和社会标签。随着社交媒体平台成为信息共享的普遍平台,它们的匿名性意味着障碍大大减少。正在研究的抑郁症检测的另一种方法是使用社交媒体数据建立抑郁症检测的机器学习模型。为此,本研究使用机器学习模型将新的用户网络和用户参与特征整合到Twitter用户的抑郁检测中。这两个功能提供了对用户的额外了解,并可能显著影响抑郁症检测。构建了一个Twitter数据集,以包括用户的关注列表和先前研究中未检查的喜欢的推文历史的附加数据。使用五种不同的机器学习算法在两组特征上进行测试,构建了十个机器学习模型。具有提出的特征的模型优于其他没有提出特征的机器学习模型,最佳模型在准确率和F1分数方面的性能均达到82.05%。这项研究发现,最重要的功能是“喜欢”推文中抑郁关键词的数量,与88%的其他功能相比,它的收益至少是前者的两倍。喜欢的推文的主题建模功能也具有高增益,并且在检测抑郁方面很重要。此外,从原始推文、回复和喜欢的推文中获得的特征在检测抑郁症方面具有更高的增益,并且比转发和引用推文更重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Depression Detection on Social Media With User Network and Engagement Features Using Machine Learning Methods
Depression is a complicated mental health disorder with many different forms and symptoms. Traditional methods face barriers when detecting and diagnosing depression, including social stigma and societal labeling. As social media platforms became commonplace for information sharing, their anonymity meant that the barriers had considerably lessened. An alternative method to depression detection being researched is using social media data to build machine learning models for depression detection. To that end, this research uses machine learning models to incorporate new user networks and user engagement features into depression detection on Twitter users. These two features provide an additional understanding of users and may significantly affect depression detection. A Twitter dataset is constructed to include additional data on users' following list and the history of liked tweets not examined in prior studies. Ten machine learning models are constructed using five different machine learning algorithms tested on two sets of features. Models with proposed features outperformed other machine learning models without proposed features, with the best model yielding 82.05% performance for both accuracy and F1 score. This study discovered that the most important feature is the number of depression keywords in liked tweets, with at least twice the gain compared to 88% of other features used. Topic modelling features for liked tweets also have high gain and are important in detecting depression. Additionally, features derived from original tweets, replies, and liked tweets have higher gain and are more important than retweets and quote tweets in detecting depression.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信