Classifying Severity Level of Psychiatric Symptoms on Twitter Data

M. Negash, Michael Melese Woldeyohannis
{"title":"Classifying Severity Level of Psychiatric Symptoms on Twitter Data","authors":"M. Negash, Michael Melese Woldeyohannis","doi":"10.1109/ict4da53266.2021.9672228","DOIUrl":null,"url":null,"abstract":"Internet-based social media sites such as Twitter represent a growing level of modern experience. These sites claim a large number of users and their influence is increasingly being experienced in clinical practice. Psychiatric disorders currently are affecting many people from different cultures, ages, and geographic locations. Although, the majority of individuals who experience symptom of psychiatric disorder practice the desire to be isolated, which drives them to use online channels to share their feelings. Hence, this sites provide a way to detect undiagnosed psychiatric disorders. In order to address this issue, we propose a model to classify the severity level of psychiatric symptoms (i.e. depression, anxiety, and bipolar) based on a data extracted from Twitter. The model is employed by fusing the linguistic features of Term Frequency Inverse Document Frequency (TFIDF) weighed by N-gram (unigram, bigram, and trigram), and word2vce, with Pattern of Life Feature (PLF) that take polarity, subjectivity, and gender. The experiment, is conducted by incorporating with machine learning classifiers of Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), and Naïve Bayes (NB). Experimental results show that SVM with features of TFIDF weighed by unigram combined with PLF outperforms with an accuracy score of 97.3%. In future, the proposed model could be employed to include lexicon-based features for classifying various psychiatric symptoms with a combined approach of machine learning and lexicon-based classification.","PeriodicalId":371663,"journal":{"name":"2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Information and Communication Technology for Development for Africa (ICT4DA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ict4da53266.2021.9672228","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Internet-based social media sites such as Twitter represent a growing level of modern experience. These sites claim a large number of users and their influence is increasingly being experienced in clinical practice. Psychiatric disorders currently are affecting many people from different cultures, ages, and geographic locations. Although, the majority of individuals who experience symptom of psychiatric disorder practice the desire to be isolated, which drives them to use online channels to share their feelings. Hence, this sites provide a way to detect undiagnosed psychiatric disorders. In order to address this issue, we propose a model to classify the severity level of psychiatric symptoms (i.e. depression, anxiety, and bipolar) based on a data extracted from Twitter. The model is employed by fusing the linguistic features of Term Frequency Inverse Document Frequency (TFIDF) weighed by N-gram (unigram, bigram, and trigram), and word2vce, with Pattern of Life Feature (PLF) that take polarity, subjectivity, and gender. The experiment, is conducted by incorporating with machine learning classifiers of Support Vector Machine (SVM), Logistic Regression (LR), Random Forest (RF), and Naïve Bayes (NB). Experimental results show that SVM with features of TFIDF weighed by unigram combined with PLF outperforms with an accuracy score of 97.3%. In future, the proposed model could be employed to include lexicon-based features for classifying various psychiatric symptoms with a combined approach of machine learning and lexicon-based classification.
在Twitter数据上对精神症状的严重程度进行分类
以互联网为基础的社交媒体网站,如Twitter,代表了现代体验水平的不断提高。这些网站声称拥有大量用户,并且在临床实践中越来越多地体验到它们的影响。精神疾病目前影响着许多来自不同文化、年龄和地理位置的人。尽管如此,大多数有精神障碍症状的人都有被孤立的愿望,这促使他们使用在线渠道来分享他们的感受。因此,这些网站提供了一种检测未确诊精神疾病的方法。为了解决这个问题,我们提出了一个模型,根据从Twitter提取的数据对精神症状(即抑郁、焦虑和双相情感障碍)的严重程度进行分类。该模型将n元(单元、双元和三元)加权的词频逆文档频率(TFIDF)和单词的语言特征与具有极性、主观性和性别的生活模式特征(PLF)融合在一起。该实验结合了支持向量机(SVM)、逻辑回归(LR)、随机森林(RF)和Naïve贝叶斯(NB)等机器学习分类器进行。实验结果表明,采用单图加权TFIDF特征与PLF特征相结合的SVM准确率达到97.3%。未来,提出的模型可以采用机器学习和基于词典的分类相结合的方法,包括基于词典的特征来分类各种精神症状。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信