Natural Language Processing and Lexical Approach for Depression Symptoms Screening of Indonesian Twitter User

Irwan Oyong, Ema Utami, E. T. Luthfi
{"title":"Natural Language Processing and Lexical Approach for Depression Symptoms Screening of Indonesian Twitter User","authors":"Irwan Oyong, Ema Utami, E. T. Luthfi","doi":"10.1109/ICITEED.2018.8534929","DOIUrl":null,"url":null,"abstract":"The shame and disgrace of depression cause people not to seek help for the problem. Contemporary social media technologies such as Twitter provide an opportunity for people to be able to express their feelings in an anonymous and confidential environment. In this study symptoms lexicon terms are used with the frequency lexicon for weighting and perform depression score calculations from text materials that have undergone Natural Language Processing. Through the scraping process, 55 relevant users were found who netted depression-related keywords. For each user, a data search is made for a week before and after the initial tweet. Tweet extensions generate a total of 6055 tweets from 55 users in question. The sum of the scores becomes the label determinant of user depression, which is then compared to the labels that have been given manually by psychologists based on clinical screening standards. Based on comparative and evaluation results, the same F1 score of 0.47 is obtained for standard text processing and text-specific processing for Twitter, and a Sensitivity value of 0.89 at the threshold value of 0.5. A slightly better F1 Score value of 0.50 is obtained by text-specific processing on the threshold value of 0.8. Research shows differences in minor results between standard text processing and text-specific processing of Twitter. Seen some of the advantages of text-specific Twitter processing when handling non-standard text, thereby enhancing the fmdings of the Part-of-Speech tagging process.","PeriodicalId":142523,"journal":{"name":"2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"150 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEED.2018.8534929","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

Abstract

The shame and disgrace of depression cause people not to seek help for the problem. Contemporary social media technologies such as Twitter provide an opportunity for people to be able to express their feelings in an anonymous and confidential environment. In this study symptoms lexicon terms are used with the frequency lexicon for weighting and perform depression score calculations from text materials that have undergone Natural Language Processing. Through the scraping process, 55 relevant users were found who netted depression-related keywords. For each user, a data search is made for a week before and after the initial tweet. Tweet extensions generate a total of 6055 tweets from 55 users in question. The sum of the scores becomes the label determinant of user depression, which is then compared to the labels that have been given manually by psychologists based on clinical screening standards. Based on comparative and evaluation results, the same F1 score of 0.47 is obtained for standard text processing and text-specific processing for Twitter, and a Sensitivity value of 0.89 at the threshold value of 0.5. A slightly better F1 Score value of 0.50 is obtained by text-specific processing on the threshold value of 0.8. Research shows differences in minor results between standard text processing and text-specific processing of Twitter. Seen some of the advantages of text-specific Twitter processing when handling non-standard text, thereby enhancing the fmdings of the Part-of-Speech tagging process.
印尼Twitter用户抑郁症状筛选的自然语言处理和词汇方法
抑郁症带来的羞耻感和耻辱使人们不去寻求帮助。像Twitter这样的当代社交媒体技术为人们提供了一个在匿名和保密的环境中表达自己感受的机会。在本研究中,症状词汇词汇与频率词汇一起用于加权,并从经过自然语言处理的文本材料中进行抑郁评分计算。通过抓取过程,发现了55个相关用户,他们上网了与抑郁症相关的关键词。对于每个用户,在初始tweet之前和之后的一周内进行数据搜索。Tweet扩展生成来自55个用户的总共6055条Tweet。分数的总和成为用户抑郁的标签决定因素,然后将其与心理学家根据临床筛选标准手动给出的标签进行比较。根据对比和评价结果,Twitter的标准文本处理和特定文本处理F1得分相同,均为0.47,在阈值为0.5时,Sensitivity值为0.89。在阈值为0.8的基础上,通过文本特定处理,F1 Score值略好,为0.50。研究表明,Twitter的标准文本处理和特定于文本的处理在次要结果上存在差异。在处理非标准文本时,看到了特定于文本的Twitter处理的一些优点,从而增强了词性标记过程的发现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信