心理健康耻辱与自然语言处理:有限语料库镜头下的两个谜

2022 IEEE World AI IoT Congress (AIIoT) Pub Date : 2022-06-06 DOI:10.1109/aiiot54504.2022.9817362

Min Hyung Lee, Richard Kyung

{"title":"心理健康耻辱与自然语言处理:有限语料库镜头下的两个谜","authors":"Min Hyung Lee, Richard Kyung","doi":"10.1109/aiiot54504.2022.9817362","DOIUrl":null,"url":null,"abstract":"Mental health stigma is an elephant in the room. It exacerbates one's illness, impedes approaches to treatment, and ultimately contributes to the persistence of a “mental health epidemic.” A definitive solution for managing stigmatized language is yet to be discovered, especially on the internet, where stigma is virtually ubiquitous in the forms of user posts, text messages, and biased articles. This study proposes text classification, a subset of natural language processing (NLP), as a solution to identify stigma in context. NLP is frequently used to detect human sentiments and emotions to eradicate hate speech, racism, and personal attacks; however, it has not been thoroughly explored in the field of mental health stigma, and the lack of preexisting data presents a challenge. Facing limited resources, the study hypothesized that the BERT model's fine-tuning method allowed for a small corpus to provide satisfactory results. The model returned surprisingly impressive results (0.94 accuracies, 0.91 F1-Score). The study not only confirms that NLP can be used as an effective solution to detect and later reduce stigma but also that the BERT model is still proficient with a limited corpus. Therefore, NLP tasks historically focused on thoroughly researched fields with an abundance of data, can also be used effectively in underdeveloped, unexplored fields of research that currently lack the datasets needed for training.","PeriodicalId":409264,"journal":{"name":"2022 IEEE World AI IoT Congress (AIIoT)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Mental Health Stigma and Natural Language Processing: Two Enigmas Through the Lens of a Limited Corpus\",\"authors\":\"Min Hyung Lee, Richard Kyung\",\"doi\":\"10.1109/aiiot54504.2022.9817362\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Mental health stigma is an elephant in the room. It exacerbates one's illness, impedes approaches to treatment, and ultimately contributes to the persistence of a “mental health epidemic.” A definitive solution for managing stigmatized language is yet to be discovered, especially on the internet, where stigma is virtually ubiquitous in the forms of user posts, text messages, and biased articles. This study proposes text classification, a subset of natural language processing (NLP), as a solution to identify stigma in context. NLP is frequently used to detect human sentiments and emotions to eradicate hate speech, racism, and personal attacks; however, it has not been thoroughly explored in the field of mental health stigma, and the lack of preexisting data presents a challenge. Facing limited resources, the study hypothesized that the BERT model's fine-tuning method allowed for a small corpus to provide satisfactory results. The model returned surprisingly impressive results (0.94 accuracies, 0.91 F1-Score). The study not only confirms that NLP can be used as an effective solution to detect and later reduce stigma but also that the BERT model is still proficient with a limited corpus. Therefore, NLP tasks historically focused on thoroughly researched fields with an abundance of data, can also be used effectively in underdeveloped, unexplored fields of research that currently lack the datasets needed for training.\",\"PeriodicalId\":409264,\"journal\":{\"name\":\"2022 IEEE World AI IoT Congress (AIIoT)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE World AI IoT Congress (AIIoT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/aiiot54504.2022.9817362\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE World AI IoT Congress (AIIoT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/aiiot54504.2022.9817362","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

心理健康的耻辱是一个显而易见的问题。它加剧了一个人的疾病，阻碍了治疗方法，最终导致了“精神健康流行病”的持续存在。管理污名化语言的最终解决方案尚未找到，特别是在互联网上，污名化几乎无处不在，以用户帖子、短信和有偏见的文章的形式存在。本研究提出文本分类，自然语言处理(NLP)的一个子集，作为在上下文中识别耻辱的解决方案。NLP经常被用来检测人类的情绪和情绪，以消除仇恨言论、种族主义和人身攻击;然而，在精神健康耻辱领域尚未对其进行彻底探索，并且缺乏先前存在的数据提出了挑战。面对有限的资源，本研究假设BERT模型的微调方法允许小语料库提供令人满意的结果。该模型返回了令人惊讶的令人印象深刻的结果(0.94准确率，0.91 F1-Score)。该研究不仅证实了NLP可以作为一种有效的解决方案来检测和后来减少病耻感，而且BERT模型仍然精通有限的语料库。因此，历史上专注于具有丰富数据的彻底研究领域的NLP任务也可以有效地用于目前缺乏训练所需数据集的不发达，未开发的研究领域。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Mental Health Stigma and Natural Language Processing: Two Enigmas Through the Lens of a Limited Corpus

Mental health stigma is an elephant in the room. It exacerbates one's illness, impedes approaches to treatment, and ultimately contributes to the persistence of a “mental health epidemic.” A definitive solution for managing stigmatized language is yet to be discovered, especially on the internet, where stigma is virtually ubiquitous in the forms of user posts, text messages, and biased articles. This study proposes text classification, a subset of natural language processing (NLP), as a solution to identify stigma in context. NLP is frequently used to detect human sentiments and emotions to eradicate hate speech, racism, and personal attacks; however, it has not been thoroughly explored in the field of mental health stigma, and the lack of preexisting data presents a challenge. Facing limited resources, the study hypothesized that the BERT model's fine-tuning method allowed for a small corpus to provide satisfactory results. The model returned surprisingly impressive results (0.94 accuracies, 0.91 F1-Score). The study not only confirms that NLP can be used as an effective solution to detect and later reduce stigma but also that the BERT model is still proficient with a limited corpus. Therefore, NLP tasks historically focused on thoroughly researched fields with an abundance of data, can also be used effectively in underdeveloped, unexplored fields of research that currently lack the datasets needed for training.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE World AI IoT Congress (AIIoT)

自引率

0.00%

发文量