Depression Intensity Identification using Transformer Ensemble Technique for the Resource-constrained Bengali Language

Journal of Engineering Advancements Pub Date : 2024-05-10 DOI:10.38032/jea.2024.02.001

Md. Nesarul Hoque, Umme Salma, Md. Jamal Uddin, Sadia Afrin Shampa

{"title":"Depression Intensity Identification using Transformer Ensemble Technique for the Resource-constrained Bengali Language","authors":"Md. Nesarul Hoque, Umme Salma, Md. Jamal Uddin, Sadia Afrin Shampa","doi":"10.38032/jea.2024.02.001","DOIUrl":null,"url":null,"abstract":"Depression is an ordinary mental health-related disorder that hampers people’s daily activities, and sometimes, it destroys an individual’s life. It is one of the major social issues at present. Since depressed people use various social networking sites for sharing their thoughts and feelings, many scholars have tried to identify depression texts in highly resourced languages like English; however, only a small quantity of papers are detected in the resource-constrained Bengali language. This paper focuses on developing a depression intensity detection system from Bengali text data. In this regard, this study experiments on a 2,596 sample-sized dataset with four levels of depression by utilizing five state-of-the-art transformer models, including multilingual Bidirectional Encoder Representations from Transformers, DistilmBERT, XLM-RoBERTa, Bangla-BERT-Base, and BanglaBERT, and suggests a new ensemble method called MaxOfAvgProb. This method goes beyond the performance of the previous work on the same dataset, scoring 63.47% F1-score and 62.90% accuracy. To increase the reliability of the proposed method, we utilize this approach on another available dataset with 4,897 entries. In this case, our recommended method also surpasses the performance of the existing work on the same dataset, with accuracy at 86.45% and F1-score at 86.35%. Identifying the intensity of depression, depressed people may get proper counseling or treatment from their respected guardians or psychologists according to the victims’ level of depression.","PeriodicalId":292407,"journal":{"name":"Journal of Engineering Advancements","volume":" 16","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Engineering Advancements","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.38032/jea.2024.02.001","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Depression is an ordinary mental health-related disorder that hampers people’s daily activities, and sometimes, it destroys an individual’s life. It is one of the major social issues at present. Since depressed people use various social networking sites for sharing their thoughts and feelings, many scholars have tried to identify depression texts in highly resourced languages like English; however, only a small quantity of papers are detected in the resource-constrained Bengali language. This paper focuses on developing a depression intensity detection system from Bengali text data. In this regard, this study experiments on a 2,596 sample-sized dataset with four levels of depression by utilizing five state-of-the-art transformer models, including multilingual Bidirectional Encoder Representations from Transformers, DistilmBERT, XLM-RoBERTa, Bangla-BERT-Base, and BanglaBERT, and suggests a new ensemble method called MaxOfAvgProb. This method goes beyond the performance of the previous work on the same dataset, scoring 63.47% F1-score and 62.90% accuracy. To increase the reliability of the proposed method, we utilize this approach on another available dataset with 4,897 entries. In this case, our recommended method also surpasses the performance of the existing work on the same dataset, with accuracy at 86.45% and F1-score at 86.35%. Identifying the intensity of depression, depressed people may get proper counseling or treatment from their respected guardians or psychologists according to the victims’ level of depression.

查看原文本刊更多论文

利用变压器集合技术为资源有限的孟加拉语识别抑郁强度

抑郁症是一种普通的与精神健康有关的疾病，它妨碍人们的日常活动，有时甚至会毁掉一个人的一生。它是当前主要的社会问题之一。由于抑郁症患者会使用各种社交网站来分享他们的想法和感受，许多学者都试图用英语等资源丰富的语言来识别抑郁症文本；然而，在资源有限的孟加拉语中只能检测到少量的论文。本文的重点是从孟加拉语文本数据中开发抑郁强度检测系统。在这方面，本研究利用五种最先进的变换器模型，包括多语种变换器双向编码器表示法、DistilmBERT、XLM-RoBERTa、Bangla-BERT-Base 和 BanglaBERT，在 2596 个样本大小的数据集上进行了实验，并提出了一种名为 MaxOfAvgProb 的新集合方法。该方法在同一数据集上的表现超越了之前的工作，F1 分数为 63.47%，准确率为 62.90%。为了提高建议方法的可靠性，我们在另一个包含 4,897 个条目的可用数据集上使用了这种方法。在这种情况下，我们推荐的方法在同一数据集上的表现也超过了现有工作，准确率为 86.45%，F1 分数为 86.35%。通过识别抑郁症的强度，抑郁症患者可以根据其抑郁程度，从其尊敬的监护人或心理学家那里获得适当的咨询或治疗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Engineering Advancements

自引率

0.00%

发文量