{"title":"A hybrid BERT-CPSO model for multi-class depression detection using pure hindi and hinglish multimodal data on social media","authors":"Rohit Beniwal, Pavi Saraswat","doi":"10.1016/j.compeleceng.2024.109786","DOIUrl":null,"url":null,"abstract":"<div><div>Due to the psychological strain that depression causes, there has been a noticeable increase in the number of persons compromising their lives in recent years. Social media platforms provide researchers with an entirely novel viewpoint on identifying individuals who are depressed. Previous research on automatic learning models for depression detection revealed low detection accuracy and an absence of optimizing techniques that could enhance detection accuracy. Furthermore, there is no such dataset, and very little study has been done on the multimodal pure Hindi and code-mixed Hinglish language domains. In light of this, we developed a Hindi dataset and suggested reliable methods for depression detection based on multimodal data, i.e., text and images, using the Hindi and Hinglish languages. This study aims to accomplish three things: first, it will evaluate text data using an effective Bidirectional Encoder Representations from Transformers (BERT) approach and compare it with other transfer learning variants; second, it will analyze image data by suggesting a Convolutional Neural Network (CNN) optimized with a nature-inspired algorithm, namely Particle Swarm Optimization (PSO), or CPSO; and third, it will classify the multimodal data into depressive and non-depressive posts by suggesting a hybrid of the best-performing models on text and images, namely BERT-CPSO (BTCPSO). The results produced with the BERT model showed the best accuracy of 95% for text data, in contrast to RoBERTa, DistilBERT, and XLNet. Further, CPSO outperforms other Machine Learning (ML) and Deep Learning (DL) algorithms for image data with an accuracy of 95%. Additionally, comparing the proposed CPSO with a basic CNN revealed that integrating the PSO technique with CNN increased the model's accuracy in detecting depressed posts by 5%. In conclusion, hybrid BERT-CPSO outperforms other BERT combinations with ML and DL algorithms for multimodal data, achieving 97%, 95%, 98%, and 96%, respectively, in accuracy, recall, precision, and F1-scores. As a result, the findings of comparing the suggested technique with the earlier models show the effectiveness of the approach that has been provided and can help medical professionals diagnose depression with precision.</div></div>","PeriodicalId":50630,"journal":{"name":"Computers & Electrical Engineering","volume":"120 ","pages":"Article 109786"},"PeriodicalIF":4.0000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Electrical Engineering","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0045790624007134","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Due to the psychological strain that depression causes, there has been a noticeable increase in the number of persons compromising their lives in recent years. Social media platforms provide researchers with an entirely novel viewpoint on identifying individuals who are depressed. Previous research on automatic learning models for depression detection revealed low detection accuracy and an absence of optimizing techniques that could enhance detection accuracy. Furthermore, there is no such dataset, and very little study has been done on the multimodal pure Hindi and code-mixed Hinglish language domains. In light of this, we developed a Hindi dataset and suggested reliable methods for depression detection based on multimodal data, i.e., text and images, using the Hindi and Hinglish languages. This study aims to accomplish three things: first, it will evaluate text data using an effective Bidirectional Encoder Representations from Transformers (BERT) approach and compare it with other transfer learning variants; second, it will analyze image data by suggesting a Convolutional Neural Network (CNN) optimized with a nature-inspired algorithm, namely Particle Swarm Optimization (PSO), or CPSO; and third, it will classify the multimodal data into depressive and non-depressive posts by suggesting a hybrid of the best-performing models on text and images, namely BERT-CPSO (BTCPSO). The results produced with the BERT model showed the best accuracy of 95% for text data, in contrast to RoBERTa, DistilBERT, and XLNet. Further, CPSO outperforms other Machine Learning (ML) and Deep Learning (DL) algorithms for image data with an accuracy of 95%. Additionally, comparing the proposed CPSO with a basic CNN revealed that integrating the PSO technique with CNN increased the model's accuracy in detecting depressed posts by 5%. In conclusion, hybrid BERT-CPSO outperforms other BERT combinations with ML and DL algorithms for multimodal data, achieving 97%, 95%, 98%, and 96%, respectively, in accuracy, recall, precision, and F1-scores. As a result, the findings of comparing the suggested technique with the earlier models show the effectiveness of the approach that has been provided and can help medical professionals diagnose depression with precision.
期刊介绍:
The impact of computers has nowhere been more revolutionary than in electrical engineering. The design, analysis, and operation of electrical and electronic systems are now dominated by computers, a transformation that has been motivated by the natural ease of interface between computers and electrical systems, and the promise of spectacular improvements in speed and efficiency.
Published since 1973, Computers & Electrical Engineering provides rapid publication of topical research into the integration of computer technology and computational techniques with electrical and electronic systems. The journal publishes papers featuring novel implementations of computers and computational techniques in areas like signal and image processing, high-performance computing, parallel processing, and communications. Special attention will be paid to papers describing innovative architectures, algorithms, and software tools.