A Novel Approach to Train Diverse Types of Language Models for Health Mention Classification of Tweets

Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society) Pub Date : 2022-04-13 DOI:10.48550/arXiv.2204.06337

Pervaiz Iqbal Khan, Imran Razzak, A. Dengel, Sheraz Ahmed

{"title":"A Novel Approach to Train Diverse Types of Language Models for Health Mention Classification of Tweets","authors":"Pervaiz Iqbal Khan, Imran Razzak, A. Dengel, Sheraz Ahmed","doi":"10.48550/arXiv.2204.06337","DOIUrl":null,"url":null,"abstract":"Health mention classification deals with the disease detection in a given text containing disease words. However, non-health and figurative use of disease words adds challenges to the task. Recently, adversarial training acting as a means of regularization has gained popularity in many NLP tasks. In this paper, we propose a novel approach to train language models for health mention classification of tweets that involves adversarial training. We generate adversarial examples by adding perturbation to the representations of transformer models for tweet examples at various levels using Gaussian noise. Further, we employ contrastive loss as an additional objective function. We evaluate the proposed method on the PHM2017 dataset extended version. Results show that our proposed approach improves the performance of classifier significantly over the baseline methods. Moreover, our analysis shows that adding noise at earlier layers improves models' performance whereas adding noise at intermediate layers deteriorates models' performance. Finally, adding noise towards the final layers performs better than the middle layers noise addition.","PeriodicalId":93416,"journal":{"name":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","volume":"157 1","pages":"136-147"},"PeriodicalIF":0.0000,"publicationDate":"2022-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2204.06337","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Health mention classification deals with the disease detection in a given text containing disease words. However, non-health and figurative use of disease words adds challenges to the task. Recently, adversarial training acting as a means of regularization has gained popularity in many NLP tasks. In this paper, we propose a novel approach to train language models for health mention classification of tweets that involves adversarial training. We generate adversarial examples by adding perturbation to the representations of transformer models for tweet examples at various levels using Gaussian noise. Further, we employ contrastive loss as an additional objective function. We evaluate the proposed method on the PHM2017 dataset extended version. Results show that our proposed approach improves the performance of classifier significantly over the baseline methods. Moreover, our analysis shows that adding noise at earlier layers improves models' performance whereas adding noise at intermediate layers deteriorates models' performance. Finally, adding noise towards the final layers performs better than the middle layers noise addition.

查看原文本刊更多论文

一种训练不同类型的推文健康提及分类语言模型的新方法

健康提及分类处理包含疾病词的给定文本中的疾病检测。然而，非健康和比喻性的疾病词汇的使用给这项任务增加了挑战。最近，对抗训练作为一种正则化手段在许多NLP任务中得到了普及。在本文中，我们提出了一种新的方法来训练涉及对抗性训练的推文健康提及分类的语言模型。我们通过使用高斯噪声在不同级别的推文示例的变压器模型的表示中添加扰动来生成对抗性示例。此外，我们采用对比损失作为一个额外的目标函数。我们在PHM2017数据集扩展版本上对所提出的方法进行了评估。结果表明，与基线方法相比，我们提出的方法显著提高了分类器的性能。此外，我们的分析表明，在早期层添加噪声可以提高模型的性能，而在中间层添加噪声会降低模型的性能。最后，向最后一层添加噪声比中间层添加噪声效果更好。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Artificial neural networks, ICANN : international conference ... proceedings. International Conference on Artificial Neural Networks (European Neural Network Society)

自引率

0.00%

发文量