NLP@NISER: Classification of COVID19 tweets containing symptoms

Deepak Kumar, Nalin Kumar, Subhankar Mishra
{"title":"NLP@NISER: Classification of COVID19 tweets containing symptoms","authors":"Deepak Kumar, Nalin Kumar, Subhankar Mishra","doi":"10.18653/V1/2021.SMM4H-1.19","DOIUrl":null,"url":null,"abstract":"In this paper, we describe our approaches for task six of Social Media Mining for Health Applications (SMM4H) shared task in 2021. The task is to classify twitter tweets containing COVID-19 symptoms in three classes (self-reports, non-personal reports & literature/news mentions). We implemented BERT and XLNet for this text classification task. Best result was achieved by XLNet approach, which is F1 score 0.94, precision 0.9448 and recall 0.94448. This is slightly better than the average score, i.e. F1 score 0.93, precision 0.93235 and recall 0.93235.","PeriodicalId":378985,"journal":{"name":"Proceedings of the Sixth Social Media Mining for Health (#SMM4H) Workshop and Shared Task","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Sixth Social Media Mining for Health (#SMM4H) Workshop and Shared Task","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.18653/V1/2021.SMM4H-1.19","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

In this paper, we describe our approaches for task six of Social Media Mining for Health Applications (SMM4H) shared task in 2021. The task is to classify twitter tweets containing COVID-19 symptoms in three classes (self-reports, non-personal reports & literature/news mentions). We implemented BERT and XLNet for this text classification task. Best result was achieved by XLNet approach, which is F1 score 0.94, precision 0.9448 and recall 0.94448. This is slightly better than the average score, i.e. F1 score 0.93, precision 0.93235 and recall 0.93235.
NLP@NISER:包含症状的covid - 19推文分类
在本文中,我们描述了我们在2021年的健康应用社交媒体挖掘(SMM4H)共享任务的任务六的方法。任务是将包含COVID-19症状的推文分为三类(自我报告、非个人报告和文献/新闻提及)。我们为这个文本分类任务实现了BERT和XLNet。采用XLNet方法的结果最好,F1得分为0.94,准确率为0.9448,召回率为0.94448。这比平均得分略好,即F1得分0.93,精度0.93235,召回率0.93235。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信