ACOVMD:使用自训练半监督混合深度学习模型在Twitter中自动检测COVID - 19错误信息

Q2 Social Sciences

International Social Science Journal Pub Date : 2023-11-13 DOI:10.1111/issj.12475

S. Selva Birunda, R. Kanniga Devi, M. Muthukannan, M. Mahesh Babu

{"title":"ACOVMD:使用自训练半监督混合深度学习模型在Twitter中自动检测COVID - 19错误信息","authors":"S. Selva Birunda, R. Kanniga Devi, M. Muthukannan, M. Mahesh Babu","doi":"10.1111/issj.12475","DOIUrl":null,"url":null,"abstract":"Abstract During the COVID‐19 pandemic, online social networks are extensively utilized, more than ever before by 8.4%, resulting in the propagation of false information related to COVID‐19. Despite the existence of many fake news detection models; annotation inconsistency, memory consumption, accurate and self‐trained efficient algorithms for detecting the emerging COVID‐19 misinformation tweets are still challenging. Hence, the main aim of this work is to come up with a self‐trained semi‐supervised model that accurately and automatically detects the reliability of emerging COVID‐19 tweets without delay. In this work, COVID‐19 tweet dataset is created in English Language from the period January 2020 to January 2022 as a ground truth database. Then self‐trained semi‐supervised hybrid deep learning model is proposed to train both supervised and unsupervised components simultaneously using the created dataset. The proposed model is self‐trained repeatedly and the model gets updated to predict the reliability of upcoming COVID‐19 tweets that differ from training tweets. We performed experiments multiple times by limiting the percentage amount of labelled tweets shown to the model, namely 80%, 50%, 40%, 30%, 20% and 10% labelled tweets, respectively. Experimental results show that the proposed model achieves 80.92% accuracy and 98.15% accuracy in the 10% and 80% label‐seen experiments, respectively. This shows a clear rising trend in the performance curve. Therefore, this technique will be useful for effectively classifying voluminous amounts of emerging tweets generated as part of the COVID‐19 infodemic. The proposed model may efficiently use a huge amount of unlabelled tweets and enhance the model's generalization performance.","PeriodicalId":35727,"journal":{"name":"International Social Science Journal","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ACOVMD: Automatic COVID‐19 misinformation detection in Twitter using self‐trained semi‐supervised hybrid deep learning model\",\"authors\":\"S. Selva Birunda, R. Kanniga Devi, M. Muthukannan, M. Mahesh Babu\",\"doi\":\"10.1111/issj.12475\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract During the COVID‐19 pandemic, online social networks are extensively utilized, more than ever before by 8.4%, resulting in the propagation of false information related to COVID‐19. Despite the existence of many fake news detection models; annotation inconsistency, memory consumption, accurate and self‐trained efficient algorithms for detecting the emerging COVID‐19 misinformation tweets are still challenging. Hence, the main aim of this work is to come up with a self‐trained semi‐supervised model that accurately and automatically detects the reliability of emerging COVID‐19 tweets without delay. In this work, COVID‐19 tweet dataset is created in English Language from the period January 2020 to January 2022 as a ground truth database. Then self‐trained semi‐supervised hybrid deep learning model is proposed to train both supervised and unsupervised components simultaneously using the created dataset. The proposed model is self‐trained repeatedly and the model gets updated to predict the reliability of upcoming COVID‐19 tweets that differ from training tweets. We performed experiments multiple times by limiting the percentage amount of labelled tweets shown to the model, namely 80%, 50%, 40%, 30%, 20% and 10% labelled tweets, respectively. Experimental results show that the proposed model achieves 80.92% accuracy and 98.15% accuracy in the 10% and 80% label‐seen experiments, respectively. This shows a clear rising trend in the performance curve. Therefore, this technique will be useful for effectively classifying voluminous amounts of emerging tweets generated as part of the COVID‐19 infodemic. The proposed model may efficiently use a huge amount of unlabelled tweets and enhance the model's generalization performance.\",\"PeriodicalId\":35727,\"journal\":{\"name\":\"International Social Science Journal\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-11-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Social Science Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1111/issj.12475\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Social Science Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1111/issj.12475","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 0

摘要

在COVID - 19大流行期间，在线社交网络被广泛使用，比以往任何时候都多8.4%，导致与COVID - 19相关的虚假信息传播。尽管存在许多假新闻检测模型;注释不一致、内存消耗、检测新出现的COVID - 19错误信息推文的准确和自我训练的高效算法仍然具有挑战性。因此，这项工作的主要目的是提出一个自我训练的半监督模型，该模型可以准确、自动地检测新出现的COVID - 19推文的可靠性。在这项工作中，使用英语创建了2020年1月至2022年1月期间的COVID - 19推文数据集，作为基础事实数据库。然后提出了自训练半监督混合深度学习模型，利用创建的数据集同时训练有监督和无监督组件。所提出的模型是反复自我训练的，并且模型得到更新，以预测即将到来的与训练推文不同的COVID - 19推文的可靠性。我们通过限制标注推文显示给模型的百分比，分别为80%、50%、40%、30%、20%和10%，进行了多次实验。实验结果表明，在10%和80%的标签率下，该模型的准确率分别达到80.92%和98.15%。这显示了性能曲线上明显的上升趋势。因此，这项技术将有助于有效地对作为COVID - 19信息流行的一部分生成的大量新推文进行分类。该模型可以有效地利用大量的未标记推文，提高模型的泛化性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

ACOVMD: Automatic COVID‐19 misinformation detection in Twitter using self‐trained semi‐supervised hybrid deep learning model

Abstract During the COVID‐19 pandemic, online social networks are extensively utilized, more than ever before by 8.4%, resulting in the propagation of false information related to COVID‐19. Despite the existence of many fake news detection models; annotation inconsistency, memory consumption, accurate and self‐trained efficient algorithms for detecting the emerging COVID‐19 misinformation tweets are still challenging. Hence, the main aim of this work is to come up with a self‐trained semi‐supervised model that accurately and automatically detects the reliability of emerging COVID‐19 tweets without delay. In this work, COVID‐19 tweet dataset is created in English Language from the period January 2020 to January 2022 as a ground truth database. Then self‐trained semi‐supervised hybrid deep learning model is proposed to train both supervised and unsupervised components simultaneously using the created dataset. The proposed model is self‐trained repeatedly and the model gets updated to predict the reliability of upcoming COVID‐19 tweets that differ from training tweets. We performed experiments multiple times by limiting the percentage amount of labelled tweets shown to the model, namely 80%, 50%, 40%, 30%, 20% and 10% labelled tweets, respectively. Experimental results show that the proposed model achieves 80.92% accuracy and 98.15% accuracy in the 10% and 80% label‐seen experiments, respectively. This shows a clear rising trend in the performance curve. Therefore, this technique will be useful for effectively classifying voluminous amounts of emerging tweets generated as part of the COVID‐19 infodemic. The proposed model may efficiently use a huge amount of unlabelled tweets and enhance the model's generalization performance.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Social Science Journal Social Sciences-Social Sciences (all)

CiteScore

1.90

自引率

0.00%

发文量

期刊介绍： The International Social Science Journal bridges social science communities across disciplines and continents with a view to sharing information and debate with the widest possible audience. The ISSJ has a particular focus on interdisciplinary and transdisciplinary work that pushes the boundaries of current approaches, and welcomes both applied and theoretical research. Originally founded by UNESCO in 1949, ISSJ has since grown into a forum for innovative review, reflection and discussion informed by recent and ongoing international, social science research. It provides a home for work that asks questions in new ways and/or employs original methods to classic problems and whose insights have implications across the disciplines and beyond the academy. The journal publishes regular editions featuring rigorous, peer-reviewed research articles that reflect its international and heterodox scope.