SentiMage:使用机器学习的基于情感图像的COVID-19健康错误信息检测

2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME) Pub Date : 2022-11-16 DOI:10.1109/ICECCME55909.2022.9987818

K. Ramakrishnan, Vimala Balakrishnan

{"title":"SentiMage:使用机器学习的基于情感图像的COVID-19健康错误信息检测","authors":"K. Ramakrishnan, Vimala Balakrishnan","doi":"10.1109/ICECCME55909.2022.9987818","DOIUrl":null,"url":null,"abstract":"The rapid dissemination of misinformation (generally known as fake news) has become worrisome, especially during the on-going COVID-19 pandemic both globally, and locally. In fact, the proliferation of health-related misinformation intensified on social media, which many experts believe is contributing to the threats of the pandemic. Sentiment has been shown to improve detection mechanisms in various social media related studies, however this aspect is under-researched in the context of health misinformation. Further, metadata such as location or image that constitute part of real and fake news were not fully explored as well. This study develops a health misinformation detection model using machine learning algorithms, and further assesses the impact of sentiment and image on the model performance. Local data gathered from a fact-checking portal were pre-processed, translated, and used to train the detection model. Evaluation results show Support Vector Machine to yield the best performance with 99.4% for F-measure and accuracy of 99.1%, followed closely by Random Forest when sentiment was included, however, the presence of image was not found to significantly improve health misinformation detection.","PeriodicalId":202568,"journal":{"name":"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SentiMage: A Sentiment-Image-based COVID-19 Health Misinformation Detection using Machine Learning\",\"authors\":\"K. Ramakrishnan, Vimala Balakrishnan\",\"doi\":\"10.1109/ICECCME55909.2022.9987818\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rapid dissemination of misinformation (generally known as fake news) has become worrisome, especially during the on-going COVID-19 pandemic both globally, and locally. In fact, the proliferation of health-related misinformation intensified on social media, which many experts believe is contributing to the threats of the pandemic. Sentiment has been shown to improve detection mechanisms in various social media related studies, however this aspect is under-researched in the context of health misinformation. Further, metadata such as location or image that constitute part of real and fake news were not fully explored as well. This study develops a health misinformation detection model using machine learning algorithms, and further assesses the impact of sentiment and image on the model performance. Local data gathered from a fact-checking portal were pre-processed, translated, and used to train the detection model. Evaluation results show Support Vector Machine to yield the best performance with 99.4% for F-measure and accuracy of 99.1%, followed closely by Random Forest when sentiment was included, however, the presence of image was not found to significantly improve health misinformation detection.\",\"PeriodicalId\":202568,\"journal\":{\"name\":\"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECCME55909.2022.9987818\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECCME55909.2022.9987818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

错误信息(通常被称为假新闻)的迅速传播已经变得令人担忧，特别是在全球和地区正在进行的COVID-19大流行期间。事实上，社交媒体上与健康有关的错误信息的扩散加剧了，许多专家认为这加剧了疫情的威胁。在各种与社交媒体相关的研究中，情绪已被证明可以改善检测机制，然而，在健康错误信息的背景下，这方面的研究还不足。此外，构成真假新闻一部分的位置或图像等元数据也没有得到充分探索。本研究利用机器学习算法开发了一个健康错误信息检测模型，并进一步评估了情绪和图像对模型性能的影响。从事实检查门户收集的本地数据经过预处理、翻译并用于训练检测模型。评估结果显示，支持向量机在f度量方面的表现最好，达到99.4%，准确率为99.1%，紧随其后的是随机森林，当包含情绪时，然而，图像的存在并没有显著提高健康错误信息的检测。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

SentiMage: A Sentiment-Image-based COVID-19 Health Misinformation Detection using Machine Learning

The rapid dissemination of misinformation (generally known as fake news) has become worrisome, especially during the on-going COVID-19 pandemic both globally, and locally. In fact, the proliferation of health-related misinformation intensified on social media, which many experts believe is contributing to the threats of the pandemic. Sentiment has been shown to improve detection mechanisms in various social media related studies, however this aspect is under-researched in the context of health misinformation. Further, metadata such as location or image that constitute part of real and fake news were not fully explored as well. This study develops a health misinformation detection model using machine learning algorithms, and further assesses the impact of sentiment and image on the model performance. Local data gathered from a fact-checking portal were pre-processed, translated, and used to train the detection model. Evaluation results show Support Vector Machine to yield the best performance with 99.4% for F-measure and accuracy of 99.1%, followed closely by Random Forest when sentiment was included, however, the presence of image was not found to significantly improve health misinformation detection.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)

自引率

0.00%

发文量