SentiMage:使用机器学习的基于情感图像的COVID-19健康错误信息检测

K. Ramakrishnan, Vimala Balakrishnan
{"title":"SentiMage:使用机器学习的基于情感图像的COVID-19健康错误信息检测","authors":"K. Ramakrishnan, Vimala Balakrishnan","doi":"10.1109/ICECCME55909.2022.9987818","DOIUrl":null,"url":null,"abstract":"The rapid dissemination of misinformation (generally known as fake news) has become worrisome, especially during the on-going COVID-19 pandemic both globally, and locally. In fact, the proliferation of health-related misinformation intensified on social media, which many experts believe is contributing to the threats of the pandemic. Sentiment has been shown to improve detection mechanisms in various social media related studies, however this aspect is under-researched in the context of health misinformation. Further, metadata such as location or image that constitute part of real and fake news were not fully explored as well. This study develops a health misinformation detection model using machine learning algorithms, and further assesses the impact of sentiment and image on the model performance. Local data gathered from a fact-checking portal were pre-processed, translated, and used to train the detection model. Evaluation results show Support Vector Machine to yield the best performance with 99.4% for F-measure and accuracy of 99.1%, followed closely by Random Forest when sentiment was included, however, the presence of image was not found to significantly improve health misinformation detection.","PeriodicalId":202568,"journal":{"name":"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SentiMage: A Sentiment-Image-based COVID-19 Health Misinformation Detection using Machine Learning\",\"authors\":\"K. Ramakrishnan, Vimala Balakrishnan\",\"doi\":\"10.1109/ICECCME55909.2022.9987818\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rapid dissemination of misinformation (generally known as fake news) has become worrisome, especially during the on-going COVID-19 pandemic both globally, and locally. In fact, the proliferation of health-related misinformation intensified on social media, which many experts believe is contributing to the threats of the pandemic. Sentiment has been shown to improve detection mechanisms in various social media related studies, however this aspect is under-researched in the context of health misinformation. Further, metadata such as location or image that constitute part of real and fake news were not fully explored as well. This study develops a health misinformation detection model using machine learning algorithms, and further assesses the impact of sentiment and image on the model performance. Local data gathered from a fact-checking portal were pre-processed, translated, and used to train the detection model. Evaluation results show Support Vector Machine to yield the best performance with 99.4% for F-measure and accuracy of 99.1%, followed closely by Random Forest when sentiment was included, however, the presence of image was not found to significantly improve health misinformation detection.\",\"PeriodicalId\":202568,\"journal\":{\"name\":\"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICECCME55909.2022.9987818\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Electrical, Computer, Communications and Mechatronics Engineering (ICECCME)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECCME55909.2022.9987818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

错误信息(通常被称为假新闻)的迅速传播已经变得令人担忧,特别是在全球和地区正在进行的COVID-19大流行期间。事实上,社交媒体上与健康有关的错误信息的扩散加剧了,许多专家认为这加剧了疫情的威胁。在各种与社交媒体相关的研究中,情绪已被证明可以改善检测机制,然而,在健康错误信息的背景下,这方面的研究还不足。此外,构成真假新闻一部分的位置或图像等元数据也没有得到充分探索。本研究利用机器学习算法开发了一个健康错误信息检测模型,并进一步评估了情绪和图像对模型性能的影响。从事实检查门户收集的本地数据经过预处理、翻译并用于训练检测模型。评估结果显示,支持向量机在f度量方面的表现最好,达到99.4%,准确率为99.1%,紧随其后的是随机森林,当包含情绪时,然而,图像的存在并没有显著提高健康错误信息的检测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
SentiMage: A Sentiment-Image-based COVID-19 Health Misinformation Detection using Machine Learning
The rapid dissemination of misinformation (generally known as fake news) has become worrisome, especially during the on-going COVID-19 pandemic both globally, and locally. In fact, the proliferation of health-related misinformation intensified on social media, which many experts believe is contributing to the threats of the pandemic. Sentiment has been shown to improve detection mechanisms in various social media related studies, however this aspect is under-researched in the context of health misinformation. Further, metadata such as location or image that constitute part of real and fake news were not fully explored as well. This study develops a health misinformation detection model using machine learning algorithms, and further assesses the impact of sentiment and image on the model performance. Local data gathered from a fact-checking portal were pre-processed, translated, and used to train the detection model. Evaluation results show Support Vector Machine to yield the best performance with 99.4% for F-measure and accuracy of 99.1%, followed closely by Random Forest when sentiment was included, however, the presence of image was not found to significantly improve health misinformation detection.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信