具有混合特征的假新闻检测模型——新闻文本、图像和社会语境

IF 6.9 3区 管理学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS
Szu-Yin Lin, Ya-Han Hu, Pei-Ju Lee, Yi-Hua Zeng, Chi-Min Chang, Hsiao-Chuan Chang
{"title":"具有混合特征的假新闻检测模型——新闻文本、图像和社会语境","authors":"Szu-Yin Lin, Ya-Han Hu, Pei-Ju Lee, Yi-Hua Zeng, Chi-Min Chang, Hsiao-Chuan Chang","doi":"10.1007/s10796-025-10589-z","DOIUrl":null,"url":null,"abstract":"<p>With the evolving realm of news propagation and the surge in social media usage, detecting and combatting fake news has become an increasingly important issue. Currently, fake news detection employs three main feature categories: news text, social context, and news images. However, most studies emphasize just one, while only a limited number incorporate image features. This study presents an innovative hybrid fake news detection model amalgamating text mining technology to extract news text features, user information on Twitter to extract social context features, and VGG19 model to extract news image features to increase the model's accuracy. We harness four diverse machine learning algorithms (Logistic Regression, Random Forest, Support Vector Machine, and Extreme Gradient Boosting) to construct models and evaluate their performance via Precision, Recall, F1-Score, and Accuracy metrics. Results indicate the fusion of news text, social context, and image features outperforms their individual application, yielding a noteworthy 92.5% overall accuracy. Significantly, social context attributes, encompassing users, publishers, and distribution networks, contribute crucial insights into detecting early-stage fake news dissemination. Consequently, our study bolsters fact-checking entities by furnishing them with news-content insights for verification and equips social media platforms with a potent fake news detection model—comprising news content, imagery, and user-centric social context data—to discern erroneous information.</p>","PeriodicalId":13610,"journal":{"name":"Information Systems Frontiers","volume":"67 1","pages":""},"PeriodicalIF":6.9000,"publicationDate":"2025-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fake News Detection Model with Hybrid Features—News Text, Image, and Social Context\",\"authors\":\"Szu-Yin Lin, Ya-Han Hu, Pei-Ju Lee, Yi-Hua Zeng, Chi-Min Chang, Hsiao-Chuan Chang\",\"doi\":\"10.1007/s10796-025-10589-z\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>With the evolving realm of news propagation and the surge in social media usage, detecting and combatting fake news has become an increasingly important issue. Currently, fake news detection employs three main feature categories: news text, social context, and news images. However, most studies emphasize just one, while only a limited number incorporate image features. This study presents an innovative hybrid fake news detection model amalgamating text mining technology to extract news text features, user information on Twitter to extract social context features, and VGG19 model to extract news image features to increase the model's accuracy. We harness four diverse machine learning algorithms (Logistic Regression, Random Forest, Support Vector Machine, and Extreme Gradient Boosting) to construct models and evaluate their performance via Precision, Recall, F1-Score, and Accuracy metrics. Results indicate the fusion of news text, social context, and image features outperforms their individual application, yielding a noteworthy 92.5% overall accuracy. Significantly, social context attributes, encompassing users, publishers, and distribution networks, contribute crucial insights into detecting early-stage fake news dissemination. Consequently, our study bolsters fact-checking entities by furnishing them with news-content insights for verification and equips social media platforms with a potent fake news detection model—comprising news content, imagery, and user-centric social context data—to discern erroneous information.</p>\",\"PeriodicalId\":13610,\"journal\":{\"name\":\"Information Systems Frontiers\",\"volume\":\"67 1\",\"pages\":\"\"},\"PeriodicalIF\":6.9000,\"publicationDate\":\"2025-03-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Systems Frontiers\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1007/s10796-025-10589-z\",\"RegionNum\":3,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Systems Frontiers","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s10796-025-10589-z","RegionNum":3,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

随着新闻传播领域的不断发展和社交媒体使用的激增,发现和打击假新闻已经成为一个越来越重要的问题。目前,假新闻检测主要采用三种特征类别:新闻文本、社会语境和新闻图像。然而,大多数研究只强调一种,而只有少数研究结合了图像特征。本研究提出了一种创新的混合假新闻检测模型,该模型融合了文本挖掘技术提取新闻文本特征,Twitter用户信息提取社会语境特征,VGG19模型提取新闻图像特征,以提高模型的准确性。我们利用四种不同的机器学习算法(逻辑回归、随机森林、支持向量机和极端梯度增强)来构建模型,并通过精度、召回率、F1-Score和准确性指标来评估它们的性能。结果表明,新闻文本、社会背景和图像特征的融合优于它们单独的应用,产生了值得注意的92.5%的总体准确率。值得注意的是,包括用户、出版商和分发网络在内的社会背景属性,为检测早期假新闻传播提供了至关重要的见解。因此,我们的研究通过为事实核查实体提供新闻内容见解以进行验证,并为社交媒体平台提供强大的假新闻检测模型(包括新闻内容,图像和以用户为中心的社会背景数据)来识别错误信息,从而支持事实核查实体。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Fake News Detection Model with Hybrid Features—News Text, Image, and Social Context

With the evolving realm of news propagation and the surge in social media usage, detecting and combatting fake news has become an increasingly important issue. Currently, fake news detection employs three main feature categories: news text, social context, and news images. However, most studies emphasize just one, while only a limited number incorporate image features. This study presents an innovative hybrid fake news detection model amalgamating text mining technology to extract news text features, user information on Twitter to extract social context features, and VGG19 model to extract news image features to increase the model's accuracy. We harness four diverse machine learning algorithms (Logistic Regression, Random Forest, Support Vector Machine, and Extreme Gradient Boosting) to construct models and evaluate their performance via Precision, Recall, F1-Score, and Accuracy metrics. Results indicate the fusion of news text, social context, and image features outperforms their individual application, yielding a noteworthy 92.5% overall accuracy. Significantly, social context attributes, encompassing users, publishers, and distribution networks, contribute crucial insights into detecting early-stage fake news dissemination. Consequently, our study bolsters fact-checking entities by furnishing them with news-content insights for verification and equips social media platforms with a potent fake news detection model—comprising news content, imagery, and user-centric social context data—to discern erroneous information.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Systems Frontiers
Information Systems Frontiers 工程技术-计算机:理论方法
CiteScore
13.30
自引率
18.60%
发文量
127
审稿时长
9 months
期刊介绍: The interdisciplinary interfaces of Information Systems (IS) are fast emerging as defining areas of research and development in IS. These developments are largely due to the transformation of Information Technology (IT) towards networked worlds and its effects on global communications and economies. While these developments are shaping the way information is used in all forms of human enterprise, they are also setting the tone and pace of information systems of the future. The major advances in IT such as client/server systems, the Internet and the desktop/multimedia computing revolution, for example, have led to numerous important vistas of research and development with considerable practical impact and academic significance. While the industry seeks to develop high performance IS/IT solutions to a variety of contemporary information support needs, academia looks to extend the reach of IS technology into new application domains. Information Systems Frontiers (ISF) aims to provide a common forum of dissemination of frontline industrial developments of substantial academic value and pioneering academic research of significant practical impact.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信