Source Attribution of Online News Images by Compression Analysis

Michael Albright, Nitesh Menon, Kristy Roschke, Arslan Basharat
{"title":"Source Attribution of Online News Images by Compression Analysis","authors":"Michael Albright, Nitesh Menon, Kristy Roschke, Arslan Basharat","doi":"10.1109/WIFS53200.2021.9648385","DOIUrl":null,"url":null,"abstract":"The rapid increase in the amount of online disinformation warrants new and robust digital forensics methods for validating purported sources of multimodal news articles. We conducted a survey of news photojournalists for insights into their workflows. A high percentage (91%) of respondents reported standardized photo publishing procedures, which we hypothesize facilitates source verification. In this work, we demonstrate that the online news sites leave predictable and discernible patterns in the compression settings of the images they publish. We propose novel, simple, and very efficient algorithms to analyze the image compression profiles for news source verification and identification. We evaluate the algorithms' effectiveness through extensive experiments on a newly-released dataset of over 64K images from over 34K articles collected from 30 news sites. The image compression features are modeled by Naive Bayes variants or XGBoost classifiers for source attribution and verification. For these news sources we are able to achieve very strong performance with the proposed algorithms resulting in 0.92–0.94 average AUC for source verification under a closed set scenario, and compelling open set generalization with only 0.0–0.04 reduction in the average AUC.","PeriodicalId":196985,"journal":{"name":"2021 IEEE International Workshop on Information Forensics and Security (WIFS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Workshop on Information Forensics and Security (WIFS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WIFS53200.2021.9648385","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The rapid increase in the amount of online disinformation warrants new and robust digital forensics methods for validating purported sources of multimodal news articles. We conducted a survey of news photojournalists for insights into their workflows. A high percentage (91%) of respondents reported standardized photo publishing procedures, which we hypothesize facilitates source verification. In this work, we demonstrate that the online news sites leave predictable and discernible patterns in the compression settings of the images they publish. We propose novel, simple, and very efficient algorithms to analyze the image compression profiles for news source verification and identification. We evaluate the algorithms' effectiveness through extensive experiments on a newly-released dataset of over 64K images from over 34K articles collected from 30 news sites. The image compression features are modeled by Naive Bayes variants or XGBoost classifiers for source attribution and verification. For these news sources we are able to achieve very strong performance with the proposed algorithms resulting in 0.92–0.94 average AUC for source verification under a closed set scenario, and compelling open set generalization with only 0.0–0.04 reduction in the average AUC.
基于压缩分析的在线新闻图像来源归属
在线虚假信息数量的迅速增加需要新的和强大的数字取证方法来验证多式联运新闻文章的据称来源。我们对新闻摄影记者进行了一项调查,以了解他们的工作流程。高百分比(91%)的受访者报告了标准化的照片发布程序,我们假设这有助于来源验证。在这项工作中,我们证明了在线新闻网站在他们发布的图像的压缩设置中留下了可预测和可识别的模式。我们提出新颖、简单、高效的算法来分析新闻源验证和识别的图像压缩配置文件。我们在一个新发布的数据集上进行了广泛的实验,该数据集包含来自30个新闻网站的34K多篇文章的64K多张图像,从而评估了算法的有效性。图像压缩特征由朴素贝叶斯变体或XGBoost分类器建模,用于源属性和验证。对于这些新闻源,我们能够通过所提出的算法获得非常强的性能,在封闭集场景下,源验证的平均AUC为0.92-0.94,而开放集泛化的平均AUC仅降低了0.0-0.04。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信