Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision

2020 IEEE International Workshop on Information Forensics and Security (WIFS) Pub Date : 2020-09-25 DOI:10.1109/WIFS49906.2020.9360903

S. Mandelli, Nicolò Bonettini, Paolo Bestagini, S. Tubaro

{"title":"Training CNNs in Presence of JPEG Compression: Multimedia Forensics vs Computer Vision","authors":"S. Mandelli, Nicolò Bonettini, Paolo Bestagini, S. Tubaro","doi":"10.1109/WIFS49906.2020.9360903","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNNs) have proved very accurate in multiple computer vision image classification tasks that required visual inspection in the past (e.g., object recognition, face detection, etc.). Motivated by these astonishing results, researchers have also started using CNNs to cope with image forensic problems (e.g., camera model identification, tampering detection, etc.). However, in computer vision, image classification methods typically rely on visual cues easily detectable by human eyes. Conversely, forensic solutions rely on almost invisible traces that are often very subtle and lie in the fine details of the image under analysis. For this reason, training a CNN to solve a forensic task requires some special care, as common processing operations (e.g., resampling, compression, etc.) can strongly hinder forensic traces. In this work, we focus on the effect that JPEG has on CNN training considering different computer vision and forensic image classification problems. Specifically, we consider the issues that rise from JPEG compression and misalignment of the JPEG grid. We show that it is necessary to consider these effects when generating a training dataset in order to properly train a forensic detector not losing generalization capability, whereas it is almost possible to ignore these effects for computer vision tasks.","PeriodicalId":354881,"journal":{"name":"2020 IEEE International Workshop on Information Forensics and Security (WIFS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Workshop on Information Forensics and Security (WIFS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WIFS49906.2020.9360903","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

Abstract

Convolutional Neural Networks (CNNs) have proved very accurate in multiple computer vision image classification tasks that required visual inspection in the past (e.g., object recognition, face detection, etc.). Motivated by these astonishing results, researchers have also started using CNNs to cope with image forensic problems (e.g., camera model identification, tampering detection, etc.). However, in computer vision, image classification methods typically rely on visual cues easily detectable by human eyes. Conversely, forensic solutions rely on almost invisible traces that are often very subtle and lie in the fine details of the image under analysis. For this reason, training a CNN to solve a forensic task requires some special care, as common processing operations (e.g., resampling, compression, etc.) can strongly hinder forensic traces. In this work, we focus on the effect that JPEG has on CNN training considering different computer vision and forensic image classification problems. Specifically, we consider the issues that rise from JPEG compression and misalignment of the JPEG grid. We show that it is necessary to consider these effects when generating a training dataset in order to properly train a forensic detector not losing generalization capability, whereas it is almost possible to ignore these effects for computer vision tasks.

查看原文本刊更多论文

在JPEG压缩下训练cnn:多媒体取证vs计算机视觉

卷积神经网络(cnn)在过去需要视觉检查的多种计算机视觉图像分类任务(例如，物体识别，人脸检测等)中被证明是非常准确的。在这些惊人结果的激励下，研究人员也开始使用cnn来处理图像取证问题(例如，相机模型识别，篡改检测等)。然而，在计算机视觉中，图像分类方法通常依赖于人眼容易检测到的视觉线索。相反，法医解决方案依赖于几乎看不见的痕迹，这些痕迹通常非常微妙，存在于被分析图像的精细细节中。因此，训练CNN解决取证任务需要特别注意，因为常见的处理操作(如重采样、压缩等)会严重阻碍取证痕迹。在这项工作中，我们关注JPEG对CNN训练的影响，考虑不同的计算机视觉和法医图像分类问题。具体来说，我们考虑了JPEG压缩和JPEG网格不对齐引起的问题。我们表明，在生成训练数据集时，有必要考虑这些影响，以便正确地训练取证检测器而不失去泛化能力，而对于计算机视觉任务，几乎可以忽略这些影响。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2020 IEEE International Workshop on Information Forensics and Security (WIFS)

自引率

0.00%

发文量