DVMark: A Deep Multiscale Framework for Video Watermarking.

IF 10.8 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

IEEE Transactions on Image Processing Pub Date : 2023-03-28 DOI:10.1109/TIP.2023.3251737

Xiyang Luo, Yinxiao Li, Huiwen Chang, Ce Liu, Peyman Milanfar, Feng Yang

引用次数: 0

Abstract

Video watermarking embeds a message into a cover video in an imperceptible manner, which can be retrieved even if the video undergoes certain modifications or distortions. Traditional watermarking methods are often manually designed for particular types of distortions and thus cannot simultaneously handle a broad spectrum of distortions. To this end, we propose a robust deep learning-based solution for video watermarking that is end-to-end trainable. Our model consists of a novel multiscale design where the watermarks are distributed across multiple spatial-temporal scales. Extensive evaluations on a wide variety of distortions show that our method outperforms traditional video watermarking methods as well as deep image watermarking models by a large margin. We further demonstrate the practicality of our method on a realistic video-editing application.

查看原文本刊更多论文

DVMark：用于视频水印的深度多尺度框架。

视频水印以不易察觉的方式将信息嵌入封面视频中，即使视频发生了某些修改或失真，也能被检索到。传统的水印方法通常是针对特定类型的失真手动设计的，因此无法同时处理各种失真。为此，我们提出了一种基于深度学习的稳健的视频水印解决方案，该方案可进行端到端训练。我们的模型包含一种新颖的多尺度设计，其中的水印分布在多个时空尺度上。对各种失真的广泛评估表明，我们的方法远远优于传统的视频水印方法和深度图像水印模型。我们还在实际的视频编辑应用中进一步证明了我们方法的实用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Image Processing 工程技术-工程：电子与电气

CiteScore

20.90

自引率

6.60%

发文量

774

审稿时长

7.6 months

期刊介绍： The IEEE Transactions on Image Processing delves into groundbreaking theories, algorithms, and structures concerning the generation, acquisition, manipulation, transmission, scrutiny, and presentation of images, video, and multidimensional signals across diverse applications. Topics span mathematical, statistical, and perceptual aspects, encompassing modeling, representation, formation, coding, filtering, enhancement, restoration, rendering, halftoning, search, and analysis of images, video, and multidimensional signals. Pertinent applications range from image and video communications to electronic imaging, biomedical imaging, image and video systems, and remote sensing.