fastcrew：性能还是效率？一种基于fastnet的轻量级条件残差dnn水印

IF 11.1 1区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Transactions on Circuits and Systems for Video Technology Pub Date : 2025-03-14 DOI:10.1109/TCSVT.2025.3550908

Baowei Wang;Jianbo Zhang;Yufeng Wu;Qi Cui

{"title":"fastcrew：性能还是效率？一种基于fastnet的轻量级条件残差dnn水印","authors":"Baowei Wang;Jianbo Zhang;Yufeng Wu;Qi Cui","doi":"10.1109/TCSVT.2025.3550908","DOIUrl":null,"url":null,"abstract":"Deep neural networks (DNNs) based watermarking algorithms have made significant strides in recent years. However, existing methods either demand substantial resources for image feature extraction during watermark embedding, sacrificing efficiency, or completely neglect image texture information, resulting in suboptimal performance. Moreover, current algorithms struggle with real-time watermark extraction. To address these limitations, we propose a lightweight conditional residual watermarking (CReW) architecture. Specifically, CReW employs a Conditional Generative Adversarial Network (CGAN) framework to generate an adaptive residual image guided by the structure of the cover image, which is decoupled from the network to reduce computational complexity. This design enables CReW to achieve an optimal balance between performance and efficiency. Additionally, by directly optimizing the residual image to capture variations in watermark behavior under distortion, CReW significantly enhances robustness. Furthermore, we design redundancy coding blocks to increase the mutual information of the watermark, along with a patch-level discriminator to improve local patch discrimination, thereby further enhancing image quality. Finally, by reducing channel redundancy and leveraging FasterNet, we developed a low-complexity network architecture, FasterCReW, which facilitates real-time watermark embedding and extraction. Extensive experimental results demonstrate that, despite having <inline-formula> <tex-math>$36 \\times $ </tex-math></inline-formula> fewer network parameters and <inline-formula> <tex-math>$30\\times $ </tex-math></inline-formula> fewer floating point operations (FLOPs) than Adaptor, FasterCReW exhibits excellent robustness against distortions such as cropout, JPEG compression, and Gaussian noise. Furthermore, FasterCReW significantly outperforms other existing DNN-based watermarking algorithms in terms of running speed, achieving an <inline-formula> <tex-math>$8\\times $ </tex-math></inline-formula> speed increase over UDH and a <inline-formula> <tex-math>$28\\times $ </tex-math></inline-formula> increase over Adaptor on an Intel Core i7-8750H CPU.","PeriodicalId":13082,"journal":{"name":"IEEE Transactions on Circuits and Systems for Video Technology","volume":"35 8","pages":"7732-7746"},"PeriodicalIF":11.1000,"publicationDate":"2025-03-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FasterCReW: Performance or Efficiency? A Lightweight Conditional Residual DNN-Based Watermarking Based on FasterNet\",\"authors\":\"Baowei Wang;Jianbo Zhang;Yufeng Wu;Qi Cui\",\"doi\":\"10.1109/TCSVT.2025.3550908\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Deep neural networks (DNNs) based watermarking algorithms have made significant strides in recent years. However, existing methods either demand substantial resources for image feature extraction during watermark embedding, sacrificing efficiency, or completely neglect image texture information, resulting in suboptimal performance. Moreover, current algorithms struggle with real-time watermark extraction. To address these limitations, we propose a lightweight conditional residual watermarking (CReW) architecture. Specifically, CReW employs a Conditional Generative Adversarial Network (CGAN) framework to generate an adaptive residual image guided by the structure of the cover image, which is decoupled from the network to reduce computational complexity. This design enables CReW to achieve an optimal balance between performance and efficiency. Additionally, by directly optimizing the residual image to capture variations in watermark behavior under distortion, CReW significantly enhances robustness. Furthermore, we design redundancy coding blocks to increase the mutual information of the watermark, along with a patch-level discriminator to improve local patch discrimination, thereby further enhancing image quality. Finally, by reducing channel redundancy and leveraging FasterNet, we developed a low-complexity network architecture, FasterCReW, which facilitates real-time watermark embedding and extraction. Extensive experimental results demonstrate that, despite having <inline-formula> <tex-math>$36 \\\\times $ </tex-math></inline-formula> fewer network parameters and <inline-formula> <tex-math>$30\\\\times $ </tex-math></inline-formula> fewer floating point operations (FLOPs) than Adaptor, FasterCReW exhibits excellent robustness against distortions such as cropout, JPEG compression, and Gaussian noise. Furthermore, FasterCReW significantly outperforms other existing DNN-based watermarking algorithms in terms of running speed, achieving an <inline-formula> <tex-math>$8\\\\times $ </tex-math></inline-formula> speed increase over UDH and a <inline-formula> <tex-math>$28\\\\times $ </tex-math></inline-formula> increase over Adaptor on an Intel Core i7-8750H CPU.\",\"PeriodicalId\":13082,\"journal\":{\"name\":\"IEEE Transactions on Circuits and Systems for Video Technology\",\"volume\":\"35 8\",\"pages\":\"7732-7746\"},\"PeriodicalIF\":11.1000,\"publicationDate\":\"2025-03-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Circuits and Systems for Video Technology\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10925430/\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Circuits and Systems for Video Technology","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10925430/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

摘要

近年来，基于深度神经网络（dnn）的水印算法取得了重大进展。然而，现有方法在水印嵌入过程中要么需要大量的图像特征提取资源，牺牲了效率，要么完全忽略了图像纹理信息，导致性能不理想。此外，目前的算法难以实现实时水印提取。为了解决这些限制，我们提出了一种轻量级的条件剩余水印（CReW）架构。CReW采用条件生成对抗网络（Conditional Generative Adversarial Network， CGAN）框架，根据覆盖图像的结构生成自适应残差图像，该残差图像与网络解耦，以降低计算复杂度。这种设计使CReW能够实现性能和效率之间的最佳平衡。此外，CReW通过直接优化残差图像来捕捉失真情况下水印行为的变化，显著增强了鲁棒性。此外，我们设计了冗余编码块来增加水印的互信息，并设计了一个补丁级鉴别器来提高局部补丁的识别能力，从而进一步提高图像质量。最后，通过减少信道冗余并利用FasterNet，我们开发了一种低复杂度的网络架构FasterCReW，该架构便于实时水印嵌入和提取。大量的实验结果表明，尽管与Adaptor相比，fastcrew的网络参数少了36倍，浮点运算（FLOPs）少了30倍，但它对裁剪、JPEG压缩和高斯噪声等失真具有出色的鲁棒性。此外，fastcrew在运行速度方面显著优于其他现有的基于dnn的水印算法，在英特尔酷睿i7-8750H CPU上，比UDH速度提高了8倍，比Adaptor速度提高了28倍。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

FasterCReW: Performance or Efficiency? A Lightweight Conditional Residual DNN-Based Watermarking Based on FasterNet

Deep neural networks (DNNs) based watermarking algorithms have made significant strides in recent years. However, existing methods either demand substantial resources for image feature extraction during watermark embedding, sacrificing efficiency, or completely neglect image texture information, resulting in suboptimal performance. Moreover, current algorithms struggle with real-time watermark extraction. To address these limitations, we propose a lightweight conditional residual watermarking (CReW) architecture. Specifically, CReW employs a Conditional Generative Adversarial Network (CGAN) framework to generate an adaptive residual image guided by the structure of the cover image, which is decoupled from the network to reduce computational complexity. This design enables CReW to achieve an optimal balance between performance and efficiency. Additionally, by directly optimizing the residual image to capture variations in watermark behavior under distortion, CReW significantly enhances robustness. Furthermore, we design redundancy coding blocks to increase the mutual information of the watermark, along with a patch-level discriminator to improve local patch discrimination, thereby further enhancing image quality. Finally, by reducing channel redundancy and leveraging FasterNet, we developed a low-complexity network architecture, FasterCReW, which facilitates real-time watermark embedding and extraction. Extensive experimental results demonstrate that, despite having

$36 \times $

fewer network parameters and

$30\times $

fewer floating point operations (FLOPs) than Adaptor, FasterCReW exhibits excellent robustness against distortions such as cropout, JPEG compression, and Gaussian noise. Furthermore, FasterCReW significantly outperforms other existing DNN-based watermarking algorithms in terms of running speed, achieving an

$8\times $

speed increase over UDH and a

$28\times $

increase over Adaptor on an Intel Core i7-8750H CPU.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IEEE Transactions on Circuits and Systems for Video Technology 工程技术-工程：电子与电气

CiteScore

13.80

自引率

27.40%

发文量

660

审稿时长

5 months

期刊介绍： The IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) is dedicated to covering all aspects of video technologies from a circuits and systems perspective. We encourage submissions of general, theoretical, and application-oriented papers related to image and video acquisition, representation, presentation, and display. Additionally, we welcome contributions in areas such as processing, filtering, and transforms; analysis and synthesis; learning and understanding; compression, transmission, communication, and networking; as well as storage, retrieval, indexing, and search. Furthermore, papers focusing on hardware and software design and implementation are highly valued. Join us in advancing the field of video technology through innovative research and insights.