NGEMD & ADV-NGEMD: a unified framework for high-capacity, efficient, and adversarially secure image steganography

IF 3.1 4区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of Visual Communication and Image Representation Pub Date : 2025-08-31 DOI:10.1016/j.jvcir.2025.104575

Hanieh Rafiei , Mojtaba Mahdavi , Ahmad Reza NaghshNilchi

{"title":"NGEMD & ADV-NGEMD: a unified framework for high-capacity, efficient, and adversarially secure image steganography","authors":"Hanieh Rafiei , Mojtaba Mahdavi , Ahmad Reza NaghshNilchi","doi":"10.1016/j.jvcir.2025.104575","DOIUrl":null,"url":null,"abstract":"<div><div>Exploring image steganography, one prominent technique is Exploiting Modification Direction (EMD), which is favored for its high efficiency achieved through minimal image alterations. However, this efficiency comes at the cost of low capacity, prompting the development of numerous EMD‐based methods primarily focused on increasing payload. Yet, none have managed to simultaneously deliver both high capacity and optimal efficiency. To address these shortcomings, we introduce NGEMD—a next-generation EMD‐based steganographic framework that determines optimal extraction coefficients and bases solely from the pixel group length, maximum per‐pixel change, and number of modifiable pixels <span><math><mrow><mo>(</mo><mi>n</mi><mo>,</mo><mi>z</mi><mo>,</mo><mi>k</mi><mo>)</mo></mrow></math></span>. By deriving recursive relations to establish an ary‐notational system and employing a systematic solution based on the Chinese Remainder Theorem, NGEMD maximizes both capacity and efficiency while significantly reducing computational cost. Due to the inherent weakness of conventional EMD‐based methods against modern steganalysis, we further develop ADV‐NGEMD (Adversarially-NGEMD). We present a scheme to resist deep learning‐based steganalyzers such as YeNet, called ADV‐NGEMD, by considering the hidden message as an adversarial vector and applying changes based on the opposite sign of the gradient while controlling the modifications through a customized cost function. Comprehensive experiments confirm that both NGEMD and ADV‐NGEMD deliver exceptional performance, achieving high payload capacities (up to 2.5 bpp) while preserving visual quality (with PSNR values up to 58 dB and SSIM above 0.99) and, for instance, significantly increasing miss detection rates—from 4 % in NGEMD to as high as 60 % in ADV‐NGEMD at comparable capacities—without sacrificing their high‐capacity advantages.</div></div>","PeriodicalId":54755,"journal":{"name":"Journal of Visual Communication and Image Representation","volume":"112 ","pages":"Article 104575"},"PeriodicalIF":3.1000,"publicationDate":"2025-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Visual Communication and Image Representation","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047320325001890","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Exploring image steganography, one prominent technique is Exploiting Modification Direction (EMD), which is favored for its high efficiency achieved through minimal image alterations. However, this efficiency comes at the cost of low capacity, prompting the development of numerous EMD‐based methods primarily focused on increasing payload. Yet, none have managed to simultaneously deliver both high capacity and optimal efficiency. To address these shortcomings, we introduce NGEMD—a next-generation EMD‐based steganographic framework that determines optimal extraction coefficients and bases solely from the pixel group length, maximum per‐pixel change, and number of modifiable pixels

(n, z, k)

. By deriving recursive relations to establish an ary‐notational system and employing a systematic solution based on the Chinese Remainder Theorem, NGEMD maximizes both capacity and efficiency while significantly reducing computational cost. Due to the inherent weakness of conventional EMD‐based methods against modern steganalysis, we further develop ADV‐NGEMD (Adversarially-NGEMD). We present a scheme to resist deep learning‐based steganalyzers such as YeNet, called ADV‐NGEMD, by considering the hidden message as an adversarial vector and applying changes based on the opposite sign of the gradient while controlling the modifications through a customized cost function. Comprehensive experiments confirm that both NGEMD and ADV‐NGEMD deliver exceptional performance, achieving high payload capacities (up to 2.5 bpp) while preserving visual quality (with PSNR values up to 58 dB and SSIM above 0.99) and, for instance, significantly increasing miss detection rates—from 4 % in NGEMD to as high as 60 % in ADV‐NGEMD at comparable capacities—without sacrificing their high‐capacity advantages.

查看原文本刊更多论文

NGEMD和ADV-NGEMD：一个统一的框架，用于高容量、高效和对抗安全的图像隐写

在图像隐写技术中，一种突出的技术是利用修改方向（EMD），它以最小的图像修改实现高效率而受到青睐。然而，这种效率是以低容量为代价的，这促使了许多基于EMD的方法的发展，这些方法主要侧重于增加有效载荷。然而，没有一个能同时提供高容量和最佳效率。为了解决这些缺点，我们引入了ngemd -下一代基于EMD的隐写框架，该框架仅根据像素组长度、最大每像素变化和可修改像素数量（n,z,k）来确定最佳提取系数和基数。通过推导递归关系来建立任意符号系统，并采用基于中国剩余定理的系统解决方案，ngmd最大限度地提高了容量和效率，同时显著降低了计算成本。由于传统的基于EMD的方法对现代隐写分析的固有弱点，我们进一步开发了ADV -NGEMD（对抗性-NGEMD）。我们提出了一种抵抗基于深度学习的隐写分析器（如YeNet）的方案，称为ADV‐NGEMD，通过将隐藏信息视为对抗向量，并根据梯度的相反符号应用变化，同时通过定制的成本函数控制修改。综合实验证实，NGEMD和ADV‐NGEMD都提供了卓越的性能，在保持视觉质量的同时实现了高有效载荷容量（高达2.5 bpp）（PSNR值高达58 dB， SSIM高于0.99），例如，在不牺牲其高容量优势的情况下，显著提高了误检测率（从NGEMD的4%提高到ADV‐NGEMD的60%）。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Visual Communication and Image Representation 工程技术-计算机：软件工程

CiteScore

5.40

自引率

11.50%

发文量

188

审稿时长

9.9 months

期刊介绍： The Journal of Visual Communication and Image Representation publishes papers on state-of-the-art visual communication and image representation, with emphasis on novel technologies and theoretical work in this multidisciplinary area of pure and applied research. The field of visual communication and image representation is considered in its broadest sense and covers both digital and analog aspects as well as processing and communication in biological visual systems.