{"title":"TMFNet: Two-Stream Multi-Channels Fusion Networks for Color Image Operation Chain Detection","authors":"Yakun Niu, Lei Tan, Lei Zhang, Xianyu Zuo","doi":"arxiv-2409.07701","DOIUrl":null,"url":null,"abstract":"Image operation chain detection techniques have gained increasing attention\nrecently in the field of multimedia forensics. However, existing detection\nmethods suffer from the generalization problem. Moreover, the channel\ncorrelation of color images that provides additional forensic evidence is often\nignored. To solve these issues, in this article, we propose a novel two-stream\nmulti-channels fusion networks for color image operation chain detection in\nwhich the spatial artifact stream and the noise residual stream are explored in\na complementary manner. Specifically, we first propose a novel deep residual\narchitecture without pooling in the spatial artifact stream for learning the\nglobal features representation of multi-channel correlation. Then, a set of\nfilters is designed to aggregate the correlation information of multi-channels\nwhile capturing the low-level features in the noise residual stream.\nSubsequently, the high-level features are extracted by the deep residual model.\nFinally, features from the two streams are fed into a fusion module, to\neffectively learn richer discriminative representations of the operation chain.\nExtensive experiments show that the proposed method achieves state-of-the-art\ngeneralization ability while maintaining robustness to JPEG compression. The\nsource code used in these experiments will be released at\nhttps://github.com/LeiTan-98/TMFNet.","PeriodicalId":501480,"journal":{"name":"arXiv - CS - Multimedia","volume":"44 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Multimedia","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07701","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Image operation chain detection techniques have gained increasing attention
recently in the field of multimedia forensics. However, existing detection
methods suffer from the generalization problem. Moreover, the channel
correlation of color images that provides additional forensic evidence is often
ignored. To solve these issues, in this article, we propose a novel two-stream
multi-channels fusion networks for color image operation chain detection in
which the spatial artifact stream and the noise residual stream are explored in
a complementary manner. Specifically, we first propose a novel deep residual
architecture without pooling in the spatial artifact stream for learning the
global features representation of multi-channel correlation. Then, a set of
filters is designed to aggregate the correlation information of multi-channels
while capturing the low-level features in the noise residual stream.
Subsequently, the high-level features are extracted by the deep residual model.
Finally, features from the two streams are fed into a fusion module, to
effectively learn richer discriminative representations of the operation chain.
Extensive experiments show that the proposed method achieves state-of-the-art
generalization ability while maintaining robustness to JPEG compression. The
source code used in these experiments will be released at
https://github.com/LeiTan-98/TMFNet.