LPCLNet: Leveraging local pixel-wise contrastive learning for image tampering localization

IF 6.8 1区计算机科学 0 COMPUTER SCIENCE, INFORMATION SYSTEMS

Information Sciences Pub Date : 2026-06-05 Epub Date: 2026-02-06 DOI:10.1016/j.ins.2026.123205

Jun Sang , Xiaowen Chen , Wenhui Gong , Sergey Gorbachev , Shanjun Zhang

{"title":"LPCLNet: Leveraging local pixel-wise contrastive learning for image tampering localization","authors":"Jun Sang , Xiaowen Chen , Wenhui Gong , Sergey Gorbachev , Shanjun Zhang","doi":"10.1016/j.ins.2026.123205","DOIUrl":null,"url":null,"abstract":"<div><div>To address poor generalization caused by the scarcity of real samples in image tampering localization, this paper proposes a Local Pixel-level Contrastive Learning Network (LPCLNet). The main contributions are: (1) a contour patch-oriented contrastive learning mechanism that categorizes patches into tampered, authentic, and contour classes, applying pixel-level and patch-level contrastive losses alongside binary cross-entropy loss to leverage boundary information and reduce dependence on synthetic data; (2) an LPCLNet architecture that integrates a multi-scale feature fusion module and an Atrous Spatial Pyramid Pooling module to aggregate fine-grained features and embed contextual information for multi-scale representation of tampered regions; (3) a joint optimization strategy combining InfoNCE contrastive loss with binary cross-entropy loss to enhance feature discriminability and localization accuracy. Experiments on the Columbia, NIST16, CASIA v1, and Coverage datasets demonstrate that LPCLNet achieves comparable or superior performance to mainstream methods without requiring synthetic data pre-training. Specifically, it attains leading F1 scores of 0.529 and 0.369 on CASIA v1 and NIST16, respectively, as well as the highest average IoU of 0.500 and AUC of 0.830 across benchmarks, validating its stable and highly generalizable performance with limited real samples.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"740 ","pages":"Article 123205"},"PeriodicalIF":6.8000,"publicationDate":"2026-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025526001362","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/6 0:00:00","PubModel":"Epub","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

To address poor generalization caused by the scarcity of real samples in image tampering localization, this paper proposes a Local Pixel-level Contrastive Learning Network (LPCLNet). The main contributions are: (1) a contour patch-oriented contrastive learning mechanism that categorizes patches into tampered, authentic, and contour classes, applying pixel-level and patch-level contrastive losses alongside binary cross-entropy loss to leverage boundary information and reduce dependence on synthetic data; (2) an LPCLNet architecture that integrates a multi-scale feature fusion module and an Atrous Spatial Pyramid Pooling module to aggregate fine-grained features and embed contextual information for multi-scale representation of tampered regions; (3) a joint optimization strategy combining InfoNCE contrastive loss with binary cross-entropy loss to enhance feature discriminability and localization accuracy. Experiments on the Columbia, NIST16, CASIA v1, and Coverage datasets demonstrate that LPCLNet achieves comparable or superior performance to mainstream methods without requiring synthetic data pre-training. Specifically, it attains leading F1 scores of 0.529 and 0.369 on CASIA v1 and NIST16, respectively, as well as the highest average IoU of 0.500 and AUC of 0.830 across benchmarks, validating its stable and highly generalizable performance with limited real samples.

查看原文本刊更多论文

lclnet：利用局部逐像素对比学习实现图像篡改定位

为了解决图像篡改定位中由于真实样本的稀缺性而导致的泛化差的问题，本文提出了一种局部像素级对比学习网络（Local Pixel-level contrast Learning Network, LPCLNet）。主要贡献有：(1)一种面向轮廓斑块的对比学习机制，该机制将斑块分为篡改类、真实类和轮廓类，利用像素级和斑块级对比损失以及二元交叉熵损失来利用边界信息，减少对合成数据的依赖；(2)集成多尺度特征融合模块和空间金字塔池模块的LPCLNet体系结构，聚合细粒度特征并嵌入上下文信息，实现篡改区域的多尺度表示；(3)结合InfoNCE对比损失和二值交叉熵损失的联合优化策略，提高特征的可分辨性和定位精度。在Columbia、NIST16、CASIA v1和Coverage数据集上的实验表明，LPCLNet在不需要合成数据预训练的情况下实现了与主流方法相当或更好的性能。具体来说，它在CASIA v1和NIST16上分别获得了0.529和0.369的领先F1分数，以及最高的平均IoU 0.500和AUC 0.830，在有限的真实样本下验证了其稳定和高度泛化的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information Sciences 工程技术-计算机：信息系统

CiteScore

14.00

自引率

17.30%

发文量

1322

审稿时长

10.4 months

期刊介绍： Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions. Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.