Jun Sang , Xiaowen Chen , Wenhui Gong , Sergey Gorbachev , Shanjun Zhang
{"title":"LPCLNet: Leveraging local pixel-wise contrastive learning for image tampering localization","authors":"Jun Sang , Xiaowen Chen , Wenhui Gong , Sergey Gorbachev , Shanjun Zhang","doi":"10.1016/j.ins.2026.123205","DOIUrl":null,"url":null,"abstract":"<div><div>To address poor generalization caused by the scarcity of real samples in image tampering localization, this paper proposes a Local Pixel-level Contrastive Learning Network (LPCLNet). The main contributions are: (1) a contour patch-oriented contrastive learning mechanism that categorizes patches into tampered, authentic, and contour classes, applying pixel-level and patch-level contrastive losses alongside binary cross-entropy loss to leverage boundary information and reduce dependence on synthetic data; (2) an LPCLNet architecture that integrates a multi-scale feature fusion module and an Atrous Spatial Pyramid Pooling module to aggregate fine-grained features and embed contextual information for multi-scale representation of tampered regions; (3) a joint optimization strategy combining InfoNCE contrastive loss with binary cross-entropy loss to enhance feature discriminability and localization accuracy. Experiments on the Columbia, NIST16, CASIA v1, and Coverage datasets demonstrate that LPCLNet achieves comparable or superior performance to mainstream methods without requiring synthetic data pre-training. Specifically, it attains leading F1 scores of 0.529 and 0.369 on CASIA v1 and NIST16, respectively, as well as the highest average IoU of 0.500 and AUC of 0.830 across benchmarks, validating its stable and highly generalizable performance with limited real samples.</div></div>","PeriodicalId":51063,"journal":{"name":"Information Sciences","volume":"740 ","pages":"Article 123205"},"PeriodicalIF":6.8000,"publicationDate":"2026-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0020025526001362","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/2/6 0:00:00","PubModel":"Epub","JCR":"0","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
To address poor generalization caused by the scarcity of real samples in image tampering localization, this paper proposes a Local Pixel-level Contrastive Learning Network (LPCLNet). The main contributions are: (1) a contour patch-oriented contrastive learning mechanism that categorizes patches into tampered, authentic, and contour classes, applying pixel-level and patch-level contrastive losses alongside binary cross-entropy loss to leverage boundary information and reduce dependence on synthetic data; (2) an LPCLNet architecture that integrates a multi-scale feature fusion module and an Atrous Spatial Pyramid Pooling module to aggregate fine-grained features and embed contextual information for multi-scale representation of tampered regions; (3) a joint optimization strategy combining InfoNCE contrastive loss with binary cross-entropy loss to enhance feature discriminability and localization accuracy. Experiments on the Columbia, NIST16, CASIA v1, and Coverage datasets demonstrate that LPCLNet achieves comparable or superior performance to mainstream methods without requiring synthetic data pre-training. Specifically, it attains leading F1 scores of 0.529 and 0.369 on CASIA v1 and NIST16, respectively, as well as the highest average IoU of 0.500 and AUC of 0.830 across benchmarks, validating its stable and highly generalizable performance with limited real samples.
期刊介绍:
Informatics and Computer Science Intelligent Systems Applications is an esteemed international journal that focuses on publishing original and creative research findings in the field of information sciences. We also feature a limited number of timely tutorial and surveying contributions.
Our journal aims to cater to a diverse audience, including researchers, developers, managers, strategic planners, graduate students, and anyone interested in staying up-to-date with cutting-edge research in information science, knowledge engineering, and intelligent systems. While readers are expected to share a common interest in information science, they come from varying backgrounds such as engineering, mathematics, statistics, physics, computer science, cell biology, molecular biology, management science, cognitive science, neurobiology, behavioral sciences, and biochemistry.