Qiuxia Yang , Zhengpeng Zhao , Yuanyuan Pu , Shuyu Pan , Jinjing Gu , Dan Xu
{"title":"FNContra: Frequency-domain Negative Sample Mining in Contrastive Learning for limited-data image generation","authors":"Qiuxia Yang , Zhengpeng Zhao , Yuanyuan Pu , Shuyu Pan , Jinjing Gu , Dan Xu","doi":"10.1016/j.eswa.2024.125676","DOIUrl":null,"url":null,"abstract":"<div><div>Substantial training data is necessary to train an effective generative adversarial network(GANs), without which the discriminator is easily overfitting, causing the sub-optimal models. To solve these problems, this work explores the Frequency-domain Negative Sample Mining in Contrastive learning (FNContra) to improve data efficiency, which requires the discriminator to differentiate the definite relationships between the negative samples and real images. Concretely, this work first constructs multiple-level negative samples in the frequency domain and then proposes Discriminated Wavelet-instance Contrastive Learning (DWCL) and Generated Wavelet-prototype Contrastive Learning (GWCL). The former helps the discriminator learn the fine-grained texture features, and the latter impels the generated feature distribution to be close to real. Considering the learning difficulty of multi-level negative samples, this work proposes a dynamic weight driven by self-information, which ensures the resultant force is positive from the multi-level negative samples during the training. Finally, this work performs experiments on eleven datasets with different domains and resolutions. The quantitative and qualitative results demonstrate the superiority and effectiveness of the FNContra trained on limited data, and it indicates that FNContra can synthesize high-quality images. Notably, FNContra achieves the best FID scores on 10 out of 11 datasets, with improvements of 17.90 and 29.24 on Moongate and Shells, respectively, compared to the baseline. The code can be found at <span><span>https://github.com/YQX1996/FNContra</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"263 ","pages":"Article 125676"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424025430","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Substantial training data is necessary to train an effective generative adversarial network(GANs), without which the discriminator is easily overfitting, causing the sub-optimal models. To solve these problems, this work explores the Frequency-domain Negative Sample Mining in Contrastive learning (FNContra) to improve data efficiency, which requires the discriminator to differentiate the definite relationships between the negative samples and real images. Concretely, this work first constructs multiple-level negative samples in the frequency domain and then proposes Discriminated Wavelet-instance Contrastive Learning (DWCL) and Generated Wavelet-prototype Contrastive Learning (GWCL). The former helps the discriminator learn the fine-grained texture features, and the latter impels the generated feature distribution to be close to real. Considering the learning difficulty of multi-level negative samples, this work proposes a dynamic weight driven by self-information, which ensures the resultant force is positive from the multi-level negative samples during the training. Finally, this work performs experiments on eleven datasets with different domains and resolutions. The quantitative and qualitative results demonstrate the superiority and effectiveness of the FNContra trained on limited data, and it indicates that FNContra can synthesize high-quality images. Notably, FNContra achieves the best FID scores on 10 out of 11 datasets, with improvements of 17.90 and 29.24 on Moongate and Shells, respectively, compared to the baseline. The code can be found at https://github.com/YQX1996/FNContra.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.