基于随机决策规则的生成对抗网络新范式。

IF 1.2 3区数学 Q2 STATISTICS & PROBABILITY

Statistica Sinica Pub Date : 2025-04-01 DOI:10.5705/ss.202022.0404

Sehwan Kim, Qifan Song, Faming Liang

{"title":"基于随机决策规则的生成对抗网络新范式。","authors":"Sehwan Kim, Qifan Song, Faming Liang","doi":"10.5705/ss.202022.0404","DOIUrl":null,"url":null,"abstract":"The Generative Adversarial Network (GAN) was recently introduced in the literature as a novel machine learning method for training generative models. It has many applications in statistics such as nonparametric clustering and nonparametric conditional independence tests. However, training the GAN is notoriously difficult due to the issue of mode collapse, which refers to the lack of diversity among generated data. In this paper, we identify the reasons why the GAN suffers from this issue, and to address it, we propose a new formulation for the GAN based on randomized decision rules. In the new formulation, the discriminator converges to a fixed point while the generator converges to a distribution at the Nash equilibrium. We propose to train the GAN by an empirical Bayes-like method by treating the discriminator as a hyper-parameter of the posterior distribution of the generator. Specifically, we simulate generators from its posterior distribution conditioned on the discriminator using a stochastic gradient Markov chain Monte Carlo (MCMC) algorithm, and update the discriminator using stochastic gradient descent along with simulations of the generators. We establish convergence of the proposed method to the Nash equilibrium. Apart from image generation, we apply the proposed method to nonparametric clustering and nonparametric conditional independence tests. A portion of the numerical results is presented in the supplementary material.","PeriodicalId":49478,"journal":{"name":"Statistica Sinica","volume":"35 2","pages":"897-918"},"PeriodicalIF":1.2000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12017776/pdf/","citationCount":"0","resultStr":"{\"title\":\"A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules.\",\"authors\":\"Sehwan Kim, Qifan Song, Faming Liang\",\"doi\":\"10.5705/ss.202022.0404\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The Generative Adversarial Network (GAN) was recently introduced in the literature as a novel machine learning method for training generative models. It has many applications in statistics such as nonparametric clustering and nonparametric conditional independence tests. However, training the GAN is notoriously difficult due to the issue of mode collapse, which refers to the lack of diversity among generated data. In this paper, we identify the reasons why the GAN suffers from this issue, and to address it, we propose a new formulation for the GAN based on randomized decision rules. In the new formulation, the discriminator converges to a fixed point while the generator converges to a distribution at the Nash equilibrium. We propose to train the GAN by an empirical Bayes-like method by treating the discriminator as a hyper-parameter of the posterior distribution of the generator. Specifically, we simulate generators from its posterior distribution conditioned on the discriminator using a stochastic gradient Markov chain Monte Carlo (MCMC) algorithm, and update the discriminator using stochastic gradient descent along with simulations of the generators. We establish convergence of the proposed method to the Nash equilibrium. Apart from image generation, we apply the proposed method to nonparametric clustering and nonparametric conditional independence tests. A portion of the numerical results is presented in the supplementary material.\",\"PeriodicalId\":49478,\"journal\":{\"name\":\"Statistica Sinica\",\"volume\":\"35 2\",\"pages\":\"897-918\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2025-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12017776/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Statistica Sinica\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.5705/ss.202022.0404\",\"RegionNum\":3,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistica Sinica","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.5705/ss.202022.0404","RegionNum":3,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}

引用次数: 0

摘要

生成对抗网络（GAN）作为一种训练生成模型的新型机器学习方法最近在文献中被介绍。它在非参数聚类和非参数条件独立检验等统计学中有广泛的应用。然而，由于模式崩溃的问题，训练GAN是出了名的困难，模式崩溃是指生成的数据之间缺乏多样性。在本文中，我们确定了GAN遭受此问题的原因，并提出了一种基于随机决策规则的GAN新公式。在新公式中，鉴别器收敛到一个不动点，而发生器收敛到纳什平衡点的一个分布。我们建议通过经验贝叶斯方法训练GAN，将鉴别器视为生成器后验分布的超参数。具体来说，我们使用随机梯度马尔可夫链蒙特卡罗（MCMC）算法从判别器条件下的后验分布模拟生成器，并使用随机梯度下降随着生成器的模拟更新鉴别器。并证明了该方法对纳什均衡的收敛性。除了图像生成之外，我们还将该方法应用于非参数聚类和非参数条件独立性检验。部分数值结果载于补充资料中。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules.

The Generative Adversarial Network (GAN) was recently introduced in the literature as a novel machine learning method for training generative models. It has many applications in statistics such as nonparametric clustering and nonparametric conditional independence tests. However, training the GAN is notoriously difficult due to the issue of mode collapse, which refers to the lack of diversity among generated data. In this paper, we identify the reasons why the GAN suffers from this issue, and to address it, we propose a new formulation for the GAN based on randomized decision rules. In the new formulation, the discriminator converges to a fixed point while the generator converges to a distribution at the Nash equilibrium. We propose to train the GAN by an empirical Bayes-like method by treating the discriminator as a hyper-parameter of the posterior distribution of the generator. Specifically, we simulate generators from its posterior distribution conditioned on the discriminator using a stochastic gradient Markov chain Monte Carlo (MCMC) algorithm, and update the discriminator using stochastic gradient descent along with simulations of the generators. We establish convergence of the proposed method to the Nash equilibrium. Apart from image generation, we apply the proposed method to nonparametric clustering and nonparametric conditional independence tests. A portion of the numerical results is presented in the supplementary material.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Statistica Sinica 数学-统计学与概率论

CiteScore

2.10

自引率

0.00%

发文量

审稿时长

10.5 months

期刊介绍： Statistica Sinica aims to meet the needs of statisticians in a rapidly changing world. It provides a forum for the publication of innovative work of high quality in all areas of statistics, including theory, methodology and applications. The journal encourages the development and principled use of statistical methodology that is relevant for society, science and technology.