Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models

IF 13.7

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society Pub Date : 2025-04-11 DOI:10.1109/TIP.2025.3558420

Aye Phyu Phyu Aung;Xinrun Wang;Ruiyu Wang;Hau Chan;Bo An;Xiaoli Li;J. Senthilnath

{"title":"Double Oracle Neural Architecture Search for Game Theoretic Deep Learning Models","authors":"Aye Phyu Phyu Aung;Xinrun Wang;Ruiyu Wang;Hau Chan;Bo An;Xiaoli Li;J. Senthilnath","doi":"10.1109/TIP.2025.3558420","DOIUrl":null,"url":null,"abstract":"In this paper, we propose a new approach to train deep learning models using game theory concepts including Generative Adversarial Networks (GANs) and Adversarial Training (AT) where we deploy a double-oracle framework using best response oracles. GAN is essentially a two-player zero-sum game between the generator and the discriminator. The same concept can be applied to AT with attacker and classifier as players. Training these models is challenging as a pure Nash equilibrium may not exist and even finding the mixed Nash equilibrium is difficult as training algorithms for both GAN and AT have a large-scale strategy space. Extending our preliminary model DO-GAN, we propose the methods to apply the double oracle framework concept to Adversarial Neural Architecture Search (NAS for GAN) and Adversarial Training (NAS for AT) algorithms. We first generalize the players’ strategies as the trained models of generator and discriminator from the best response oracles. We then compute the meta-strategies using a linear program. For scalability of the framework where multiple network models of best responses are stored in the memory, we prune the weakly-dominated players’ strategies to keep the oracles from becoming intractable. Finally, we conduct experiments on MNIST, CIFAR-10 and TinyImageNet for DONAS-GAN. We also evaluate the robustness under FGSM and PGD attacks on CIFAR-10, SVHN and TinyImageNet for DONAS-AT. We show that all our variants have significant improvements in both subjective qualitative evaluation and quantitative metrics, compared with their respective base architectures.","PeriodicalId":94032,"journal":{"name":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","volume":"34 ","pages":"2463-2472"},"PeriodicalIF":13.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on image processing : a publication of the IEEE Signal Processing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10964082/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this paper, we propose a new approach to train deep learning models using game theory concepts including Generative Adversarial Networks (GANs) and Adversarial Training (AT) where we deploy a double-oracle framework using best response oracles. GAN is essentially a two-player zero-sum game between the generator and the discriminator. The same concept can be applied to AT with attacker and classifier as players. Training these models is challenging as a pure Nash equilibrium may not exist and even finding the mixed Nash equilibrium is difficult as training algorithms for both GAN and AT have a large-scale strategy space. Extending our preliminary model DO-GAN, we propose the methods to apply the double oracle framework concept to Adversarial Neural Architecture Search (NAS for GAN) and Adversarial Training (NAS for AT) algorithms. We first generalize the players’ strategies as the trained models of generator and discriminator from the best response oracles. We then compute the meta-strategies using a linear program. For scalability of the framework where multiple network models of best responses are stored in the memory, we prune the weakly-dominated players’ strategies to keep the oracles from becoming intractable. Finally, we conduct experiments on MNIST, CIFAR-10 and TinyImageNet for DONAS-GAN. We also evaluate the robustness under FGSM and PGD attacks on CIFAR-10, SVHN and TinyImageNet for DONAS-AT. We show that all our variants have significant improvements in both subjective qualitative evaluation and quantitative metrics, compared with their respective base architectures.

查看原文本刊更多论文

博弈论深度学习模型的双Oracle神经结构搜索

在本文中，我们提出了一种使用博弈论概念（包括生成对抗网络（gan）和对抗训练（AT））来训练深度学习模型的新方法，其中我们使用最佳响应预言部署了双预言框架。GAN本质上是生成器和鉴别器之间的两方零和博弈。同样的概念可以应用于攻击者和分类器作为玩家的AT。训练这些模型具有挑战性，因为纯纳什均衡可能不存在，甚至很难找到混合纳什均衡，因为GAN和AT的训练算法都具有大规模的策略空间。扩展我们的初步模型DO-GAN，我们提出了将双oracle框架概念应用于对抗神经结构搜索（GAN）和对抗训练（AT）算法的方法。我们首先将参与者的策略推广为最佳对策预言的生成器和鉴别器的训练模型。然后，我们使用线性程序计算元策略。对于存储在内存中的多个最佳响应网络模型的框架的可扩展性，我们修剪弱支配玩家的策略，以防止预言变得难以处理。最后，我们在MNIST、CIFAR-10和TinyImageNet上对DONAS-GAN进行了实验。我们还评估了DONAS-AT在CIFAR-10、SVHN和TinyImageNet的FGSM和PGD攻击下的鲁棒性。我们表明，与各自的基础架构相比，我们所有的变体在主观定性评估和定量度量方面都有显著的改进。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

自引率

0.00%

发文量