Distributed Stackelberg Strategies in State-based Potential Games for Autonomous Decentralized Learning Manufacturing Systems

arXiv - CS - Multiagent Systems Pub Date : 2024-08-12 DOI:arxiv-2408.06397

Steve Yuwono, Dorothea Schwung, Andreas Schwung

{"title":"Distributed Stackelberg Strategies in State-based Potential Games for Autonomous Decentralized Learning Manufacturing Systems","authors":"Steve Yuwono, Dorothea Schwung, Andreas Schwung","doi":"arxiv-2408.06397","DOIUrl":null,"url":null,"abstract":"This article describes a novel game structure for autonomously optimizing\ndecentralized manufacturing systems with multi-objective optimization\nchallenges, namely Distributed Stackelberg Strategies in State-Based Potential\nGames (DS2-SbPG). DS2-SbPG integrates potential games and Stackelberg games,\nwhich improves the cooperative trade-off capabilities of potential games and\nthe multi-objective optimization handling by Stackelberg games. Notably, all\ntraining procedures remain conducted in a fully distributed manner. DS2-SbPG\noffers a promising solution to finding optimal trade-offs between objectives by\neliminating the complexities of setting up combined objective optimization\nfunctions for individual players in self-learning domains, particularly in\nreal-world industrial settings with diverse and numerous objectives between the\nsub-systems. We further prove that DS2-SbPG constitutes a dynamic potential\ngame that results in corresponding converge guarantees. Experimental validation\nconducted on a laboratory-scale testbed highlights the efficacy of DS2-SbPG and\nits two variants, such as DS2-SbPG for single-leader-follower and Stack\nDS2-SbPG for multi-leader-follower. The results show significant reductions in\npower consumption and improvements in overall performance, which signals the\npotential of DS2-SbPG in real-world applications.","PeriodicalId":501315,"journal":{"name":"arXiv - CS - Multiagent Systems","volume":"8 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Multiagent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.06397","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This article describes a novel game structure for autonomously optimizing decentralized manufacturing systems with multi-objective optimization challenges, namely Distributed Stackelberg Strategies in State-Based Potential Games (DS2-SbPG). DS2-SbPG integrates potential games and Stackelberg games, which improves the cooperative trade-off capabilities of potential games and the multi-objective optimization handling by Stackelberg games. Notably, all training procedures remain conducted in a fully distributed manner. DS2-SbPG offers a promising solution to finding optimal trade-offs between objectives by eliminating the complexities of setting up combined objective optimization functions for individual players in self-learning domains, particularly in real-world industrial settings with diverse and numerous objectives between the sub-systems. We further prove that DS2-SbPG constitutes a dynamic potential game that results in corresponding converge guarantees. Experimental validation conducted on a laboratory-scale testbed highlights the efficacy of DS2-SbPG and its two variants, such as DS2-SbPG for single-leader-follower and Stack DS2-SbPG for multi-leader-follower. The results show significant reductions in power consumption and improvements in overall performance, which signals the potential of DS2-SbPG in real-world applications.

查看原文本刊更多论文

自主分散学习制造系统基于状态的潜能博弈中的分布式斯塔克尔伯格策略

本文介绍了一种用于自主优化具有多目标优化挑战的分散式制造系统的新型博弈结构，即基于状态的潜在博弈中的分布式斯台克尔伯格策略（DS2-SbPG）。DS2-SbPG 整合了潜能博弈和斯台克尔伯格博弈，提高了潜能博弈的合作权衡能力和斯台克尔伯格博弈的多目标优化处理能力。值得注意的是，所有训练过程仍以完全分布式的方式进行。DS2-SbPG 为寻找目标间的最优权衡提供了一个很有前景的解决方案，它消除了在自学领域为单个参与者设置组合目标优化函数的复杂性，特别是在子系统间目标多样、数量众多的现实世界工业环境中。我们还进一步证明，DS2-SbPG 构成了一个动态势能博弈，能带来相应的收敛保证。在实验室规模的测试平台上进行的实验验证凸显了 DS2-SbPG 及其两个变体的功效，如用于单领导-追随者的 DS2-SbPG 和用于多领导-追随者的 StackDS2-SbPG。结果表明，功耗明显降低，整体性能显著提高，这预示着 DS2-SbPG 在实际应用中大有可为。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

arXiv - CS - Multiagent Systems

自引率

0.00%

发文量