{"title":"粒度q学习自适应提高多智能体囚徒困境中的集体福利","authors":"Hsuan-Wei Lee , Yi-Ning Weng","doi":"10.1016/j.chaos.2025.116642","DOIUrl":null,"url":null,"abstract":"<div><div>Understanding how cooperation emerges and stabilizes in a difficult environment is a core challenge across biology, physics, and the social sciences. We present a reinforcement-learning framework for the Prisoner’s Dilemma Game between the two distinct agent types: Interactive Identity (II) and Interactive Diversity (ID). While II agents compress all neighbor interactions into one strategy update, ID agents assign one strategy to each neighbor, enabling finer-grained strategic adaptation. We systematically sweep dilemma strengths and analyze both homogeneous and heterogeneous network structures to show that ID agents persistently outcompete II agents at sustaining cooperation, especially for moderate temptations to defect. Moreover, in scenarios where agents can shift from II to ID based on relative payoffs, ID learning often invades populations of II learners, though influential hub nodes can impede this transition in heterogeneous networks. Spatiotemporal analyses indicate that ID agents form a strong cluster of cooperation, which prevents defection from spreading. Finally, extrapolating these results to wider moral dimensions, such as honesty, trust, and punishment, can give a rich understanding of how this granular, neighbor-specific learning raises collective welfare within both natural ecosystems and engineered multi-agent systems.</div></div>","PeriodicalId":9764,"journal":{"name":"Chaos Solitons & Fractals","volume":"199 ","pages":"Article 116642"},"PeriodicalIF":5.6000,"publicationDate":"2025-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Granular Q-learning adaptation boosts collective welfare in multi-agent Prisoner’s Dilemma\",\"authors\":\"Hsuan-Wei Lee , Yi-Ning Weng\",\"doi\":\"10.1016/j.chaos.2025.116642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Understanding how cooperation emerges and stabilizes in a difficult environment is a core challenge across biology, physics, and the social sciences. We present a reinforcement-learning framework for the Prisoner’s Dilemma Game between the two distinct agent types: Interactive Identity (II) and Interactive Diversity (ID). While II agents compress all neighbor interactions into one strategy update, ID agents assign one strategy to each neighbor, enabling finer-grained strategic adaptation. We systematically sweep dilemma strengths and analyze both homogeneous and heterogeneous network structures to show that ID agents persistently outcompete II agents at sustaining cooperation, especially for moderate temptations to defect. Moreover, in scenarios where agents can shift from II to ID based on relative payoffs, ID learning often invades populations of II learners, though influential hub nodes can impede this transition in heterogeneous networks. Spatiotemporal analyses indicate that ID agents form a strong cluster of cooperation, which prevents defection from spreading. Finally, extrapolating these results to wider moral dimensions, such as honesty, trust, and punishment, can give a rich understanding of how this granular, neighbor-specific learning raises collective welfare within both natural ecosystems and engineered multi-agent systems.</div></div>\",\"PeriodicalId\":9764,\"journal\":{\"name\":\"Chaos Solitons & Fractals\",\"volume\":\"199 \",\"pages\":\"Article 116642\"},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2025-06-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chaos Solitons & Fractals\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0960077925006551\",\"RegionNum\":1,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chaos Solitons & Fractals","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0960077925006551","RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MATHEMATICS, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Granular Q-learning adaptation boosts collective welfare in multi-agent Prisoner’s Dilemma
Understanding how cooperation emerges and stabilizes in a difficult environment is a core challenge across biology, physics, and the social sciences. We present a reinforcement-learning framework for the Prisoner’s Dilemma Game between the two distinct agent types: Interactive Identity (II) and Interactive Diversity (ID). While II agents compress all neighbor interactions into one strategy update, ID agents assign one strategy to each neighbor, enabling finer-grained strategic adaptation. We systematically sweep dilemma strengths and analyze both homogeneous and heterogeneous network structures to show that ID agents persistently outcompete II agents at sustaining cooperation, especially for moderate temptations to defect. Moreover, in scenarios where agents can shift from II to ID based on relative payoffs, ID learning often invades populations of II learners, though influential hub nodes can impede this transition in heterogeneous networks. Spatiotemporal analyses indicate that ID agents form a strong cluster of cooperation, which prevents defection from spreading. Finally, extrapolating these results to wider moral dimensions, such as honesty, trust, and punishment, can give a rich understanding of how this granular, neighbor-specific learning raises collective welfare within both natural ecosystems and engineered multi-agent systems.
期刊介绍:
Chaos, Solitons & Fractals strives to establish itself as a premier journal in the interdisciplinary realm of Nonlinear Science, Non-equilibrium, and Complex Phenomena. It welcomes submissions covering a broad spectrum of topics within this field, including dynamics, non-equilibrium processes in physics, chemistry, and geophysics, complex matter and networks, mathematical models, computational biology, applications to quantum and mesoscopic phenomena, fluctuations and random processes, self-organization, and social phenomena.