Yufeng Shen, Xueling Ma, Yukun Bao, Gang Kou, Jianming Zhan
{"title":"基于强化学习的群体决策共识方法","authors":"Yufeng Shen, Xueling Ma, Yukun Bao, Gang Kou, Jianming Zhan","doi":"10.1016/j.ejor.2025.08.054","DOIUrl":null,"url":null,"abstract":"In group decision-making (GDM), the consensus-reaching process (CRP) is essential for aligning the diverse opinions of decision-makers (DMs) to achieve collective agreements. However, the process often faces obstacles due to the uncertainty of DMs in terms of unit cost and willingness to adjust opinions. To this end, this study constructs a new GDM framework by introducing reinforcement learning (RL) to the CRP. In this framework, we design a unit cost learning algorithm based on RL. The algorithm introduces an action space based on linguistic expressions, and therefore exhibits strong interpretability. On this basis, a weight reward–penalty mechanism based on asymmetric Nash bargaining is further proposed. The mechanism takes marginal and adjustment contributions as objective criteria, which provides a reasonable basis for improving consensus outcomes and managing non-cooperative behaviors. The proposed model incorporates both interactive and automatic strategies: the former is able to accurately capture individuals’ willingness to cooperate with the help of RL, and the latter relies on optimization models to effectively reduce the time and cost spent on reaching consensus. Finally, we provide an example to illustrate the proposed approach and experimentally verify its feasibility and the potential of the RL framework.","PeriodicalId":55161,"journal":{"name":"European Journal of Operational Research","volume":"39 1","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A consensus method based on reinforcement learning for group decision-making\",\"authors\":\"Yufeng Shen, Xueling Ma, Yukun Bao, Gang Kou, Jianming Zhan\",\"doi\":\"10.1016/j.ejor.2025.08.054\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In group decision-making (GDM), the consensus-reaching process (CRP) is essential for aligning the diverse opinions of decision-makers (DMs) to achieve collective agreements. However, the process often faces obstacles due to the uncertainty of DMs in terms of unit cost and willingness to adjust opinions. To this end, this study constructs a new GDM framework by introducing reinforcement learning (RL) to the CRP. In this framework, we design a unit cost learning algorithm based on RL. The algorithm introduces an action space based on linguistic expressions, and therefore exhibits strong interpretability. On this basis, a weight reward–penalty mechanism based on asymmetric Nash bargaining is further proposed. The mechanism takes marginal and adjustment contributions as objective criteria, which provides a reasonable basis for improving consensus outcomes and managing non-cooperative behaviors. The proposed model incorporates both interactive and automatic strategies: the former is able to accurately capture individuals’ willingness to cooperate with the help of RL, and the latter relies on optimization models to effectively reduce the time and cost spent on reaching consensus. Finally, we provide an example to illustrate the proposed approach and experimentally verify its feasibility and the potential of the RL framework.\",\"PeriodicalId\":55161,\"journal\":{\"name\":\"European Journal of Operational Research\",\"volume\":\"39 1\",\"pages\":\"\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Operational Research\",\"FirstCategoryId\":\"91\",\"ListUrlMain\":\"https://doi.org/10.1016/j.ejor.2025.08.054\",\"RegionNum\":2,\"RegionCategory\":\"管理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"OPERATIONS RESEARCH & MANAGEMENT SCIENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Operational Research","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1016/j.ejor.2025.08.054","RegionNum":2,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}
A consensus method based on reinforcement learning for group decision-making
In group decision-making (GDM), the consensus-reaching process (CRP) is essential for aligning the diverse opinions of decision-makers (DMs) to achieve collective agreements. However, the process often faces obstacles due to the uncertainty of DMs in terms of unit cost and willingness to adjust opinions. To this end, this study constructs a new GDM framework by introducing reinforcement learning (RL) to the CRP. In this framework, we design a unit cost learning algorithm based on RL. The algorithm introduces an action space based on linguistic expressions, and therefore exhibits strong interpretability. On this basis, a weight reward–penalty mechanism based on asymmetric Nash bargaining is further proposed. The mechanism takes marginal and adjustment contributions as objective criteria, which provides a reasonable basis for improving consensus outcomes and managing non-cooperative behaviors. The proposed model incorporates both interactive and automatic strategies: the former is able to accurately capture individuals’ willingness to cooperate with the help of RL, and the latter relies on optimization models to effectively reduce the time and cost spent on reaching consensus. Finally, we provide an example to illustrate the proposed approach and experimentally verify its feasibility and the potential of the RL framework.
期刊介绍:
The European Journal of Operational Research (EJOR) publishes high quality, original papers that contribute to the methodology of operational research (OR) and to the practice of decision making.