{"title":"约束通信拓扑下多自治追随者的人类领导行为学习","authors":"Xiao-Xiao Zhang;Huai-Ning Wu;Jin-Liang Wang","doi":"10.1109/TSMC.2025.3560325","DOIUrl":null,"url":null,"abstract":"Owing to the immaturity of current artificial intelligence techniques, practical multiagent systems (MASs) often require supervision and intervention from humans. However, it is unrealistic for a human to monitor the entire MAS and provide appropriate input in some circumstances. A viable approach is to allow a human to control an agent as the leader which in turn influences the other autonomous followers. To this end, a critical issue is how to learn human behavior to improve the autonomy of followers for collaborating with human effectively, since the autonomous followers do not have prior knowledge of human behavior. In this article, the human leading behavior learning problem is studied for a class of human-in-the-loop (HiTL) MASs that are not fully connected. A linear quadratic differential game framework is applied to formulate the collaborative control problem in the HiTL MAS where the human behavior is represented as a cost function whose weighting matrix is unknown to the followers. In the HiTL MAS, we select a follower that has strong computing power called follower 1 to learn the human behavior via an online adaptive inverse differential game (IDG) approach. Based on concurrent learning (CL) technique, an adaptive law is developed for follower 1 to determine the human feedback matrix online, and at the same time the interaction strategies for the autonomous followers are also calculated by follower 1 in case of the constrained communication topology. Subsequently, the weighting matrix in the human cost function is recovered by addressing a linear matrix inequality (LMI) optimization problem. Finally, a numerical example is presented to demonstrate the effectiveness of the proposed method.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 7","pages":"4791-4803"},"PeriodicalIF":8.6000,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Human Leading Behavior Learning for Multiple Autonomous Followers Under Constrained Communication Topologies\",\"authors\":\"Xiao-Xiao Zhang;Huai-Ning Wu;Jin-Liang Wang\",\"doi\":\"10.1109/TSMC.2025.3560325\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Owing to the immaturity of current artificial intelligence techniques, practical multiagent systems (MASs) often require supervision and intervention from humans. However, it is unrealistic for a human to monitor the entire MAS and provide appropriate input in some circumstances. A viable approach is to allow a human to control an agent as the leader which in turn influences the other autonomous followers. To this end, a critical issue is how to learn human behavior to improve the autonomy of followers for collaborating with human effectively, since the autonomous followers do not have prior knowledge of human behavior. In this article, the human leading behavior learning problem is studied for a class of human-in-the-loop (HiTL) MASs that are not fully connected. A linear quadratic differential game framework is applied to formulate the collaborative control problem in the HiTL MAS where the human behavior is represented as a cost function whose weighting matrix is unknown to the followers. In the HiTL MAS, we select a follower that has strong computing power called follower 1 to learn the human behavior via an online adaptive inverse differential game (IDG) approach. Based on concurrent learning (CL) technique, an adaptive law is developed for follower 1 to determine the human feedback matrix online, and at the same time the interaction strategies for the autonomous followers are also calculated by follower 1 in case of the constrained communication topology. Subsequently, the weighting matrix in the human cost function is recovered by addressing a linear matrix inequality (LMI) optimization problem. Finally, a numerical example is presented to demonstrate the effectiveness of the proposed method.\",\"PeriodicalId\":48915,\"journal\":{\"name\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"volume\":\"55 7\",\"pages\":\"4791-4803\"},\"PeriodicalIF\":8.6000,\"publicationDate\":\"2025-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Systems Man Cybernetics-Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10976993/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Systems Man Cybernetics-Systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10976993/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Human Leading Behavior Learning for Multiple Autonomous Followers Under Constrained Communication Topologies
Owing to the immaturity of current artificial intelligence techniques, practical multiagent systems (MASs) often require supervision and intervention from humans. However, it is unrealistic for a human to monitor the entire MAS and provide appropriate input in some circumstances. A viable approach is to allow a human to control an agent as the leader which in turn influences the other autonomous followers. To this end, a critical issue is how to learn human behavior to improve the autonomy of followers for collaborating with human effectively, since the autonomous followers do not have prior knowledge of human behavior. In this article, the human leading behavior learning problem is studied for a class of human-in-the-loop (HiTL) MASs that are not fully connected. A linear quadratic differential game framework is applied to formulate the collaborative control problem in the HiTL MAS where the human behavior is represented as a cost function whose weighting matrix is unknown to the followers. In the HiTL MAS, we select a follower that has strong computing power called follower 1 to learn the human behavior via an online adaptive inverse differential game (IDG) approach. Based on concurrent learning (CL) technique, an adaptive law is developed for follower 1 to determine the human feedback matrix online, and at the same time the interaction strategies for the autonomous followers are also calculated by follower 1 in case of the constrained communication topology. Subsequently, the weighting matrix in the human cost function is recovered by addressing a linear matrix inequality (LMI) optimization problem. Finally, a numerical example is presented to demonstrate the effectiveness of the proposed method.
期刊介绍:
The IEEE Transactions on Systems, Man, and Cybernetics: Systems encompasses the fields of systems engineering, covering issue formulation, analysis, and modeling throughout the systems engineering lifecycle phases. It addresses decision-making, issue interpretation, systems management, processes, and various methods such as optimization, modeling, and simulation in the development and deployment of large systems.