{"title":"Hierarchical Containment Control With Bipartite Cluster Consensus for Heterogeneous Multiagent Systems Under Layer-Signed Digraph","authors":"Dazhong Ma;Jingshu Sang;Lei Liu;Zhanshan Wang","doi":"10.1109/TCYB.2024.3506986","DOIUrl":"10.1109/TCYB.2024.3506986","url":null,"abstract":"This article considers the hierarchical containment control (HCC) for flexible mirrored collaboration, which accommodates the bipartite cluster consensus behavior in two symmetric convex hulls formed by multiple leaders. First, to achieve the mirrored collaboration in symmetric convex hulls, the layer-signed digraph is generated by involving the antagonistic interaction. Benefiting from the hierarchical structure, the antagonistic interaction in the assistant-layer replaces the assumption of in-degree balance for the existing cluster consensus issues. Second, the existing types of control protocols and the framework of cooperative output regulation limit the achievement of the studied hierarchical mirrored collaboration. To solve this problem, the hierarchical cooperative output regulation is extended based on the formulated hierarchical mirrored collaborative errors. Third, the layer-signal compensator is designed estimating the states of leaders as well as guaranteeing the convergence of collaborative behaviors. Combining with the designed layer-signal compensator, a novel HCC protocol is proposed so that the bipartite cluster consensus behavior can be achieved simultaneously in two symmetric convex hulls. Finally, theoretical results are verified by performing the numerical simulation.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"765-775"},"PeriodicalIF":9.4,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142961511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"FingHV: Efficient Sharing and Fine-Grained Scheduling of Virtualized HPU Resources","authors":"Hui Wang;Zhiwen Yu;Zhuoli Ren;Yao Zhang;Jiaqi Liu;Liang Wang;Bin Guo","doi":"10.1109/TCYB.2024.3518569","DOIUrl":"10.1109/TCYB.2024.3518569","url":null,"abstract":"While artificial intelligence (AI) technology has advanced in real-world applications, there is a strong motivation to develop hybrid systems where AI algorithms and humans collaborate, promoting more human-centered approaches in AI system design. This has led to the emergence of a novel human-machine computing (HMC) paradigm, which combines human cognitive abilities with machine computational power to create a collaborative computing framework that meets the demands of large-scale, complex tasks and enables human-machine symbiosis. Human processing units (HPUs) are crucial computing resources in HMC-oriented systems, and efficient HPU resource provisioning is key to boosting system performance. However, existing schemes often fail to assign tasks to the most suitable HPUs and optimize HPU utility, as they either cannot quantitatively measure skills or overlook utility concerns during task assignment and scheduling. To address these challenges, this article proposes a fine-grained HPU virtualization (FingHV) approach, which leverages virtualization techniques to improve flexibility, fairness, and utility in the provisioning process. The core idea is to use a tree-based skill model to precisely measure the levels and correlations of multiple skills within individual HPUs, and to apply a mixed time/event-based scheduling policy to maximize HPU utility. Specifically, we begin by proposing a hierarchical multiskill tree to model HPU skills and their correlations. Next, we formulate the HPU virtualization problem and present a fine-grained virtualization method, which includes a quality-driven HPU assignment process and a mixed time/event-based scheduling policy to improve resource-sharing efficiency. Finally, we evaluate FingHV on a synthetic dataset with varying task sizes and a real-world case. The results demonstrate that FingHV improves global matching quality by up to 39.7% and increases HPU utility by 11.2% compared to the baselines.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"600-614"},"PeriodicalIF":9.4,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142961510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Auxiliary Task-Based Deep Reinforcement Learning for Quantum Control","authors":"Shumin Zhou;Hailan Ma;Sen Kuang;Daoyi Dong","doi":"10.1109/TCYB.2024.3521300","DOIUrl":"10.1109/TCYB.2024.3521300","url":null,"abstract":"Due to its property of not requiring prior knowledge of the environment, reinforcement learning (RL) has significant potential for solving quantum control problems. In this work, we investigate the effectiveness of continuous control policies based on deep deterministic policy gradient. To achieve good control of quantum systems with high fidelity, we propose an auxiliary task-based deep RL (AT-DRL) for quantum control. In particular, we design an auxiliary task to predict the fidelity value, sharing partial parameters with the main network (from the main RL task). The auxiliary task learns synchronously with the main task, allowing one to extract intrinsic features of the environment, thus aiding the agent to achieve the desired state with high fidelity. To further enhance the control performance, we also design a guided reward function based on the fidelity of quantum states that enables gradual fidelity improvement. Numerical simulations demonstrate that the proposed AT-DRL can provide a good solution to the exploration of quantum dynamics. It not only achieves high task fidelities but also demonstrates fast learning rates. Moreover, AT-DRL has great potential in designing control pulses that achieve effective quantum state preparation.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"712-725"},"PeriodicalIF":9.4,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142936235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Target–Attackers–Defenders Linear–Quadratic Exponential Stochastic Differential Games With Distributed Control","authors":"Guilu Li;Jianan Wang;Fuxiang Liu;Fang Deng","doi":"10.1109/TCYB.2024.3508694","DOIUrl":"10.1109/TCYB.2024.3508694","url":null,"abstract":"This article investigates stochastic differential games involving multiple attackers, defenders, and a single target, with their interactions defined by a distributed topology. By leveraging principles of topological graph theory, a distributed design strategy is developed that operates without requiring global information, thereby minimizing system coupling. Additionally, this study extends the analysis to incorporate stochastic elements into the target-attackers–defenders games, moving beyond the scope of deterministic differential games. Using the direct method of completing the square and the Radon-Nikodym derivative, we derive optimal distributed control strategies for two scenarios: one where the target follows a predefined trajectory and another where it has free maneuverability. In both scenarios, our research demonstrates the effectiveness of the designed control strategies in driving the system toward a Nash equilibrium. Notably, our algorithm eliminates the need to solve the coupled Hamilton-Jacobi equation, significantly reducing computational complexity. To validate the effectiveness of the proposed control strategies, numerical simulations are presented in this article.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"574-587"},"PeriodicalIF":9.4,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142934510","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Juan Shi;Chen Chu;Guoxi Fan;Die Hu;Jinzhuo Liu;Zhen Wang;Shuyue Hu
{"title":"Payoff Control in Multichannel Games: Influencing Opponent Learning Evolution","authors":"Juan Shi;Chen Chu;Guoxi Fan;Die Hu;Jinzhuo Liu;Zhen Wang;Shuyue Hu","doi":"10.1109/TCYB.2024.3507830","DOIUrl":"10.1109/TCYB.2024.3507830","url":null,"abstract":"In this article, we introduce a new theory for payoff control in multichannel learning environments, where agents interact with each other over multiple channels and each channel is a repeated normal form game. We propose two payoff control strategies—partial control and full control—that allow a single agent to set an upper bound to the opponent’s expected payoffs summed across all channels, even if the opponent is a reinforcement learning agent. We prove that a partial (or full) control strategy can be obtained by solving a system of inequalities, and characterize the conditions under which such a partial (or full) control strategy exists. We show that by utilizing these control strategies, the agent can influence the opponent’s learning evolution and direct it toward a desired viable equilibrium. Our experiments confirm the effectiveness of our theory for payoff control in a wide range of multichannel learning environments.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"776-785"},"PeriodicalIF":9.4,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142924641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fei Teng;Xin Zhang;Tieshan Li;Qihe Shan;C. L. Philip Chen;Yushuai Li
{"title":"Distributed Resilient Energy Management for Seaport Microgrid Against Stealthy Attacks With Limited Security Defense Resource","authors":"Fei Teng;Xin Zhang;Tieshan Li;Qihe Shan;C. L. Philip Chen;Yushuai Li","doi":"10.1109/TCYB.2024.3514693","DOIUrl":"10.1109/TCYB.2024.3514693","url":null,"abstract":"This article investigates the distributed resilient energy management (EM) strategy for the seaport microgrid under stealthy attacks. First, based on an analysis of seaport microgrid characteristics, we construct an EM model that aims at minimizing both operating cost and security defense resource (SDR) cost. Second, we present a distributed, resilient strategy by defining node security levels and establishing dynamic security intervals. We prove that the gap between the obtained feasible solution and the optimal one is bounded. The designed strategy is capable of tolerating the effect of the unlimited number of stealthy attacked nodes on the seaport microgrid. In addition, given the limited SDRs for the resilient EM of the island seaport microgrid, a distributed mechanism for searching the minimum security connected dominating set (MSCDS) is proposed to minimize the size of trusted nodes. Finally, simulation results demonstrate the effectiveness of the proposed strategy. Note to Practitioners: This article addresses the vulnerability of the seaport microgrid, a critical issue that impacts EM and disrupts seaport operations. Current approaches to seaport EM do not account for potential attacks. Meanwhile, existing methods for attack resilience often overlook the costs of security resources. We propose a new approach for the distributed and resilient EM of the island seaport microgrid. Secure operation is achieved by protecting the fewest trusted nodes, thereby conserving SDRs. We then show how this algorithm (searching the MSCDS) can be efficiently designed. Preliminary simulations indicate its feasibility, though it has yet to be tested in a production environment. Future research will focus on designing trusted nodes within dynamic topologies.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"917-926"},"PeriodicalIF":9.4,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142924588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ADG-Net: A Sim2Real Multimodal Learning Framework for Adaptive Dexterous Grasping","authors":"Hui Zhang;Jianzhi Lyu;Chuangchuang Zhou;Hongzhuo Liang;Yuyang Tu;Fuchun Sun;Jianwei Zhang","doi":"10.1109/TCYB.2024.3518975","DOIUrl":"10.1109/TCYB.2024.3518975","url":null,"abstract":"In this article, a novel simulation-to-real (sim2real) multimodal learning framework is proposed for adaptive dexterous grasping and grasp status prediction. A two-stage approach is built upon the Isaac Gym and several proposed pluggable modules, which can effectively simulate dexterous grasps with multimodal sensing data, including RGB-D images of grasping scenarios, joint angles, 3-D tactile forces of soft fingertips, etc. Over 500K multimodal synthetic grasping scenarios are collected for neural network training. An adaptive dexterous grasping neural network (ADG-Net) is trained to learn dexterous grasp principles and predict grasp parameters, employing an attention mechanism and a graph convolutional neural network module to fuse multimodal information. The proposed adaptive dexterous grasping method can detect feasible grasp parameters from an RGB-D image of a grasp scene and then optimize grasp parameters based on multimodal sensing data when the dexterous hand touches a target object. Various experiments in both simulation and physical grasps indicate that our ADG-Net grasping method outperforms state-of-the-art grasping methods, achieving an average success rate of 92% for grasping isolated unseen objects and 83% for stacked objects. Code and video demos are available at <uri>https://github.com/huikul/adgnet</uri>.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"840-853"},"PeriodicalIF":9.4,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142924589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive Descriptor Sliding-Mode Observer-Based Dynamic Event-Triggered Consensus of Multiagent Systems Against Actuator and Sensor Faults","authors":"Zhengyu Ye;Bin Jiang;Ziquan Yu;Yuehua Cheng","doi":"10.1109/TCYB.2024.3519593","DOIUrl":"10.1109/TCYB.2024.3519593","url":null,"abstract":"Actuator and sensor faults are among the most common factors affecting the stability of multiagent systems (MASs). This article proposes a dynamic event-triggered fault-tolerant control (FTC) algorithm based on descriptor sliding-mode observers to address actuator and sensor faults in MASs. First, the MAS dynamics are reformulated into a descriptor form, enabling an observer to simultaneously achieve state estimation and fault diagnosis. Using the estimation results, an adaptive FTC algorithm is developed to maintain the stability of MASs in the presence of concurrent faults, with control gains updated based on the observer consensus error. A dynamic event-triggered mechanism is incorporated to manage data transmission and update neighboring agents’ information for the controller, thereby reducing communication overhead. Finally, a numerical simulation involving multiple quadrotors is conducted to validate the effectiveness of the proposed method.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 2","pages":"672-683"},"PeriodicalIF":9.4,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142924590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Global Dynamic Double Side Event-Triggered Adaptive Control for Interconnected Nonlinear Systems via Intermittent Output Feedback","authors":"Hao Li, Changchun Hua, Kuo Li","doi":"10.1109/tcyb.2024.3518773","DOIUrl":"https://doi.org/10.1109/tcyb.2024.3518773","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"20 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142911789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}