Sheng-Hao Wu;Yuxiao Huang;Xingyu Wu;Liang Feng;Zhi-Hui Zhan;Kay Chen Tan
{"title":"Learning to Transfer for Evolutionary Multitasking","authors":"Sheng-Hao Wu;Yuxiao Huang;Xingyu Wu;Liang Feng;Zhi-Hui Zhan;Kay Chen Tan","doi":"10.1109/TCYB.2025.3561518","DOIUrl":"10.1109/TCYB.2025.3561518","url":null,"abstract":"Evolutionary multitasking (EMT) is an emerging approach for solving multitask optimization problems (MTOPs) and has garnered considerable research interest. The implicit EMT is a significant research branch that utilizes evolution operators to enable knowledge transfer (KT) between tasks. However, current approaches in implicit EMT face challenges in adaptability, due to the limited use of different evolution operators with different parameter settings and insufficient utilization of evolutionary states for performing KT. This results in suboptimal exploitation of implicit KT’s potential to tackle a variety of MTOPs. To overcome these limitations, we propose a novel learning-to-transfer (L2T) framework to automatically discover efficient KT policies for the MTOPs at hand. Our framework conceptualizes the KT process as a learning agent’s sequence of strategic decisions within the EMT process. We propose an action formulation for deciding when and how to transfer, a state representation with informative features of evolution states, a reward formulation concerning convergence and transfer efficiency gain, and the environment for the agent to interact with MTOPs. We employ an actor-critic network structure for the agent and learn the policy via proximal policy optimization. This learned agent can be integrated with various evolutionary algorithms, enhancing their ability to address unseen MTOPs. Comprehensive empirical studies on both synthetic and real-world MTOPs, encompassing diverse intertask relationships, function classes, and task distributions are conducted to validate the proposed L2T framework. The results show a marked improvement in the adaptability and performance of implicit EMT when solving a wide spectrum of unseen MTOPs.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 7","pages":"3342-3355"},"PeriodicalIF":9.4,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143889971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Shuxing Xuan;Hongjing Liang;Shihao Huang;Tieshan Li;Jiayue Sun
{"title":"Distributed Optimal Consensus Problem of Input Constrained Nonlinear Discrete-Time MASs: A Mode-Free Reinforcement Learning Approach","authors":"Shuxing Xuan;Hongjing Liang;Shihao Huang;Tieshan Li;Jiayue Sun","doi":"10.1109/TCYB.2025.3562390","DOIUrl":"10.1109/TCYB.2025.3562390","url":null,"abstract":"In this article, a model-free reinforcement learning (RL) approach is proposed for solving the optimal consensus control issue of nonlinear discrete-time multiagent systems with input constraint. To address the challenge of solving the coupled discrete Hamilton-Jacobi–Bellman (HJB) equation, a RL approach based on actor-critic framework is proposed for optimal consensus control. A well-defined cost function is designed, and the actor and critic networks are updated through online learning to obtain the optimal controllers. Furthermore, the actuator’s performance is often limited due to physical constraints. To address such actuator constraints, a gradual transition control (GTC) method is proposed, and update-free and update-weak policies are introduced to further optimize network performance. Additionally, in real-world distributed systems, the actor-critic networks deployed in each agent rely on data from neighboring agents, which necessitates addressing the issue of distributed synchronization. To address this challenge, the synchronization blocking method is designed, which designs additional control signals for each agent to handle these issues. Finally, two simulations under different scenarios are presented to verify the effectiveness of the proposed approach.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 6","pages":"2910-2923"},"PeriodicalIF":9.4,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143889970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Multicontroller-Based Fault-Tolerant Control for Uncertain High-Order Sub-Fully Actuated Systems","authors":"Mengtong Gong;Donghua Zhou;Li Sheng;Xiao He","doi":"10.1109/TCYB.2025.3560406","DOIUrl":"10.1109/TCYB.2025.3560406","url":null,"abstract":"This article proposes a novel multicontroller-based fault tolerance method to cope with a class of high-order sub-fully actuated systems (sub-FASs) with nonlinear uncertainties and actuator faults. As a promising control-oriented theory, the FAS approach is a convenient and powerful tool for nonlinear control. However, the stabilization of sub-FASs, whose input matrix function is not globally invertible, is more sophisticated and challenging due to the issue of control singularity. To address the global fault-tolerant stabilization of uncertain sub-FASs, a high-order nonlinear system model with both multiplicative and additive actuator faults is considered. By introducing the concepts of linear singular set and singularity function, the entire state space is analytically divided into several regions. Then, according to the initial states of system, three different control strategies are developed to overcome singularity and achieve global stabilization, including a FAS-based stabilizing control law, a singularity-avoid tracking control strategy, as well as a singularity-free switching control strategy. The closed-loop response of the faulty system is proven to be ultimately uniformly bounded in all cases, and the effectiveness of proposed method is illustrated through a numerical example.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 7","pages":"3525-3536"},"PeriodicalIF":9.4,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143884575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive Fault Tolerant Consensus Tracking Control for Flexible Manipulators MASs With Input Quantization and Time-Varying Delay","authors":"Wei Zhao;Xing Li;Yu Liu;Zhijun Li","doi":"10.1109/TCYB.2025.3560891","DOIUrl":"10.1109/TCYB.2025.3560891","url":null,"abstract":"This article mainly investigates the problem of vibration suppression and angle cooperative tracking control of a multiple flexible manipulators described by partial differential equations (PDEs) with input quantization, actuator failures, and unmodeled system dynamics. An intermediate control law is designed, and a smooth function with a positive integrable time-varying function is introduced. Besides, a new smooth function is constructed in the control law to handle the influence of quantization and actuator faults. Under the designed controller, the angles of all flexible manipulators can reach consensus through mutual communication, and the elastic deformation of each flexible manipulator can also be suppressed. Furthermore, the asymptotic stability of a closed-loop system is realized based on the Lyapunov function. Finally, numerical simulation validates the effectiveness of the method.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 6","pages":"2597-2607"},"PeriodicalIF":9.4,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143884574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Normal Forms of Linear Time-Varying Systems With Applications to Output-Feedback Stabilization and Tracking","authors":"Bin Zhou;Jiacheng Dong;Guangbin Cai","doi":"10.1109/TCYB.2025.3547895","DOIUrl":"10.1109/TCYB.2025.3547895","url":null,"abstract":"This article examines the normal form of a multiple-input-multiple-output (MIMO) linear time-varying (LTV) system. It explores the transformation of such a system into its normal form using an LTV transformation, if it possesses a general relative degree that may not be uniform (strict). Based on the obtained normal form, the internal and external dynamics as well as the inverse system of the LTV system are investigated. Additionally, the zero dynamics can be effectively parameterized. Moreover, by employing the derived LTV normal form, the proportional-derivative output-feedback stabilization and output tracking problems are resolved. Finally, an illustrative example is presented to showcase the effectiveness of the proposed controllers.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 6","pages":"2671-2683"},"PeriodicalIF":9.4,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143884572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fixed-Time Resilient Distributed NE Seeking Control for Nonlinear MASs Against DoS Attacks","authors":"Shihan Zhou;Chao Deng;Sha Fan;Bohui Wang","doi":"10.1109/TCYB.2025.3561161","DOIUrl":"10.1109/TCYB.2025.3561161","url":null,"abstract":"In this article, the fixed-time resilient distributed Nash equilibrium (NE) control problem is addressed for nonlinear multiagent systems (MASs) under denial-of-service (DoS) attacks. Unlike existing results on NE seeking in noncooperative games, this article first investigates the layered fixed-time resilient distributed NE control problem for nonlinear MASs against DoS attacks, independent of initial conditions. To address these challenges, the novel layered NE control strategy is proposed, comprising a resilient fixed-time distributed NE seeking algorithm layer, an improved fixed-time performance enhancement layer, and an adaptive controller design layer. Specifically, a fixed-time resilient distributed NE seeking algorithm is first designed to guarantee players actions to converge toward the NE against DoS attacks. Then, novel high-order fixed-time filters are proposed to generate improved actions with smooth characteristics and converge to the actions in the aforementioned fixed-time algorithm. Based on the developed filters, a decentralized fuzzy adaptive controller is developed to achieve bounded tracking within the fixed time using the backstepping technique. Finally, a numerical simulation is conducted to validate the efficacy of the developed method.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 7","pages":"3368-3378"},"PeriodicalIF":9.4,"publicationDate":"2025-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143884571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Event-Triggered Model-Free Adaptive Formation Constrained Control for Nonlinear Heterogeneous Multiagent Systems","authors":"Weiming Zhang;Dezhi Xu;Yujian Ye;Wei Hua;Bin Jiang","doi":"10.1109/TCYB.2025.3557383","DOIUrl":"10.1109/TCYB.2025.3557383","url":null,"abstract":"This article aims to address the formation control issue of the unknown nonaffine nonlinear heterogeneous multiagent system (MAS) considering formation tracking accuracy and computational cost. A novel dynamic prescribed boundary-based event-triggered mechanism is proposed first to flexibly adjust the emphasis on these two indicators, and applied to data modeling and controller design simultaneously to reduce their computational cost. On one hand, an observer-based pseudo gradient estimation algorithm is designed under event-triggered framework for model reconfiguration with only input/output data of system rather than mathematical dynamics. On the other hand, an event-triggered constrained control strategy is developed with several modules to cope with complex scenarios. Concretely, a data-driven anti-windup compensator is designed in case of input constraint, and an improved prescribed performance-based fractional order terminal sliding mode controller is explored for enhancement of the formation tracking accuracy and robustness of the controlled MAS with rigorous stability analysis. Both numerical simulation and hard-in-the-loop experiment on distributed energy storage systems are performed to attest the efficacy of the proposed formation control strategy.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 6","pages":"2806-2819"},"PeriodicalIF":9.4,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143876033","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Masoud Zare Shahabadi, Hajar Atrianfar, Hossein A. Abyaneh
{"title":"An Enhanced Detection Scheme and Distributed Resilient Asynchronous Event-Triggered Control of AC Microgrids Subject to Replay Attacks","authors":"Masoud Zare Shahabadi, Hajar Atrianfar, Hossein A. Abyaneh","doi":"10.1109/tcyb.2025.3559379","DOIUrl":"https://doi.org/10.1109/tcyb.2025.3559379","url":null,"abstract":"","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"4 1","pages":""},"PeriodicalIF":11.8,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143876035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"IKT-BT: Indirect Knowledge Transfer Behavior Tree Framework for Multirobot Systems Through Communication Eavesdropping","authors":"Sanjay Sarma Oruganti Venkata;Ramviyas Parasuraman;Ramana Pidaparti","doi":"10.1109/TCYB.2025.3560564","DOIUrl":"10.1109/TCYB.2025.3560564","url":null,"abstract":"Multiagent and multirobot systems (MRS) often rely on direct communication for information sharing. This work explores an alternative approach inspired by eavesdropping mechanisms in nature that involves casual observation of agent interactions to enhance decentralized knowledge dissemination. We achieve this through a novel indirect knowledge transfer through behavior trees (IKT-BT) framework tailored for a behavior-based MRS, encapsulating knowledge and control actions in behavior trees (BT). We present two new BT-based modalities—eavesdrop-update (EU) and eavesdrop-buffer-update (EBU)—incorporating unique eavesdropping strategies and efficient episodic memory management suited for resource-limited swarm robots. We theoretically analyze the IKT-BT framework for an MRS and validate the performance of the proposed modalities through extensive experiments simulating a search and rescue mission. Our results reveal improvements in both global mission performance outcomes and agent-level knowledge dissemination with a reduced need for direct communication.","PeriodicalId":13112,"journal":{"name":"IEEE Transactions on Cybernetics","volume":"55 6","pages":"2558-2570"},"PeriodicalIF":9.4,"publicationDate":"2025-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143876032","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}