{"title":"Toward Universal Controller: Performance-Aware Self-Optimizing Reinforcement Learning for Discrete-Time Systems With Uncontrollable Factors","authors":"Jianfeng Zhang;Haoran Zhang;Chunhui Zhao","doi":"10.1109/TSMC.2025.3539349","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3539349","url":null,"abstract":"The industrial system usually contains not only controllable variables (CVs) but also uncontrollable variables (unCVs), e.g., weather conditions and friction. These unCVs have a direct impact on system control performance. Despite the success of current deep reinforcement learning (DRL) control algorithms, most of them neglect the impact of unCVs, which can cause the deterioration of control performance and instability of the system. To perceive and eliminate the impact of unCVs, a performance-aware self-optimizing universal controller (PASOUC) is designed in this article. The PASOUC aims at integrating the representation of unCVs and controller design to perceive and eliminate the impact of unCVs under different conditions, which goes beyond most existing control methods. Technically, a historical trajectory-inspired control performance perceptron is developed to perceive the impact of unCVs on system control performance under different conditions. Subsequently, a new performance-aware reward is designed to integrate the representation of unCVs and controller design while training the DRL controller. In addition, the domain randomization (DR) training strategy is employed to learn a universal control policy, which can access the approximate optimal trajectory under nonideal conditions. In this way, the impact of unCVs can be eliminated. To handle the low efficiency of the DR training, the policy improvement-policy proximal optimization (PI-PPO) is proposed to enhance the convergence speed of the DR training by performing explicit policy improvement. Finally, illustrative examples are presented to demonstrate the superiority of the proposed method.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3249-3260"},"PeriodicalIF":8.6,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143839827","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Adaptive Fuzzy Distributed Optimal Control for Nonlinear Multiagent Systems-Based Multiplayer Differential Graphical Game","authors":"Wei Wu;Yi Zuo;Shaocheng Tong","doi":"10.1109/TSMC.2025.3539998","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3539998","url":null,"abstract":"This article proposes a new adaptive fuzzy distributed output-feedback optimal control methodology for high-order strict-feedback nonlinear multiagent systems (MASs), which is composed of a fuzzy feedforward output-feedback distributed controller and a fuzzy error feedback correction distributed optimal controller. The fuzzy feedforward distributed controller is designed by the backstepping design technique and a fuzzy state observer, which can solve the unmeasured-state problem of the agents and transform the high-order strict-feedback nonlinear MASs into linearizable feedback nonlinear MASs, while the fuzzy error feedback correction distributed controller is designed for the error nonlinear MASs based on adaptive dynamic programming and a multiplayer differential graphical game, which can find a Nash equilibrium solution of the multiplayer differential graphical game. It is proved that the formulated adaptive distributed fuzzy optimal control scheme can achieve the global optimal control objective and make the controlled MASs stable, with the followers' outputs following the leaders' outputs. The simulation results on ship autopilot systems demonstrate its effectiveness.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3202-3212"},"PeriodicalIF":8.6,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143839940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Neural Network for Distributed Collaboration of Multiple Manipulators With Switching Topologies: A Game-Theoretic Perspective","authors":"Mei Liu;Wenxin Mu;Xin Lv;Zhongbo Sun;Long Jin","doi":"10.1109/TSMC.2025.3540403","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3540403","url":null,"abstract":"Since distributed control strategies can effectively reduce the operating load of the central processor, they have become a prominent research direction in the field of controlling multiple manipulators. However, existing distributed approaches predominantly rely on fixed topologies, overlooking the dynamic nature of task requirements. To address this limitation, this article proposes a distributed collaboration scheme that incorporates switching topologies. The distributed control problem is further formulated as a game-theoretic framework involving multiple manipulators. A dynamic neural network solver is then developed to approximate the optimal Nash equilibrium strategy, with its convergence and stability proven through theoretical analysis. The effectiveness of the proposed scheme and solver is validated through a series of physical experiments.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3213-3221"},"PeriodicalIF":8.6,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143839938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sampled-Data Consensus for Multiagent Systems With Open Topology and Packet Loss","authors":"Haitao Zhu;Jianquan Lu;Yang Liu;Lingzhong Zhang","doi":"10.1109/TSMC.2025.3538930","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3538930","url":null,"abstract":"This article studies a more realistic issue in multiagent systems (MASs) named open topology, where the network scale tends to be variable as agents may join, leave, or be replaced. In order to reveal the influence that the variation of network scale brings on the dynamic behavior of MASs, a novel transition process containing impulse is provided. Then by applying the Halanay-like inequality, the distributed sampled-data control is designed to ensure the consensus property of the MASs under open topology. It is shown that the derived results can degenerate into the consensus criteria for general MASs with fixed-scale topology. Also, the potential packet loss phenomenon existing in the controller is considered, which is integrated into the consensus performance via the average packet loss interval (APLI) condition, thereby guaranteeing the robustness of consensus performance against packet loss. Moreover, within the framework of open topology, it is proved that the admissible packet loss for MASs can be estimated in an average sense. Finally, the applicability of the results is demonstrated through a numerical example.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3130-3142"},"PeriodicalIF":8.6,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143840060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Intelligent Inspection of Electronic Devices in Specific Environments via a Novel Cascade Network of Combining Mixed Sampling and Nonstrided Convolution","authors":"Bo Liu;Jing Guo;Yaowei Wang;Chengrong Yang;Fengtao Nan;Yun Yang","doi":"10.1109/TSMC.2025.3539699","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3539699","url":null,"abstract":"In environments where intelligent video surveillance systems (IVSSs) are deployed, particularly in review rooms, the detection of electronic devices constitutes a crucial task. Nevertheless, this task presents significant challenges attributed to the high rates of false positives and false negatives in electronic device detection (EDD), compounded by the low resolution of objects when viewed from multiple angles. To address these challenges, we propose a deep learning-based cascaded detection framework. Specifically, we design a mixed region sampling (MRS) method to enhance the foreground perception with background information and image details. We design a nonstrided downsampling method (ASDP) to map the attention spatial features to depth and improve the detection of low-resolution objects with fine-grained features. We enhance the model’s robustness to different viewing angles by feature perturbation during training. Moreover, we use a cascaded strategy to reduce false positives. To evaluate our method, we construct a real review room dataset (EDD) with 28,000 images from multiple angles. Our method improves the multiview generalization performance by 4.48% mAP and 5.62% mAR. On the public datasets Pascal VOC-2007 and visDrone-2019, our method is also superior to other suboptimal methods. We propose a framework for review environment detection, which is accurate, fast, and generalizable to other scenarios.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3287-3299"},"PeriodicalIF":8.6,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143839828","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Robust Practical Stabilization for Complex Dynamical Networks With DoS Attacks and Actuator Saturation","authors":"Xueya Shi;Zhinan Peng;Junzhi Yu;Hongfei Li","doi":"10.1109/TSMC.2025.3539781","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3539781","url":null,"abstract":"This article addresses the problem of designing an attack-resilient adaptive event-triggered (AET) controller for complex dynamical networks (CDNs) under DoS attacks and actuator saturation, with a focus on robust practical stability (RPS). First, considering the impact of DoS attacks on closed-loop systems, an AET controller against DoS attacks is designed. Unlike other event-triggered controllers, the complete timeline is divided, and the AET controller is built with two switching modes based on the intervals of dormant and active periods of DoS attacks in which the system is located. Second, to reconcile the AET controller with actuator saturation, a switched system modeling approach is established that explicitly incorporates saturation constraints into the coupled network dynamics. Third, a switched Lyapunov-Krasovskii functional (LKF) is proposed, with which sufficient conditions are provided to ensure the RPS, and a joint design strategy is developed for the desired triggered matrix and feedback gain using linear matrix inequalities (LMIs). Moreover, the results are generalized to the case of actuator faults, indicating that the system is able to achieve RPS with actuator faults. Finally, the proposed method is verified through an example.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3108-3118"},"PeriodicalIF":8.6,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143839924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dual Perspective Secure Analysis for Local Estimate-Based FDI Attacks in Networked Systems","authors":"Fuyi Qu;Hao Liu;Cheng Tan;Yuzhe Li","doi":"10.1109/TSMC.2025.3539835","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3539835","url":null,"abstract":"This article discusses the security concerns related to networked systems, where the sensor sends the local estimate to the remote estimator, which may be attacked. Traditionally, in the remote state estimation with the innovation or raw measurement case, denial of service (DoS) and false data injection (FDI) attacks are investigated thoroughly. Notably, for remote state estimation with local estimate cases considered in this article, most existing works consider DoS attacks but not FDI attacks, negatively affecting remote state estimation performance. Furthermore, current detection mechanisms encounter challenges when identifying such attacks due to the unavailable innovation or raw measurement. As such, we study FDI attacks under this framework and provide the corresponding secure analysis using a dual-perspective approach. Specifically, we propose a detector to detect such attacks using the prior information extracted from the remote estimator. Then, we analyze the existence of stealthy attacks and characterize the corresponding performance evaluation for the remote estimation under such attacks. Following this, we construct the optimal attack scheme, maximizing the expected average and terminal estimation error covariances, respectively. To reduce the above vulnerability, we develop a co-design transmission strategy and offer an analytical detection performance evaluation under different attack scenarios. Finally, simulations are provided to illustrate the proposed results.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3312-3325"},"PeriodicalIF":8.6,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143839829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Variational Bayesian Inference-Based Robust Dissimilarity Analytics Model for Industrial Fault Detection","authors":"Wanke Yu;Biao Huang;Gaoxi Xiao;Chuanke Zhang","doi":"10.1109/TSMC.2025.3538854","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3538854","url":null,"abstract":"Due to various reasons, outliers, ambient noise, and missing data inevitably exist in industrial processes, and thus robustness is important when establishing monitoring models. In this study, a robust dissimilarity analytics model (RDAM) is established with the Laplace distribution to detect process anomalies in noisy environments. Because of the heavy-tailed characteristic of the Laplace distribution, the proposed RDAM method is more robust to ambient noise and outliers when compared to Gaussian distribution-based models. Besides, the missing data problem is also considered and solved in the model development procedure. Using the variational Bayesian inference, the model parameters and latent variables of the RDAM model can be estimated. After that, a monitoring strategy is designed based on the obtained results with both static and dynamic statistics. By this means, both the static deviation of the current sample and the temporal correlation within the process data can be effectively revealed. A simulated example and a real low-pressure heater process are adopted to illustrate the performance of the proposed RDAM method. Specifically, the proposed RDAM method is robust to the ambient noise and missing values, and it has better detection sensitivity for the process anomalies than the selected comparison methods.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3275-3286"},"PeriodicalIF":8.6,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143839941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Dynamic Quantized Control of Fuzzy Semi-Markov Jump Systems With Fading Channels: An Improved Event-Triggered Mechanism","authors":"Meng-Jie Hu;Ju H. Park;Jun Cheng;Yan-Wu Wang","doi":"10.1109/TSMC.2025.3537276","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3537276","url":null,"abstract":"This article focuses on the dynamic quantized control for Takagi–Sugeno fuzzy semi-Markov jump systems (T–S FSMJSs) under fading channels and deception attacks, employing an improved event-triggered mechanism (ETM) strategy. Specifically, a more generalized semi-Markov process (SMP) that is governed by a higher-level deterministic switching signal (HLDSS) is addressed. A novel dynamic ETM is skillfully developed based on the information of quantization and two internal dynamic adjusting variables to further enhance the network bandwidth utilization. The dual asynchronous phenomenon between the plant and controller (asynchronous modes and mismatched premise variables) is addressed. This implies that the designed controller is not required to share the same membership functions and modes as the plant, establishing a more rational structure. The hidden semi-Markov model (HSMM) is attained to characterize the stochastic varying channel fading amplitudes. The Lyapunov function incorporating mode and fuzzy information is constructed to establish sufficient criteria ensuring the strictly dissipative performance and mean-square exponential stability (MSES) of the resulting closed-loop systems, and a new security fuzzy dual asynchronous controller is then developed. Finally, the efficacy and applicability of the results are demonstrated through numerical and practical examples.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 5","pages":"3519-3531"},"PeriodicalIF":8.6,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143839998","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Protocol-Based SMC for Fuzzy Semi-Markov Switching Systems With Multizone Probabilistic Time-Varying Delays","authors":"Jiangming Xu;Jun Cheng;Mohammed Chadli;Wenhai Qi","doi":"10.1109/TSMC.2025.3538550","DOIUrl":"https://doi.org/10.1109/TSMC.2025.3538550","url":null,"abstract":"This study addresses the sliding mode control (SMC) for Takagi-Sugeno (T-S) fuzzy semi-Markov switching systems (FSMSSs) characterized by multizone probabilistic time-varying delays. The dynamic behaviors of FSMSSs are captured through a comprehensive semi-Markov process framework that accommodates arbitrary switching scenarios. In addition, a tailored SMC law that ensures the system’s trajectory reaches and maintains a preset sliding surface over a specified finite time is applied. To address the challenges of communication load, a novel multizone probabilistic dynamic event-triggered protocol is introduced, leveraging the time-varying nature of transmission delays and incorporating two adjustable internal dynamic variables. The establishment of sufficient conditions for ensuring the stochastic finite-time stability of the closed-loop system is achieved through an effective Lyapunov functional methodology. Finally, the validity and superiority of the proposed methodologies are demonstrated by a mass-spring-damper mechanical system.","PeriodicalId":48915,"journal":{"name":"IEEE Transactions on Systems Man Cybernetics-Systems","volume":"55 4","pages":"3026-3035"},"PeriodicalIF":8.6,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143655040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}