ISA transactions最新文献

筛选
英文 中文
Finite frequency distributed fault detection in sensor networks with memory event-triggered scheme and deception attacks 基于记忆事件触发和欺骗攻击的传感器网络有限频率分布式故障检测。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-17 DOI: 10.1016/j.isatra.2025.03.008
Peng Cheng , Chenxiao Cai , PooGyeon Park
{"title":"Finite frequency distributed fault detection in sensor networks with memory event-triggered scheme and deception attacks","authors":"Peng Cheng ,&nbsp;Chenxiao Cai ,&nbsp;PooGyeon Park","doi":"10.1016/j.isatra.2025.03.008","DOIUrl":"10.1016/j.isatra.2025.03.008","url":null,"abstract":"<div><div>This paper concentrates on the finite frequency distributed fault detection (FD) problem for discrete-time nonlinear switched systems (SSs) with time-varying delays through sensor networks. First, the switching mechanism is described by the so-called sojourn probability method. Next, a distributed memory event-triggered scheme (METS) is employed to alleviate the network bandwidth load, and the adverse effects of malicious deception attacks are considered and analyzed. Furthermore, a finite frequency distributed FD filter (FDF) subject to METS and deception attacks is developed corresponding to each sensor node, and the existence conditions of the FDF are offered in the form of linear matrix inequalities. Lyapunov stability analysis shows that the resulting augmented filter system is mean-square stable and meets the specified full-frequency <span><math><msub><mrow><mi>H</mi></mrow><mrow><mi>∞</mi></mrow></msub></math></span> level bound and finite-frequency <span><math><msub><mrow><mi>H</mi></mrow><mrow><mo>−</mo></mrow></msub></math></span> level bound. Comparative simulations demonstrate that the proposed finite-frequency distributed FDF can detect unknown faults in nonlinear SSs on sensor networks more quickly and accurately than some existing full-frequency solutions.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"161 ","pages":"Pages 24-35"},"PeriodicalIF":6.3,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143789471","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Periodic disturbance rejection-based control of Switched Reluctance Motors Using ADRC and IMP for effective ripple reduction 利用 ADRC 和 IMP 对开关磁阻电机进行基于周期干扰抑制的控制,以有效降低纹波。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-17 DOI: 10.1016/j.isatra.2025.03.007
Carlos Vergara-Ramírez , John Cortés-Romero , Horacio Coral-Enriquez
{"title":"Periodic disturbance rejection-based control of Switched Reluctance Motors Using ADRC and IMP for effective ripple reduction","authors":"Carlos Vergara-Ramírez ,&nbsp;John Cortés-Romero ,&nbsp;Horacio Coral-Enriquez","doi":"10.1016/j.isatra.2025.03.007","DOIUrl":"10.1016/j.isatra.2025.03.007","url":null,"abstract":"<div><div>This paper proposes a control scheme that combines Active Disturbance Rejection (ADR) Control and the Internal Model Principle (IMP) to reduce ripple in Switched Reluctance Motors (SRMs) operating at variable speeds. The control architecture consists of two stages: an error-based ADRC for current loops and a periodic disturbance rejection (ADR+IMP) scheme for velocity control. The latter stage exploits two key SRM characteristics: the periodic behavior of signals and disturbances, and the angular-domain periodic nature of torque ripple. Experimental validation is performed on a laboratory test-bed using a 3-phase 12/8 SRM, where the proposed methodology is systematically applied. Results demonstrate the effectiveness of the control scheme in achieving significant ripple reduction under variable velocity profiles.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"160 ","pages":"Pages 298-308"},"PeriodicalIF":6.3,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143733142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Composite intelligent learning-based tracking control for discrete-time repetitive process 基于复合智能学习的离散时间重复过程跟踪控制。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-17 DOI: 10.1016/j.isatra.2025.03.005
Rongni Yang , Jianqiang Hao , Peng Shi , Imre J. Rudas
{"title":"Composite intelligent learning-based tracking control for discrete-time repetitive process","authors":"Rongni Yang ,&nbsp;Jianqiang Hao ,&nbsp;Peng Shi ,&nbsp;Imre J. Rudas","doi":"10.1016/j.isatra.2025.03.005","DOIUrl":"10.1016/j.isatra.2025.03.005","url":null,"abstract":"<div><div>In this work, a new composite iterative learning control (ILC) algorithm for the tracking issue of a class of discrete-time systems that operate repetitively over a finite time duration is developed. Particularly, the proposed intelligent learning process consists of two phases to achieve an enhanced tracking performance: the gain-adaptive iterative learning control (GAILC) phase and the sliding mode iterative learning control (SMILC) phase, respectively. Moreover, the switching of the two phases is determined by the tracking error. For GAILC phase, a prediction of tracking error based adaptive gain sequence is adopted to achieve a fast convergence in tracking error. For SMILC phase, an appropriate sliding surface function in the iteration domain is established, and then a novel SMILC law with a fractional power term is presented to achieve a high tracking precision. Finally, comparative simulations including a DC motor example are provided to validate the effectiveness and advantage of the proposed ILC strategy.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"160 ","pages":"Pages 122-130"},"PeriodicalIF":6.3,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143733141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fault reconstruction approach for saturated dynamic systems using adaptive estimation and optimization 基于自适应估计和优化的饱和动态系统故障重构方法。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-15 DOI: 10.1016/j.isatra.2025.03.012
Zhi-Wei Gao, Qiang Huang, Yuanhong Liu
{"title":"Fault reconstruction approach for saturated dynamic systems using adaptive estimation and optimization","authors":"Zhi-Wei Gao,&nbsp;Qiang Huang,&nbsp;Yuanhong Liu","doi":"10.1016/j.isatra.2025.03.012","DOIUrl":"10.1016/j.isatra.2025.03.012","url":null,"abstract":"<div><div>Modern industrial systems are becoming more complex and expensive, which have higher requirements on safety and reliability, and less tolerance on system performance degradation due to anomalies and faults. As a result, there is a strong motivation to address real-time monitoring and diagnosis techniques to detect a fault at an early stage and assess the severity of the faulty component. In this paper, real-time input and output data are used to monitor real-time and reconstruct faults for saturated dynamic systems using adaptive estimation and optimization techniques. Specifically, the saturation signals are partitioned into the conventional inputs without saturation and the beyond-saturation input signals. An unknown input decoupling approach is used to decouple unknown input uncertainty partially, and adaptive techniques are addressed to simultaneously reconstruct the additive faults and the beyond-saturation input signals. The estimator gains are obtained via the optimization for solving strict linear matrix inequalities to attenuate the effect from un-decoupled uncertainty to the estimation error dynamics. It is worth highlighting that the saturated input signals can also be reconstructed, which is particularly important when the saturated inputs cannot be measured directly, or it is costly to install extra sensors for the measurement. The proposed approach is to design offline, but implement using real-time input and output data, implying excellent real-time performance. The effectiveness of the proposed algorithm is demonstrated by an aircraft system and a single-link flexible joint robotic system.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"161 ","pages":"Pages 48-65"},"PeriodicalIF":6.3,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143712679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probability-guaranteed set-membership filtering for nonlinear 2-D systems with measurement outliers under the adaptive event-triggered mechanism 自适应事件触发机制下具有测量异常值的非线性二维系统的概率保证集隶属度滤波。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-13 DOI: 10.1016/j.isatra.2025.02.035
Hui Yu , Dongjie Peng , Wei Yang , Dongyan Chen
{"title":"Probability-guaranteed set-membership filtering for nonlinear 2-D systems with measurement outliers under the adaptive event-triggered mechanism","authors":"Hui Yu ,&nbsp;Dongjie Peng ,&nbsp;Wei Yang ,&nbsp;Dongyan Chen","doi":"10.1016/j.isatra.2025.02.035","DOIUrl":"10.1016/j.isatra.2025.02.035","url":null,"abstract":"<div><div>This paper is primarily devoted to investigating the probability-guaranteed set-membership (PGSM) filtering problem for a class of nonlinear two-dimensional (2-D) systems with measurement outliers under the adaptive event-triggered mechanism (AETM). Here, the sensors are connected to the remote filter through a bandwidth-limited communication network. With the purpose of reducing the communication burden, a novel AETM is proposed to optimize the scheduling of transmitted data to the filter, where the triggering threshold is allowed to be dynamically adjusted based on the transmission error. This paper aims to develop a two-step recursive PGSM filter, ensuring that the filtering error remains within the prescribed ellipsoidal set with a specified probability. To safeguard the filtering process against performance degradation caused by measurement outliers, a saturation function is incorporated into the filter structure to restraint the impact of outlier-contaminated innovations. By utilizing the mathematical induction approach and convex optimization technique, a sufficient condition is derived to guarantee the existence of the desired PGSM filter, and the filter gains are obtained in terms of the solutions to a series of convex optimization problems with ellipsoidal constraints. In the end, an illustrative example is implemented to reveal the effectiveness of the addressed filter design scheme. The results demonstrate that, compared with the Kalman filter and set-membership (SM) filter, the proposed PGSM filter exhibits superior filtering performance and low upper bound of filtering error in the presence of unknown but bounded (UBB) noise.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"160 ","pages":"Pages 87-96"},"PeriodicalIF":6.3,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143694829","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Envelope spectrum knowledge-guided domain invariant representation learning strategy for intelligent fault diagnosis of bearing 包络谱知识引导的轴承故障智能诊断领域不变表示学习策略。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-11 DOI: 10.1016/j.isatra.2025.03.004
Zhi Tang , Lin Bo , Hao Bai , Zuqiang Su , Shuxian Wang , Yanhao Zhao
{"title":"Envelope spectrum knowledge-guided domain invariant representation learning strategy for intelligent fault diagnosis of bearing","authors":"Zhi Tang ,&nbsp;Lin Bo ,&nbsp;Hao Bai ,&nbsp;Zuqiang Su ,&nbsp;Shuxian Wang ,&nbsp;Yanhao Zhao","doi":"10.1016/j.isatra.2025.03.004","DOIUrl":"10.1016/j.isatra.2025.03.004","url":null,"abstract":"<div><div>Deep learning has significantly advanced bearing fault diagnosis. Traditional models rely on the assumption of independent and identically distributed, which is frequently violated due to variations in rotational speeds and loads during bearing fault diagnosis. The fault diagnosis of the bearing based on representation learning lacks the consideration of spectrum knowledge and representation diversity under multiple working conditions. Therefore, this study presents a domain-invariant representation learning strategy (DIRLs) for diagnosing bearing faults across differing working conditions. DIRLs, by leveraging envelope spectrum knowledge distillation, captures the Fourier characteristics as domain-invariant features and secures robust health state representations by aligning high-order statistics of the samples under different working conditions. Moreover, an innovative loss function, which maximizes the two-paradigm metric of the health state representation, is designed to enrich representation diversity. Experimental results demonstrate an average AUC improvement of 28.6 % on the Paderborn-bearing dataset and an overall diagnostic accuracy of 88.7 % on a private bearing dataset, validating the effectiveness of the proposed method.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"160 ","pages":"Pages 205-217"},"PeriodicalIF":6.3,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143659841","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Motion control strategy for robotic arm using deep cascaded feature-enhancement Bayesian broad learning system with motion constraints 使用带运动约束的深度级联特征增强贝叶斯广义学习系统的机械臂运动控制策略。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-11 DOI: 10.1016/j.isatra.2025.02.027
Jiyong Zhou , Guoyu Zuo , Xiang Li , Shuangyue Yu , Shuaifeng Dong
{"title":"Motion control strategy for robotic arm using deep cascaded feature-enhancement Bayesian broad learning system with motion constraints","authors":"Jiyong Zhou ,&nbsp;Guoyu Zuo ,&nbsp;Xiang Li ,&nbsp;Shuangyue Yu ,&nbsp;Shuaifeng Dong","doi":"10.1016/j.isatra.2025.02.027","DOIUrl":"10.1016/j.isatra.2025.02.027","url":null,"abstract":"<div><div>Intelligent control strategies can significantly enhance the efficiency of model parameter adjustment. However, existing intelligent motion control strategies for robotic arms based on the broad learning system lack sufficient accuracy and fail to account for the effects of joint motion limitations on overall control performance. To address the aforementioned challenges, this paper proposes a robotic arm motion control strategy based on a deep cascaded feature-enhanced Bayesian broad learning system with motion constraints (MC-DCBLS). Firstly, the motion control strategy based on a deep cascaded feature-enhanced Bayesian broad learning system (DCBBLS) is designed, which simplifies the modeling process and significantly improves control accuracy. Secondly, the motion constraint mechanism is introduced to optimize the control strategy to ensure that the robotic arm motion does not break through the physical limit. Finally, the parameter constraints of the control strategy network were obtained by introducing the Lyapunov theory to ensure the stability of the robotic arm motion control. The effectiveness of the proposed control strategy was validated through both simulations and physical experiments. The results demonstrated that the strategy significantly improved the accuracy of robotic arm motion control, with the root mean square error (RMSE) in position tracking reduced to 0.038 rad. This represents a 61.26% reduction in error compared to existing techniques.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"160 ","pages":"Pages 268-278"},"PeriodicalIF":6.3,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143631147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global fixed-time position-constrained guidance and adaptive fuzzy prescribed performance control using novel shift function for multiple unmanned surface vehicles formation 基于新颖移位函数的多水面无人车辆编队全局定时位置约束制导和自适应模糊预定性能控制。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-08 DOI: 10.1016/j.isatra.2025.02.034
Haiyan Tong, Mingxiao Sun, Tiantian Luan
{"title":"Global fixed-time position-constrained guidance and adaptive fuzzy prescribed performance control using novel shift function for multiple unmanned surface vehicles formation","authors":"Haiyan Tong,&nbsp;Mingxiao Sun,&nbsp;Tiantian Luan","doi":"10.1016/j.isatra.2025.02.034","DOIUrl":"10.1016/j.isatra.2025.02.034","url":null,"abstract":"<div><div>Guidance and control of multiple unmanned surface vehicles (Multi-USVs) present many challenges due to their under-actuation and the unknown environmental disturbance. This research addresses the formation guidance and control problems of multi-USVs by designing a global fixed-time constrained guidance and control formation approach. First, a global fixed-time control Lyapunov function (GFCLF) is proposed using an innovative shift function to deal with the fixed-time output partial constraint. Subsequently, a fixed-time asymmetric position-constrained guidance algorithm for multi-USVs formation is designed by combining the line-of-sight guidance principle, the leader–follower structure, and the suggested GFCLF. Second, a global fixed-time prescribed performance function (GFPPF) is designed to solve the global tracking error performance constraint problem. Then, global fixed-time adaptive fuzzy prescribed performance control laws are developed to achieve the tracking control for the multi-USVs formation task, in which a fixed-time adaptive fuzzy logic system is designed to approximate the unknown disturbance of USVs. Furthermore, the closed-loop control system stability analysis is proven to support that all tracking error signals are bounded in a fixed time. Finally, simulations and comparative cases using the physical USV model are studied to demonstrate the practicality and superiority of theoretical results.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"160 ","pages":"Pages 58-71"},"PeriodicalIF":6.3,"publicationDate":"2025-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143618131","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive terminal super-twisting prescribed performance controller for near-space vehicle based on data-driven model 基于数据驱动模型的近空飞行器自适应末端超扭规定性能控制器。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-08 DOI: 10.1016/j.isatra.2025.03.001
Tianchen Zhang , Yibo Ding , Xiaokui Yue , Naying Li
{"title":"Adaptive terminal super-twisting prescribed performance controller for near-space vehicle based on data-driven model","authors":"Tianchen Zhang ,&nbsp;Yibo Ding ,&nbsp;Xiaokui Yue ,&nbsp;Naying Li","doi":"10.1016/j.isatra.2025.03.001","DOIUrl":"10.1016/j.isatra.2025.03.001","url":null,"abstract":"<div><div>A data-driven adaptive terminal super-twisting prescribed performance controller (DASTPC) is designed for near-space vehicle (NSV) to satisfy transient and steady-state performance, and prevent scramjet choking. Firstly, a novel predetermined-time performance function is proposed to guarantee that tracking error can converge to a prescribed bound of small residual sets at the predetermined time. Compared with traditional performance functions, the predetermined-time performance function can achieve faster respond speed, realize more accurate convergence, and avoid overlarge initial value of actuators. Secondly, by combining the predetermined-time performance function with sliding mode control, a novel non-singular fast terminal sliding surface and an improved adaptive super-twisting reaching law are proposed to improve computational efficiency and accelerate convergent rate of system. The adaptive reaching law can avoid excessive gains and attenuate chattering by automatically tuning control gain. Thirdly, a deep recurrent neural network-based long-short term memory (LSTM) is employed to learn time-series historical flight dynamics data offline, so as to construct a data-driven LSTM training model. This data-driven model replaces nominal dynamics model of NSV in DASTPC, effectively suppressing model uncertainties. In addition, a homogeneous high-order sliding mode observer is utilized to compensate for external disturbances, avoiding excessive parameter estimation. Since boundary conditions of the predetermined-time performance function are fully satisfied, the DASTPC can effectively restrict amplitude of angle of attack, thus ensuring the intake condition of scramjet. Ultimately, to illustrate the superiority of DASTPC, several sets of simulations are performed on NSV subject to prescribed performance bound, external disturbances and parameter perturbations.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"160 ","pages":"Pages 1-18"},"PeriodicalIF":6.3,"publicationDate":"2025-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143626835","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general TD-Q learning control approach for discrete-time Markov jump systems 离散时间马尔可夫跳跃系统的一般 TD-Q 学习控制方法。
IF 6.3 2区 计算机科学
ISA transactions Pub Date : 2025-03-06 DOI: 10.1016/j.isatra.2025.02.032
Jiwei Wen , Huiwen Xue , Xiaoli Luan , Peng Shi
{"title":"A general TD-Q learning control approach for discrete-time Markov jump systems","authors":"Jiwei Wen ,&nbsp;Huiwen Xue ,&nbsp;Xiaoli Luan ,&nbsp;Peng Shi","doi":"10.1016/j.isatra.2025.02.032","DOIUrl":"10.1016/j.isatra.2025.02.032","url":null,"abstract":"<div><div>This paper develops a novel temporal difference Q (TD-Q) learning approach, designed to address the robust control challenge in discrete-time Markov jump systems (MJSs) which are characterized by entirely unknown dynamics and transition probabilities (TPs). The model-free TD-Q learning method is uniquely comprehensive, including two special cases: Q learning for MJSs with unknown dynamics, and TD learning for MJSs with undetermined TPs. We propose an innovative ternary policy iteration framework, which iteratively refines the control policies through a dynamic loop of alternating updates. This loop consists of three synergistic processes: firstly, aligning TD value functions with current policies; secondly, enhancing Q-function’s matrix kernels (QFMKs) using these TD value functions; and thirdly, generating greedy policies based on the enhanced QFMKs. We demonstrate that, with a sufficient number of episodes, the TD value functions, QFMKs, and control policies converge optimally within this iterative loop. To illustrate efficiency of the developed approach, we introduce a numerical example that highlights its substantial benefits through a thorough comparison with current learning control methods for MJSs. Moreover, a structured population dynamics model for pests is utilized to validate the practical applicability.</div></div>","PeriodicalId":14660,"journal":{"name":"ISA transactions","volume":"160 ","pages":"Pages 111-121"},"PeriodicalIF":6.3,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143607623","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信