{"title":"Adaptive Optimal Control for Discrete-Time Linear Systems via Hybrid Iteration","authors":"Omar Qasem, Weinan Gao, Hector M. Gutierrez","doi":"10.1109/DDCLS58216.2023.10167108","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10167108","url":null,"abstract":"In this paper, a novel dynamic programming (DP) and adaptive dynamic programming (ADP) algorithms are proposed, namely hybrid iteration (HI), for discrete-time linear systems. The proposed HI approach fill up the performance gap of two well- known DP algorithms, i.e., policy iteration (PI) and value iteration (VI). In particular, HI drops the need of the prior knowledge of an initial stabilizing control policy required in PI, and at the same time it maintains a fast quadratic convergence rate compared with VI. A data-driven adaptive optimal controller design is also proposed based on the proposed HI algorithm. Simulation results for randomly generated discrete-time linear systems with different system orders demonstrate that the proposed HI approach can significantly save CPU time and reduce the number of learning iterations to converge to the optimal solution comparing with the VI approach. The data-driven HI method is implemented to an application of turbocharged diesel engine with exhaust gas recirculation, and the simulation results illustrate the efficacy of the proposed HI method.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"130 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122761865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Fault-tolerant Tracking Control for Mobile Robots Based on the Framework of Intermediate Estimator and MPC","authors":"Liang-Huan Ying, Jun-Wei Zhu","doi":"10.1109/DDCLS58216.2023.10166441","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10166441","url":null,"abstract":"During the actual motion of the wheeled mobile robot(WMR), actuator faults caused by ageing or system components' misoperation may significantly impact the real-time control performance. Therefore, this paper proposed a fault-tolerant tracking control approach based on MPC and intermediate estimator (IE) that the observer matching condition need not be satisfied. First, a set of reference trajectories is generated from the virtual system, and a nominal tracking error system is obtained based on the relative position of the actual system. Then, an IE is used to estimate the state error and actuator fault of WMR so that an estimation-based predictive model and a fault-compensated composite control law can be obtained to ensure stable control of WMR with an actuator fault. Finally, the simulation that compared to the nominal MPC showed that this fault-tolerant control algorithm has good performance in adapting to actuator faults, which verifies this algorithm's effectiveness.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"99 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122971526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"In-distribution stability analysis for neural Markovian jump systems: A delay-feedback control method","authors":"Xiaohang Li","doi":"10.1109/DDCLS58216.2023.10167380","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10167380","url":null,"abstract":"In this paper, a new idea of in-distribution stability is analyzed for a class of neural Markovian jump systems with non- differential time-delays and type disturbance. To achieve such a goal, an asynchronous state-feedback controller is proposed to facilitate the design. Consider the fact that there always exist delays during practical signal transmissions, and therefore a new asynchronous delay-feedback control is reconstructed to render the closed-loop system to satisfy two preconditions. Whereupon, the closed-loop system is proved to be in-distribution stable through three steps. Note that the boundary of the designed controller does not need to be known in advance, which shows superiority over the existing delay-involved controllers.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114175432","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hao Wu, Yinghao Zhao, Xu Yang, Jian Huang, Jiarui Cui
{"title":"Fault detection for rolling bearings by multi-sensor information fusion method with adaptive weights","authors":"Hao Wu, Yinghao Zhao, Xu Yang, Jian Huang, Jiarui Cui","doi":"10.1109/DDCLS58216.2023.10166660","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10166660","url":null,"abstract":"Driven by the increasing needs for production safety, a fault detection method based on multi-sensor fusion with adaptive weight coefficients is proposed in this paper to make full use of multi-measuring points information. To this end, considering the different information among multi-measuring points, the variance contribution rate (VCR) of vibration signals are used to design adaptive weight coefficients for data fusion to fully utilize the information contained in each vibration signal. On this basis, the least atoms contain time domain and frequency domain are extracted based on dictionary sparse representation (DSR) algorithm to represent the feature information of the original signal to weaken the influence of the curse of dimensionality. Finally, K-nearest neighbor distance is used in sparse residual space (SRS) for fault detection (K-SRS). The effectiveness of the proposed method is demonstrated by the rolling bearings data, and results show the advantage of our proposed approach.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129531588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Data-Based Approximate Optimal Control for Unknown Nonaffine Systems via Dynamic Feedback","authors":"Jin-Jye Lin, Bo Zhao, Derong Liu","doi":"10.1109/DDCLS58216.2023.10166780","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10166780","url":null,"abstract":"In this paper, an integral reinforcement learning (IRL)-based approximate optimal control (AOC) method for unknown nonaffine systems is developed by using dynamic feedback. For optimal control problems of nonaffine systems, optimal control policy cannot be expressed explicitly since the input gain matrix is unknown. Therefore, the nonaffine system is transformed into an augmented affine system by introducing a dynamic feedback signal as the virtual control input. Moreover, by designing an appropriate value function for the augmented affine system, the optimal control of augmented affine system is formulated as the AOC for unknown nonaffine systems. Moreover, the IRL method is adopted to derive the approximate solution of Hamilton-Jacobi-Bellman equation via the critic-only structure. Theoretical analysis concludes that the closed-loop system is uniformly ultimately bounded by using the developed IRL-based AOC scheme. An example is utilized to demonstrate the effectiveness of the present approach.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129547117","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
S. Zhou, Zhenhua Han, Yunchen Zhou, Yunda Zhao, Hao Wang, Youwu Du
{"title":"Load Tooth Contact Analysis of Composite Cycloidal Planetary Transmission with Small Tooth Difference","authors":"S. Zhou, Zhenhua Han, Yunchen Zhou, Yunda Zhao, Hao Wang, Youwu Du","doi":"10.1109/DDCLS58216.2023.10166206","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10166206","url":null,"abstract":"Based on the profile equation of composite cycloidal tooth, the coordinate system of the contact analysis model of composite cycloidal gear teeth is established. Depending on the conjugate contact conditions of gears, the vector equation of tooth contact analysis (TCA) is created. The backlash and transmission error of the composite cycloidal planetary gear pair with small tooth difference (Abbreviated as composite cycloidal gear pair) after modification are calculated by solving the equation. The loaded tooth contact analysis (LTCA) model of the composite cycloid is built using Hertzian contact theory, a moment balancing equation, and deformation compatibility condition. The composite cycloidal transmission error and contact force in the load tooth contact analysis mode are solved. The distribution of contact force under different crank shaft angles and the sensitivity of bearing transmission error and contact force to tooth profile adjustment coefficient are analyzed with examples. The results show that the transmission accuracy of composite cycloid gears under load conditions is within 1 arcmin, which meets the requirements for precision transmission performance.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128423083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhaoqin Wu, Weixun Li, Xiangyang Du, Jing Xiao, Limin Zhang
{"title":"Group Consensus for Second-order Linear Multi-agent Systems with Intermittent Control","authors":"Zhaoqin Wu, Weixun Li, Xiangyang Du, Jing Xiao, Limin Zhang","doi":"10.1109/DDCLS58216.2023.10166220","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10166220","url":null,"abstract":"This paper focused on the tracking group consensus problem in a second-order multi-agent system, which is based on a multigroup network structure under the constraint of non-periodic intermittent communication. We proposed a new intermittent consensus protocol and performed a convergence analysis using Lyapunov stability theory and algebraic graph theory. The convergence analysis mainly adopts two assumptions on which the protocol relies, which are that the communication interval is time-varying and the inter-group relations. Based on these assumptions, sufficient conditions for achieving group consensus in a second-order multi-agent system with non-periodic intermittent communication are derived. To verify the conclusions drawn from the mathematical analysis, the paper finally used MATLAB to perform numerical simulations to verify the validity and correctness of the results.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128729129","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Differential Dynamic Programming for Finite-Horizon Multi-Player Non-Zero-Sum Differential Games of Nonlinear Systems","authors":"Yuqi Zhang, Bin Zhang","doi":"10.1109/DDCLS58216.2023.10166320","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10166320","url":null,"abstract":"In this paper, an iterative algorithm based on differential dynamic programming (DDP) is developed to solve the finite-horizon multi-player non-zero-sum (NZS) games. By using the DDP, the coupled Hamilton-Jacobi (HJ) equations are expanded from partial differential forms to higher-order differential forms. By approximating the value functions and optimal control policies through several finite sets of basis functions, the DDP expansions are transformed into algebraic matrix equations in integral forms. Then a policy iteration (PI) algorithm is provided to solve the feedback Nash equilibrium of above NZS games. Finally, two simulation examples are given to demonstrate the feasibility of the developed algorithm.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129272388","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Research on Privacy Protection of Internet of Vehicles Based on Elliptic Curve Ring Signature","authors":"Weiqi Mao, Bo Mi, Darong Huang, Haoyu Ma","doi":"10.1109/DDCLS58216.2023.10166665","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10166665","url":null,"abstract":"With the continuous development of new energy vehicles in recent years, the application of the Internet of Vehicles (IoV)has also been rapidly expanded. However, behind the development of this series of technologies, there will also be many security issues has happened. In response to this phenomenon, this paper uses elliptic curve ring signature technology to protect the privacy of messages signed by vehicle users. By hiding their public key in a user group, the other party will not find the information through the signed message. source. Prevent the disclosure of identity and privacy of vehicle users in the process of anonymous voting. The simulation results show that our method is feasible in the Internet of Vehicles (IoV) because it is completely sufficient to support this scheme from the perspective of time.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129431647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Reinforcement Learning Driving Strategy based on Auxiliary Task for Multi-Scenarios Autonomous Driving","authors":"Jingbo Sun, Xing Fang, Qichao Zhang","doi":"10.1109/DDCLS58216.2023.10166271","DOIUrl":"https://doi.org/10.1109/DDCLS58216.2023.10166271","url":null,"abstract":"Reinforcement learning (RL) has made great progress in autonomous driving applications. However, using one RL based driving policy for multi-scenarios autonomous driving is still challenging for RL in autonomous driving. There are different observations and reward measurements in different scenarios. At the same time, there is also the problem of multi-source heterogeneous observation in autonomous driving. To address the problems above, we propose a reinforcement learning framework based on the auxiliary task. Firstly, we designed a reward function to enable vehicles to learn safe and efficient strategies. Further, an auxiliary task is designed to learn the characteristics of different scenarios so that the ego agent can adopt different strategies for different scenarios. Finally, in order to handle the driving problem in multiple scenarios, we propose a representation network based on Multi-layer perceptron (MLP), Convolutional neural network (CNN), and Transformer networks to learn multi-source heterogeneous observation. The multi-source heterogeneous observation consists of the ego vehicle state, the bird's eye view (BEV) state and neighbour vehicle states. Experiments show that our method achieves a higher success rate compared to a popular reinforcement learning algorithm.","PeriodicalId":415532,"journal":{"name":"2023 IEEE 12th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130233484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}