{"title":"Distributed Optimal Coordination Control for Continuous-Time Nonlinear Multi-Agent Systems With Input Constraints","authors":"Y. Deng, Jun Xiao, Qinglai Wei","doi":"10.1109/DDCLS49620.2020.9275176","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275176","url":null,"abstract":"This paper is concerned with an optimal coordination control problem for nonlinear multi-agent systems (MASs) with constraints of the control inputs. The idea of daptive dynamic programming (ADP) algorithm is to use the policy iteration to solve the coupled Hamilton-Jacobi equations. First, a suitable non-quadratic functional is introduced into the cost function to transform the question into an optimization problem. Second, a distributed control law is designed for each agent, which aims that the cost function of the MASs converge to Nash equilibrium. Next, the analysis of the convergence is indicated that the iterative cost functions of nonlinear multi-agent systems is convergent. Neural network (NNs) are used to approximate the cost functions for the calculation of the control laws. Finally, simulation results show the effectiveness of the coordination control algorithm.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133018789","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Fast-Convergence Method of Monte Carlo Counterfactual Regret Minimization for Imperfect Information Dynamic Games","authors":"Xiaoyang Hu, Li Xia, Jun Yang, Qianchuan Zhao","doi":"10.1109/DDCLS49620.2020.9275075","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275075","url":null,"abstract":"Among existing algorithms for solving imperfect-information extensive-form games, Monte Carlo Counterfactual Regret Minimization (MCCFR) and its variants are the most popular ones. However, MCCFR suffers from slow convergence due to its high variance in estimating values. In this paper, we introduce Semi-OS, a fast-convergence method developed from Outcome-Sampling MCCF R (OS), the most popular variant of MCCFR. Semi-OS makes two novel modifications to OS. First, Semi-OS stores all histories and their values at each information set. Second, after each time we update the strategy, Semi-OS requires a full game-tree traversal to update these values. These two modifications yield a better estimation of regrets. We show that, by selecting an appropriate discount rate, Semi-OS not only significantly speeds up the convergence rate in Leduc Poker but also statistically outperforms OS in head-to-head matches of Leduc Poker, a common testbed of imperfect information games, involving 200,000 hands.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133525052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jian Yao, Ze-yu Zhang, Li Xia, Jun Yang, Qianchuan Zhao
{"title":"Solving Imperfect Information Poker Games Using Monte Carlo Search and POMDP Models","authors":"Jian Yao, Ze-yu Zhang, Li Xia, Jun Yang, Qianchuan Zhao","doi":"10.1109/DDCLS49620.2020.9275053","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275053","url":null,"abstract":"Recent advances achieved in the field of reinforcement learning have led AI algorithms capable of beating world champions in some perfect information games like Chess and Go. However, the AI approach to imperfect information games (such as Poker) is much more difficult because the complexities in estimating hidden information and behaviors of opponents may become extremely challenging. Since Markov Decision Process (MDP) is the underlying mathematical model of reinforcement learning with perfect information games, Partially Observable Markov Decision Process (POMDP) deserves research attention for studying the games with imperfect information. In this paper, we study a 16-cards Rhode Island Hold’em poker game and present a POMDP model to formulate this imperfect information extensive game. Based on the POMDP model, we use Bayesian approach to estimate the opponent’s hand and transform the original problem to several perfect information games. Furthermore, to handle the challenge of explosively huge storage space and computation burdens, we develop a Monte Carlo optimization algorithm to estimate the action values of the POMDP model. Finally, we conduct numerical experiments in the Rhode Island Hold’em poker game to demonstrate the effectiveness of our approach.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133544361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Calculation Method of Carbon Emission in Production Process for Optimization of Polyester Low Elastic Yarn Process","authors":"Ning Li, Jingfeng Shao","doi":"10.1109/DDCLS49620.2020.9275279","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275279","url":null,"abstract":"Aiming at the problems of many parameters and energy consumption fluctuation in the production process of polyester low elastic yarn, firstly, the mechanism of energy consumption fluctuation and coupling of various parameters was analyzed. Based on the carbon footprint theory, an energy consumption analysis model of polyester low elastic yarn process was constructed around the production process of polyester low elastic yarn. Furthermore, the carbon footprint accounting was carried out for the process flow, and then the carbon emission and key process parameters were fitted, on the basis of the function relationship between the parameters, a process optimization model of polyester low elastic yarn based on the combination of signal-to-noise ratio orthogonal test and comprehensive weighting VIKOR method was constructed. Finally, through the experimental verification and analysis, the results show that the model reduces the carbon emission of the production process by 4.58% compared with the initial state by optimizing the key process parameters of polyester low elastic yarn production. On the premise of reducing energy consumption, the quality of polyester low elastic yarn has been improved.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132440745","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Honghai Ji, Hao Liu, Shida Liu, Li Wang, Lingling Fan
{"title":"Data-driven Adaptive Cooperative Control for Urban Traffic Signal Timing in Multi-intersections","authors":"Honghai Ji, Hao Liu, Shida Liu, Li Wang, Lingling Fan","doi":"10.1109/DDCLS49620.2020.9275110","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275110","url":null,"abstract":"This paper presents a new distributed data-driven adaptive cooperative control method (DDACC) for urban traffic signal timing which can achieve the multi-directional queuing length balance with changeable cycle in multi-intersections. This method can guarantee the consensus convergence of the distributed coordinated errors of queuing length with the goal of reducing traffic congestion in multi-agent traffic systems. The proposed DDACC has three novel features, merely using the collected I/O traffic queueing length data and network topology of multi-directional signal controllers at multi-intersections, considering maximum and minimum green time constraints, well working on both undersaturation and supersaturation traffic flow conditions. The results are illustrated by numerical and experimental comparison simulations which are performed on a VISSIM-VB-MATLAB joint simulation platform.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131278021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Jiguang Xue, Chunsheng Yan, Danxue Wang, Jun Wang, Jun Wu, Zehua Liao
{"title":"Adaptive Dynamic Programming Method for Optimal Battery Management of Battery Electric Vehicle","authors":"Jiguang Xue, Chunsheng Yan, Danxue Wang, Jun Wang, Jun Wu, Zehua Liao","doi":"10.1109/DDCLS49620.2020.9275259","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275259","url":null,"abstract":"One of the main influencing factors of battery electric vehicle (BEV) application is the high-cost of the battery. We consider to apply the battery of BEV to smart residential environments when the BEV is idle, so that we can lower the utility cost. Therefore, an adaptive dynamic programming (ADP) method is designed to solve the optimal battery management, which avoids the dimension disaster of the complex nonlinear BEV system. First, the operation modes of the battery are analyzed, and the problem statement is carried out. Then, the corresponding self-learning optimization algorithm is developed based on ADP. Finally, numerical results by experiment simulations are used to verify the ADP algorithm.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127426540","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Mingxuan Xia, Zehui Mao, Rui Zhang, B. Jiang, Muheng Wei
{"title":"A New Compound Fault Diagnosis Method for Gearbox Based on Convolutional Neural Network","authors":"Mingxuan Xia, Zehui Mao, Rui Zhang, B. Jiang, Muheng Wei","doi":"10.1109/DDCLS49620.2020.9275264","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275264","url":null,"abstract":"This paper focus on the fault diagnosis problem for the compound faults of rotating machine, in which the rolling bearing and the sun gear faults simultaneously occurred are considered as the compound fault. Considering the traditional compound fault diagnosis methods usually utilize the manual fault features extraction, which are mainly dependent on engineering experience, we propose a compound fault diagnosis method named multi-sensor based convolutional neural network (MCNN). For vibration signals of compound faults, the different transmission paths and the positions of the sensors means one part of the embedded single faults may have higher energy. The vibration signals collected from three sensors at different positions can help guarantee the completeness of the characteristics of the compound fault. Then, the multi-sensor signals are combined together and fused by the convolutional operation of the convolutional neural network (CNN) model. The CNN model, which can automatically extract features from the vibration signals and achieve classification, is used for fault extraction and fault recognition. The experiments are presented on the physical platform of power transmission, and the proposed fault diagnosis method can be verified with the satisfied performance.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115909726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A Deep Learning Method for Rolling Bearing Fault Diagnosis through Heterogeneous Data","authors":"Wei Zhou, Yandong Hou","doi":"10.1109/DDCLS49620.2020.9275189","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275189","url":null,"abstract":"Vibration signals of rolling bearing have multiple heterogeneous forms. Traditional fault diagnosis methods use 1D time-series signals or converted 2D signals for fault diagnosis. However, using the former will lose the spatial neighborhood features; using the latter will ignore time-series features, which caused information waste. In this paper, a new heterogeneous form of bearing vibration signals is proposed to address the problem. Our contributions of include: First, we proposed dynamic waveform sequences, which is a new heterogeneous form and can simultaneously reflect time-series features and spatial neighborhood features in vibration signals. Second, the CCLSTM (Conv-ConvLSTM) model is designed to extract the above two features layer by layer. Relying on the powerful feature extraction capability of CCLSTM, it is possible to simultaneously extract the time-series features and spatial neighborhood features in a single fault diagnosis network. The experimental verification through real bearing fault data sets shows that this method can effectively improve the diagnostic accuracy.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124509840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Sliding-Mode Control of Dynamic Wireless Charging EV System","authors":"Yayan Kang, Yang Song, Cheng Peng, Ling Deng","doi":"10.1109/DDCLS49620.2020.9275270","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275270","url":null,"abstract":"The main challenge of electric vehicle (EV) dynamic wireless technology is the fluctuation of mutual inductance caused by the movement of EV, which leads to the instability of system. Based on the variable structure control, this paper proposes the output power regulation method of EV. Firstly, we use Biot-Savart Law to derive the mathematical expression of mutual inductance between transmitter and receiver of dynamic wireless charging (DWC) system. According to the mathematical expression, mutual inductance is related to the lateral misalignment, longitudinal offset and vertical distance of the transmitter coil and receiver coil. Then, the state space equation based on Kirchhoff's voltage / current law is established for DWC system. Finally, in order to ensure the stability of the output power, a sliding mode controller is used to adjust the transmitted power for the DWC system and track the reference input. Through simulation, it is proved that the system output is consistent under the condition of mutual inductance fluctuation brought by the relative distance change between transmitter and receiver.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124547393","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Event-Triggered Consensus Output Tracking Strategy for Multiagent Systems Utilizing Model-Free Adaptive Control","authors":"Weizhao Song, Jian Feng","doi":"10.1109/DDCLS49620.2020.9275167","DOIUrl":"https://doi.org/10.1109/DDCLS49620.2020.9275167","url":null,"abstract":"In this article, a model-free-adaptive-control-based (MFAC-based) event-triggered (ET) consensus output tracking problem for multiagent systems (MASs) is investigated. The dynamic models of agents are unknown, and only a subset of agents can acquire the reference trajectory. The consensus tracking algorithm is designed by the real-time input/output data and pseudo-partial-derivative (PPD), which is an important parameter of MFAC approach. An output observer is built to design the centralized ET mechanism. Then, the boundedness analysis that the tracking error is uniformly ultimately bounded (UUB) is given. Finally, a simulation experiment is provided to verify the feasibility of the ET consensus output tracking strategy for MASs.","PeriodicalId":420469,"journal":{"name":"2020 IEEE 9th Data Driven Control and Learning Systems Conference (DDCLS)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123449437","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}