Libin Dong, Cheng Yang, Keyao Wang, Yi Wang, Quan Li, Jiandong Sun
{"title":"并行Agent的设计与应用","authors":"Libin Dong, Cheng Yang, Keyao Wang, Yi Wang, Quan Li, Jiandong Sun","doi":"10.1109/DTPI55838.2022.9998899","DOIUrl":null,"url":null,"abstract":"The parallel agent learns the model according to the environment feedback, and selects the appropriate amount of action. It consists of automatic modeling system, automatic simulation system and parallel execution system. The automatic modeling system sends an excitation signal to the environment and establishes the model according to the system state data. The automatic simulation system automatically carries out the step response simulation experiment by using predictive dynamic optimization technology. The parallel execution system will automatically switch with the actual system without disturbance, and carry out step disturbance test. If the performance is excellent, it will be put into application. The parallel agent is divided into two processes: learning and experiment. The learning process refers to that agents collect feedback information and learn the model. The experimental process refers to that the agent constructs the predictive dynamic optimization closed-loop control system based on the obtained model and carries out the step response simulation experiment, and switches to the actual system. If the step disturbance test has excellent performance, it will be put into operation. Based on the parallel control theory, the parallel agent automatically completes the whole process according to the step sequence and has a certain adaptive ability.","PeriodicalId":409822,"journal":{"name":"2022 IEEE 2nd International Conference on Digital Twins and Parallel Intelligence (DTPI)","volume":"103 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design and Application of A Parallel Agent\",\"authors\":\"Libin Dong, Cheng Yang, Keyao Wang, Yi Wang, Quan Li, Jiandong Sun\",\"doi\":\"10.1109/DTPI55838.2022.9998899\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The parallel agent learns the model according to the environment feedback, and selects the appropriate amount of action. It consists of automatic modeling system, automatic simulation system and parallel execution system. The automatic modeling system sends an excitation signal to the environment and establishes the model according to the system state data. The automatic simulation system automatically carries out the step response simulation experiment by using predictive dynamic optimization technology. The parallel execution system will automatically switch with the actual system without disturbance, and carry out step disturbance test. If the performance is excellent, it will be put into application. The parallel agent is divided into two processes: learning and experiment. The learning process refers to that agents collect feedback information and learn the model. The experimental process refers to that the agent constructs the predictive dynamic optimization closed-loop control system based on the obtained model and carries out the step response simulation experiment, and switches to the actual system. If the step disturbance test has excellent performance, it will be put into operation. Based on the parallel control theory, the parallel agent automatically completes the whole process according to the step sequence and has a certain adaptive ability.\",\"PeriodicalId\":409822,\"journal\":{\"name\":\"2022 IEEE 2nd International Conference on Digital Twins and Parallel Intelligence (DTPI)\",\"volume\":\"103 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 2nd International Conference on Digital Twins and Parallel Intelligence (DTPI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DTPI55838.2022.9998899\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 2nd International Conference on Digital Twins and Parallel Intelligence (DTPI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DTPI55838.2022.9998899","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
The parallel agent learns the model according to the environment feedback, and selects the appropriate amount of action. It consists of automatic modeling system, automatic simulation system and parallel execution system. The automatic modeling system sends an excitation signal to the environment and establishes the model according to the system state data. The automatic simulation system automatically carries out the step response simulation experiment by using predictive dynamic optimization technology. The parallel execution system will automatically switch with the actual system without disturbance, and carry out step disturbance test. If the performance is excellent, it will be put into application. The parallel agent is divided into two processes: learning and experiment. The learning process refers to that agents collect feedback information and learn the model. The experimental process refers to that the agent constructs the predictive dynamic optimization closed-loop control system based on the obtained model and carries out the step response simulation experiment, and switches to the actual system. If the step disturbance test has excellent performance, it will be put into operation. Based on the parallel control theory, the parallel agent automatically completes the whole process according to the step sequence and has a certain adaptive ability.