{"title":"Regret Analysis of Certainty Equivalence Policies in Continuous-Time Linear-Quadratic Systems","authors":"Mohamad Kazem Shirani Faradonbeh","doi":"10.48550/arXiv.2206.04434","DOIUrl":"https://doi.org/10.48550/arXiv.2206.04434","url":null,"abstract":"This work theoretically studies a ubiquitous reinforcement learning policy for controlling the canonical model of continuous-time stochastic linear-quadratic systems. We show that the randomized certainty equivalent policy addresses the exploration-exploitation dilemma in linear control systems that evolve according to unknown stochastic differential equations and whose operating cost is quadratic. More precisely, we establish square-root-of-time regret bounds, indicating that the randomized certainty equivalent policy learns optimal control actions fast from a single state trajectory. Further, linear scaling of the regret with the number of parameters is shown. The presented analysis introduces novel and useful technical approaches, and sheds light on fundamental challenges of continuous-time reinforcement learning.","PeriodicalId":347792,"journal":{"name":"International Conference on System Theory, Control and Computing","volume":"102 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-06-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133690477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Cyber Security Reactivity in Crisis Times and Critical Infrastructures","authors":"M. Șcheau, V. Gaftea, M. Achim, C. Cotoc","doi":"10.1109/ICSTCC50638.2020.9259695","DOIUrl":"https://doi.org/10.1109/ICSTCC50638.2020.9259695","url":null,"abstract":"Technological and social developments lead to repositioning of criminal actions and adapted responses of Information Systems. On the invisible front, confrontations between national or international organizations and entities that are strictly economically motivated or supported by terrorist groups take place. The direct and indirect effects are difficult to predict due to the high degree of uncertainty of the phenomenon as a whole. The mobility of the relevant factors is quite high and that is why the algorithms use probabilistic models. The losses are quantified as a post factum effect of the events. The present study aims to present Data Mining and Analysis of possible impact that can be felt, starting from a review of actions against official institutions, transformations that occur in crises and finally, a set of proposals in support of alignment with common international standards.","PeriodicalId":347792,"journal":{"name":"International Conference on System Theory, Control and Computing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2020-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123222532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Modeling and Control of Anti-lock Braking systems considering different representations for tire-road interaction","authors":"I. Maia","doi":"10.1109/ICSTCC.2019.8885694","DOIUrl":"https://doi.org/10.1109/ICSTCC.2019.8885694","url":null,"abstract":"Mobility and traffic safety are among the major concerns in today’s society, as traffic accidents continue to be a leading cause of death worldwide. Traffic safety is influenced by several factors, one of the most important being the driver’s control of the vehicle in all situations. Modern safety systems, among them the anti-lock braking system (ABS), have reduced the number of uncontrollable incidents in traffic, protecting passengers from accidents. Together with safety issues, customers’ growing demands when purchasing vehicles regarding efficiency, lower consumption, reduced emissions and high comfort encourage engineers to seek better model representations of the vehicle dynamics and improved, simplified control techniques. This article presents the modeling and simulation of an ABS considering different theories for representing tire-road contact.","PeriodicalId":347792,"journal":{"name":"International Conference on System Theory, Control and Computing","volume":"81 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2019-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116027558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Analysis of file transfer traffic using MPLS technology","authors":"A. Pica, G. Predusca, L. Circiumarescu, N. Angelescu, D. Puchianu","doi":"10.1109/ICSTCC55426.2022.9931894","DOIUrl":"https://doi.org/10.1109/ICSTCC55426.2022.9931894","url":null,"abstract":"The paper aims to analyse high-speed Multiprotocol Label Switching (MPLS) networks, specifying the basic principles of label switching, alternative routing optimization mechanisms, error recovery problems and virtual private network implementation solutions. The main goal of MPLS is to integrate IP traffic with high-speed optical networks, which are becoming increasingly accessible. On the other hand, traffic speed is only one aspect of QoS; another key dimension is the ability to recover quickly from error. Therefore, the most important task of this paper is to optimize the use of network bandwidth without using too essential information.","PeriodicalId":347792,"journal":{"name":"International Conference on System Theory, Control and Computing","volume":"5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122151095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Automated Modular Architecture with Cooperative Facilities","authors":"Horatiu Roibu, Lidia-Cristina Bazavan, I. Reșceanu, Dan Andritoiu, N. Bîzdoaca","doi":"10.1109/ICSTCC.2019.8885473","DOIUrl":"https://doi.org/10.1109/ICSTCC.2019.8885473","url":null,"abstract":"The studies and experiments performed by the authors in order to implement a semi-fabricated AGV transfer system led to the need to identify solutions that would allow predictive maintenance as well as intervention with dead times as low as possible. Under these conditions the authors propose a specific modular design concept that ensures a significant increase in system reliability. Each redesigned module has its own predictive maintenance test procedure, communicating with the central system that oversees general maintenance. If one of the modules generates a warning signal, the central system will decide either to continue the activity with warning metering, or, depending on the degree of accusation, by calling the intervention service and by rapidly modifying the defect subsystem due to the modular design. The article presents the experiments on modular redesign and the concept of modularized predictive maintenance management.","PeriodicalId":347792,"journal":{"name":"International Conference on System Theory, Control and Computing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125214397","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}