{"title":"非授权频谱中URLLC传输的模型辅助学习","authors":"A. Hindi, S. Elayoubi, T. Chahed","doi":"10.1109/MASCOTS50786.2020.9285938","DOIUrl":null,"url":null,"abstract":"We focus in this paper on the transport of critical services in unlicensed spectrum, where stringent constraints on latency and reliability are to be met, in the context of Ultra-Reliable Low Latency Communication (URLLC). Since contention-based medium access performs poorly in the case of high traffic load, we propose a new transmission scheme where the transmitter can increase its transmission power when the delay of the packet approaches the delay constraint, increasing by that its chance of being decoded even in case of collision with other lower-power packets. We are however interested in minimizing the usage of high power transmissions, mainly to conserve energy for battery-powered devices and to limit the range of interference. Therefore, we define a transmission policy that makes use of a delay threshold after which the high-power transmission starts, and propose a new online-learning approach based on Multi-Armed Bandit (MAB) in order to identify the policy which achieves minimum energy consumption while guaranteeing reliability. However, we observe that the MAB converges slowly to the optimal policy because the loss event is rare in the load regime of interest. We then propose a model-aided learning approach where a simple analytical model helps estimating the longterm reliability resulting from an action and thus its reward. Our results show a significant enhancement of the convergence towards the optimal policy.","PeriodicalId":272614,"journal":{"name":"2020 28th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Model-Aided Learning for URLLC Transmission in Unlicensed Spectrum\",\"authors\":\"A. Hindi, S. Elayoubi, T. Chahed\",\"doi\":\"10.1109/MASCOTS50786.2020.9285938\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We focus in this paper on the transport of critical services in unlicensed spectrum, where stringent constraints on latency and reliability are to be met, in the context of Ultra-Reliable Low Latency Communication (URLLC). Since contention-based medium access performs poorly in the case of high traffic load, we propose a new transmission scheme where the transmitter can increase its transmission power when the delay of the packet approaches the delay constraint, increasing by that its chance of being decoded even in case of collision with other lower-power packets. We are however interested in minimizing the usage of high power transmissions, mainly to conserve energy for battery-powered devices and to limit the range of interference. Therefore, we define a transmission policy that makes use of a delay threshold after which the high-power transmission starts, and propose a new online-learning approach based on Multi-Armed Bandit (MAB) in order to identify the policy which achieves minimum energy consumption while guaranteeing reliability. However, we observe that the MAB converges slowly to the optimal policy because the loss event is rare in the load regime of interest. We then propose a model-aided learning approach where a simple analytical model helps estimating the longterm reliability resulting from an action and thus its reward. Our results show a significant enhancement of the convergence towards the optimal policy.\",\"PeriodicalId\":272614,\"journal\":{\"name\":\"2020 28th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 28th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MASCOTS50786.2020.9285938\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 28th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems (MASCOTS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MASCOTS50786.2020.9285938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Model-Aided Learning for URLLC Transmission in Unlicensed Spectrum
We focus in this paper on the transport of critical services in unlicensed spectrum, where stringent constraints on latency and reliability are to be met, in the context of Ultra-Reliable Low Latency Communication (URLLC). Since contention-based medium access performs poorly in the case of high traffic load, we propose a new transmission scheme where the transmitter can increase its transmission power when the delay of the packet approaches the delay constraint, increasing by that its chance of being decoded even in case of collision with other lower-power packets. We are however interested in minimizing the usage of high power transmissions, mainly to conserve energy for battery-powered devices and to limit the range of interference. Therefore, we define a transmission policy that makes use of a delay threshold after which the high-power transmission starts, and propose a new online-learning approach based on Multi-Armed Bandit (MAB) in order to identify the policy which achieves minimum energy consumption while guaranteeing reliability. However, we observe that the MAB converges slowly to the optimal policy because the loss event is rare in the load regime of interest. We then propose a model-aided learning approach where a simple analytical model helps estimating the longterm reliability resulting from an action and thus its reward. Our results show a significant enhancement of the convergence towards the optimal policy.