Model-Free Learning Algorithms for Dynamic Transmission Control in IoT Equipment

2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS) Pub Date : 2021-12-29 DOI:10.1109/ICSPIS54653.2021.9729333

Hanieh Malekijou, Vesal Hakami

引用次数: 0

Abstract

We consider an energy-harvesting IoT device transmitting delay- and jitter-sensitive data over a wireless fading channel. Given the limited harvested energy, our goal is to compute optimal transmission control policies that decide on how many packets of data should be transmitted from the buffer's head-of-line at each discrete timeslot such that a long-run criterion involving the average delay/jitter is either minimized or never exceeds a pre-specified threshold. We utilize a suite of Q-learning-based techniques (from the reinforcement learning theory) to optimize the transmission policy in a model-free fashion. Compared to prior work, our novelty lies in proposing a model-free learning algorithm that enables jitter-aware transmissions by penalizing control decisions with the variance of the delay cost function. Extensive numerical results are presented for performance evaluation.

查看原文本刊更多论文

物联网设备动态传输控制的无模型学习算法

我们考虑一种能量收集物联网设备，通过无线衰落信道传输延迟和抖动敏感数据。给定有限的能量，我们的目标是计算最优的传输控制策略，以决定在每个离散时点应从缓冲区的线路头传输多少数据包，从而使涉及平均延迟/抖动的长期标准最小化或永远不超过预先指定的阈值。我们利用一套基于q学习的技术(来自强化学习理论)以无模型的方式优化传输策略。与之前的工作相比，我们的新颖之处在于提出了一种无模型学习算法，该算法通过用延迟代价函数的方差惩罚控制决策来实现抖动感知传输。广泛的数值结果提出了性能评估。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 7th International Conference on Signal Processing and Intelligent Systems (ICSPIS)

自引率

0.00%

发文量