Average AoI Minimization in an HARQ-based Status Update System under Random Arrivals

Saeid Sadeghi Vilni, Mohammad Moltafet, Markus Leinonen, M. Codreanu
{"title":"Average AoI Minimization in an HARQ-based Status Update System under Random Arrivals","authors":"Saeid Sadeghi Vilni, Mohammad Moltafet, Markus Leinonen, M. Codreanu","doi":"10.1109/IoTaIS56727.2022.9975894","DOIUrl":null,"url":null,"abstract":"We consider a status update system consisting of one source, one butter-aided transmitter, and one receiver. The source randomly generates status update packets and the transmitter sends the packets to the receiver over an unreliable channel using a hybrid automatic repeat request (HARQ) protocol. The system holds two packets: one packet in the butter, which stores the last generated packet, and one packet currently under service in the transmitter. At each time slot, the transmitter decides whether to stay idle, transmit the last generated packet, or retransmit the packet currently under service. We aim to find the optimal actions at each slot to minimize the average age of information (AoI) of the source under a constraint on the average number of transmissions. We model the problem as a constrained Markov decision process (CMDP) problem and solve it for the known and unknown learning environment as follows. First, we use the Lagrangian approach to transform the CMDP problem to an MDP problem which is solved with the relative value iteration (RVI) for the known environment and with deep Q-learning (DQL) algorithm for the unknown environment. Second, we use the Lyapunov method to transform the CMDP problem to an MDP problem which is solved with DQL algorithm for the unknown environment. Simulation results assess the effectiveness of the proposed approaches.","PeriodicalId":138894,"journal":{"name":"2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Internet of Things and Intelligence Systems (IoTaIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IoTaIS56727.2022.9975894","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We consider a status update system consisting of one source, one butter-aided transmitter, and one receiver. The source randomly generates status update packets and the transmitter sends the packets to the receiver over an unreliable channel using a hybrid automatic repeat request (HARQ) protocol. The system holds two packets: one packet in the butter, which stores the last generated packet, and one packet currently under service in the transmitter. At each time slot, the transmitter decides whether to stay idle, transmit the last generated packet, or retransmit the packet currently under service. We aim to find the optimal actions at each slot to minimize the average age of information (AoI) of the source under a constraint on the average number of transmissions. We model the problem as a constrained Markov decision process (CMDP) problem and solve it for the known and unknown learning environment as follows. First, we use the Lagrangian approach to transform the CMDP problem to an MDP problem which is solved with the relative value iteration (RVI) for the known environment and with deep Q-learning (DQL) algorithm for the unknown environment. Second, we use the Lyapunov method to transform the CMDP problem to an MDP problem which is solved with DQL algorithm for the unknown environment. Simulation results assess the effectiveness of the proposed approaches.
随机到达下基于harq的状态更新系统的平均AoI最小化
我们考虑一个由一个源、一个黄油辅助发送器和一个接收器组成的状态更新系统。源随机生成状态更新数据包,发送方使用混合自动重复请求(HARQ)协议通过不可靠的信道将数据包发送给接收方。系统保存两个包:一个包在黄油中,它存储最后生成的包,另一个包目前在发射机中工作。在每个时隙,发送器决定是否保持空闲,传输最后生成的数据包,或者重传当前正在使用的数据包。我们的目标是在平均传输次数的约束下,找到每个时隙的最优操作,以最小化源的平均信息年龄(AoI)。我们将该问题建模为约束马尔可夫决策过程(CMDP)问题,并对已知和未知的学习环境进行如下求解。首先,我们利用拉格朗日方法将CMDP问题转化为MDP问题,在已知环境下使用相对值迭代(RVI),在未知环境下使用深度q -学习(DQL)算法。其次,我们利用Lyapunov方法将CMDP问题转化为MDP问题,并利用DQL算法对未知环境进行求解。仿真结果验证了所提方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信