Online Learning of Goal-Oriented Status Updating With Unknown Delay Statistics

Fuzhou Peng;Xijun Wang;Xiang Chen
{"title":"Online Learning of Goal-Oriented Status Updating With Unknown Delay Statistics","authors":"Fuzhou Peng;Xijun Wang;Xiang Chen","doi":"10.1109/JSAC.2024.3431522","DOIUrl":null,"url":null,"abstract":"With the proliferation of communication demand, goal-oriented communication goes beyond traditional bit-level approaches by emphasizing the significance of information and its relevance to specific goals. This paper addresses the goal-oriented status updating problem, where detecting status changes is crucial. We employ the Age of Changed Information (AoCI) as a metric, which considers both the timeliness and content of the update. Our goal is to minimize the weighted sum of AoCI and transmission cost without channel delay statistics. The investigated problem is formulated as a semi-Markov decision process (SMDP) and is tackled by converting it into a multi-variable optimization problem. We prove that the optimal updating policy is of threshold type, and derive a nearly closed-form expression for the optimal threshold. When delay statistics are available, the optimal threshold can be obtained by a bisection searching algorithm. In the absence of prior delay statistics, we develop an online learning policy. We demonstrate that the optimality gap decays at a rate of \n<inline-formula> <tex-math>$\\mathcal {O}(\\log K / K)$ </tex-math></inline-formula>\n, where K is the number of samples. Simulation results are presented to compare the performance of various policies under different statistical conditions, showcasing the superiority of our proposed algorithm.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"42 11","pages":"3293-3305"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10605803/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

With the proliferation of communication demand, goal-oriented communication goes beyond traditional bit-level approaches by emphasizing the significance of information and its relevance to specific goals. This paper addresses the goal-oriented status updating problem, where detecting status changes is crucial. We employ the Age of Changed Information (AoCI) as a metric, which considers both the timeliness and content of the update. Our goal is to minimize the weighted sum of AoCI and transmission cost without channel delay statistics. The investigated problem is formulated as a semi-Markov decision process (SMDP) and is tackled by converting it into a multi-variable optimization problem. We prove that the optimal updating policy is of threshold type, and derive a nearly closed-form expression for the optimal threshold. When delay statistics are available, the optimal threshold can be obtained by a bisection searching algorithm. In the absence of prior delay statistics, we develop an online learning policy. We demonstrate that the optimality gap decays at a rate of $\mathcal {O}(\log K / K)$ , where K is the number of samples. Simulation results are presented to compare the performance of various policies under different statistical conditions, showcasing the superiority of our proposed algorithm.
具有未知延迟统计的目标导向状态更新在线学习
随着通信需求的激增,以目标为导向的通信超越了传统的比特级方法,强调信息的重要性及其与特定目标的相关性。本文探讨了以目标为导向的状态更新问题,其中检测状态变化至关重要。我们采用 "变化信息年龄"(AoCI)作为衡量标准,它同时考虑了更新的及时性和内容。我们的目标是在不统计信道延迟的情况下,使 AoCI 和传输成本的加权和最小化。所研究的问题被表述为半马尔可夫决策过程(SMDP),并通过将其转换为多变量优化问题来解决。我们证明了最优更新策略属于阈值类型,并推导出最优阈值的近似闭式表达式。在有延迟统计数据的情况下,可以通过分段搜索算法获得最佳阈值。在没有先验延迟统计数据的情况下,我们开发了一种在线学习策略。我们证明,最优性差距的衰减率为 $\mathcal {O}(\log K / K)$ ,其中 K 是样本数。仿真结果比较了不同统计条件下各种策略的性能,展示了我们提出的算法的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信