IEEE Transactions on Information Theory最新文献_第2页

Subspace and DOA Estimation Under Coarse Quantization 粗量化下的子空间与DOA估计

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-13 DOI: 10.1109/TIT.2025.3598702

Sjoerd Dirksen;Weilin Li;Johannes Maly

引用次数: 0

Sample-Efficient Reinforcement Learning From Human Feedback via Information-Directed Sampling 基于信息导向采样的人类反馈样本高效强化学习

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-13 DOI: 10.1109/TIT.2025.3598296

Han Qi;Haochen Yang;Qiaosheng Zhang;Zhuoran Yang

{"title":"Sample-Efficient Reinforcement Learning From Human Feedback via Information-Directed Sampling","authors":"Han Qi;Haochen Yang;Qiaosheng Zhang;Zhuoran Yang","doi":"10.1109/TIT.2025.3598296","DOIUrl":"https://doi.org/10.1109/TIT.2025.3598296","url":null,"abstract":"We study the problem of reinforcement learning from human feedback (RLHF), a critical problem in training large language models, from a theoretical perspective. Our main contribution is the design of novel sample-efficient RLHF algorithms based on information-directed sampling (IDS), an online decision-making principle inspired by information theory. Our algorithms maximize the sum of the value function and a mutual information term that encourages exploration of the unknown environment (which quantifies the information gained about the environment through observed human feedback data). To tackle the challenge of large state spaces and improve sample efficiency, we construct a simplified <italic>surrogate environment and introduce a novel distance measure (named the <inline-formula> <tex-math>$ell _{g}$ </tex-math></inline-formula><italic>-distance), enabling our IDS-based algorithm to achieve a Bayesian regret upper bound of order <inline-formula> <tex-math>$O(H^{3/2}sqrt {log (K(epsilon)) T})$ </tex-math></inline-formula>, where <italic>H is the episode length, <italic>T is the number of episode and <inline-formula> <tex-math>$K(epsilon)$ </tex-math></inline-formula> is related to the covering number of the environment. Specializing to the tabular settings, this regret bound is of order <inline-formula> <tex-math>$tilde {O}(H^{2}sqrt {SAT})$ </tex-math></inline-formula>, where <italic>S and <italic>A are the numbers of states and actions. Finally, we propose an Approximate-IDS algorithm that is computationally more efficient while maintaining nearly the same sample efficiency. The design principle of this approximate algorithm is not only effective in RLHF settings but also applicable to the standard RL framework. Moreover, our work showcases the value of information theory in reinforcement learning and in the training of large language models.","PeriodicalId":13494,"journal":{"name":"IEEE Transactions on Information Theory","volume":"71 10","pages":"7942-7958"},"PeriodicalIF":2.9,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145110261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bregman-Divergence-Based Arimoto-Blahut Algorithm 基于bregman - divergence的Arimoto-Blahut算法

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-12 DOI: 10.1109/TIT.2025.3597943

Masahito Hayashi

引用次数: 0

On Non-Interactive Simulation of Distributed Sources With Finite Alphabets 有限字母分布源的非交互仿真

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-11 DOI: 10.1109/TIT.2025.3597546

Hojat Allah Salehi;Farhad Shirani

{"title":"On Non-Interactive Simulation of Distributed Sources With Finite Alphabets","authors":"Hojat Allah Salehi;Farhad Shirani","doi":"10.1109/TIT.2025.3597546","DOIUrl":"https://doi.org/10.1109/TIT.2025.3597546","url":null,"abstract":"This work presents a Fourier analysis framework for the non-interactive source simulation (NISS) problem. Two distributed agents observe a pair of sequences <inline-formula> <tex-math>$X^{d}$ </tex-math></inline-formula> and <inline-formula> <tex-math>$Y^{d}$ </tex-math></inline-formula> drawn according to a joint distribution <inline-formula> <tex-math>$P_{X^{d}Y^{d}}$ </tex-math></inline-formula>. The agents aim to generate outputs <inline-formula> <tex-math>$U=f_{d}(X^{d})$ </tex-math></inline-formula> and <inline-formula> <tex-math>$V=g_{d}(Y^{d})$ </tex-math></inline-formula> with a joint distribution sufficiently close in total variation to a target distribution <inline-formula> <tex-math>$Q_{UV}$ </tex-math></inline-formula>. Existing works have shown that the NISS problem with finite-alphabet outputs is decidable. For the binary-output NISS, an upper-bound to the input complexity was derived which is <inline-formula> <tex-math>$Oleft ({{exp mathrm {poly}left ({{frac {1}{epsilon }}}right)}}right)$ </tex-math></inline-formula>. In this work, the input complexity and algorithm design are addressed in several classes of NISS scenarios. For binary-output NISS scenarios with doubly-symmetric binary inputs, it is shown that the input complexity is <inline-formula> <tex-math>$Theta left ({{log {frac {1}{epsilon }}}}right)$ </tex-math></inline-formula>, thus providing a super-exponential improvement in input complexity. An explicit characterization of the simulating pair of functions is provided. For general finite-input scenarios, a constructive algorithm is introduced that explicitly finds the simulating functions <inline-formula> <tex-math>$(f_{d}(X^{d}),g_{d}(Y^{d}))$ </tex-math></inline-formula>. The approach relies on a novel Fourier analysis framework. Various numerical simulations of NISS scenarios with IID inputs are provided. Furthermore, to illustrate the general applicability of the Fourier framework, several examples with non-IID inputs, including entanglement-assisted NISS and NISS with Markovian inputs are provided.","PeriodicalId":13494,"journal":{"name":"IEEE Transactions on Information Theory","volume":"71 10","pages":"8048-8079"},"PeriodicalIF":2.9,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145110322","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Properties of Algorithmic Information Distance 算法信息距离的性质

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-08 DOI: 10.1109/TIT.2025.3597092

Marcus Hutter

引用次数: 0

Analysis of Functions of Low Differential Uniformity in Characteristic 2: A New Approach (I) 特性2低差分均匀性函数分析：一种新方法（一）

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-08 DOI: 10.1109/TIT.2025.3597162

Nurdagül Anbar;Tekgül Kalaycı;Alev Topuzoğlu

{"title":"Analysis of Functions of Low Differential Uniformity in Characteristic 2: A New Approach (I)","authors":"Nurdagül Anbar;Tekgül Kalaycı;Alev Topuzoğlu","doi":"10.1109/TIT.2025.3597162","DOIUrl":"https://doi.org/10.1109/TIT.2025.3597162","url":null,"abstract":"We introduce a new concept, the <italic>APN-defect, which can be thought of as measuring the distance of a given function <inline-formula> <tex-math>$G:mathbb {F}_{2^{n}} rightarrow mathbb {F}_{2^{n}}$ </tex-math></inline-formula> to the set of almost perfect nonlinear (APN) functions. This concept is motivated by the detailed analysis of the differential behaviour of non-APN functions (of low differential uniformity) <italic>G using the so-called <italic>difference squares. Indeed, the insight into some structural qualities of S-boxes provided by this new approach is particularly useful in the light of recent refinements of differential cryptanalysis. We describe the relations between the APN-defect and other current concepts of similar nature. Values of APN-defect for several classes of functions of interest, including Dembowski-Ostrom polynomials are given. This enables one to identify the <italic>quasi-APN ones, i.e., those with favourable differential behavior. The difference square corresponding to a modification of the inverse function is determined, its APN-defect depending on <italic>n is evaluated, the partial quadruple system associated to it is described, and the implications are discussed. In the forthcoming second part of this work we further examine the APN-defect of modifications of the inverse function and address some questions concerning CCZ-equivalence. We also study modifications of classes of functions of low differential uniformity over infinitely many extensions of <inline-formula> <tex-math>$mathbb {F}_{2^{n}}$ </tex-math></inline-formula> and present quantitative results on their differential behaviour.","PeriodicalId":13494,"journal":{"name":"IEEE Transactions on Information Theory","volume":"71 10","pages":"8002-8016"},"PeriodicalIF":2.9,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145110319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Periodic Gaussian Process Controlled B-Spline for Scalable Modeling of Irregularly Spaced Signals 周期高斯过程控制b样条用于不规则间隔信号的可扩展建模

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-07 DOI: 10.1109/TIT.2025.3595144

Yongxiang Li;Yuanyuan Li;Di Wang

{"title":"Periodic Gaussian Process Controlled B-Spline for Scalable Modeling of Irregularly Spaced Signals","authors":"Yongxiang Li;Yuanyuan Li;Di Wang","doi":"10.1109/TIT.2025.3595144","DOIUrl":"https://doi.org/10.1109/TIT.2025.3595144","url":null,"abstract":"Existing periodic Gaussian process (PGP) modeling methods rely on the regularly-spaced-signal assumption (i.e., signals are evenly spaced) and the integer-period assumption for the sake of computational feasibility. However, such an assumption prevents conventional efficient modeling approaches from working properly on irregularly (unevenly) spaced signals, such as evenly spaced signals with missing data. Moreover, without the integer-period assumption, it is computationally prohibitive to accurately search the decimal period of PGP due to the severe non-convexity of its likelihood function. To address these issues, this study proposes a PGP-controlled B-spline for scalable modeling of irregularly spaced signals with a decimal period. The proposed model integrates PGP with B-spline basis functions, allowing for nonlinear and nonparametric modeling of periodic signals. An explore-exploit optimization is developed to overcome the non-convexity of the likelihood, enabling effective and efficient decimal period estimation. The proposed PGP modeling approach has a linear time complexity. Asymptotic properties of the proposed method are studied, which shed light on the period estimation of other PGP models. Simulation and real case studies are conducted to demonstrate the superiority of the proposed method.","PeriodicalId":13494,"journal":{"name":"IEEE Transactions on Information Theory","volume":"71 10","pages":"7842-7855"},"PeriodicalIF":2.9,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145110191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Degree-D Reverse Multiplication-Friendly Embeddings d度反向乘法友好嵌入

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-06 DOI: 10.1109/TIT.2025.3596305

Daniel Escudero;Cheng Hong;Hongqing Liu;Chaoping Xing;Chen Yuan

{"title":"Degree-D Reverse Multiplication-Friendly Embeddings","authors":"Daniel Escudero;Cheng Hong;Hongqing Liu;Chaoping Xing;Chen Yuan","doi":"10.1109/TIT.2025.3596305","DOIUrl":"https://doi.org/10.1109/TIT.2025.3596305","url":null,"abstract":"Reverse multiplication-friendly embeddings have played a crucial role in secure multiparty computation and zero-knowledge proofs. In this work, we generalize the notion of RMFEs to <italic>degree-D RMFEs. We present a general construction of degree-<italic>D RMFEs by generalizing the ideas on algebraic geometry used to construct traditional degree-2 RMFEs. Furthermore, our theory is given in a unified manner for general Galois rings, which include both rings of the form <inline-formula> <tex-math>$mathbb {Z}_{p^{k}}$ </tex-math></inline-formula> and fields like <inline-formula> <tex-math>$mathbb {F}_{p^{k}}$ </tex-math></inline-formula>, which have been treated separately in prior works. We present multiple concrete sets of parameters for degree-<italic>D RMFEs (including <inline-formula> <tex-math>$D=2$ </tex-math></inline-formula>), which can be useful for future works. In the recent work of (Cheon & Lee, Eurocrypt’22), the concept of a <italic>degree-D packing method was formally introduced, which captures the idea of embedding multiple elements of a smaller ring into a larger ring. We show that the generalized notion of RMFEs to <italic>degree-D RMFEs which, in spite of being “more algebraic” than packing methods, turn out to be essentially equivalent. Thus, our constructions of degree-<italic>D RMFEs are also degree-<italic>D packing methods.","PeriodicalId":13494,"journal":{"name":"IEEE Transactions on Information Theory","volume":"71 10","pages":"7990-8001"},"PeriodicalIF":2.9,"publicationDate":"2025-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145110243","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Higher Grassmann Codes III: Quantum Variants 高级格拉斯曼代码III：量子变体

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-06 DOI: 10.1109/TIT.2025.3596479

Mahir Bilen Can;Roy Joshua

引用次数: 0

Distributed Semi-Supervised Inference for Generalized Linear Models With Block-Wise Missing Covariates 具有块型缺失协变量的广义线性模型的分布半监督推理

IF 2.9 3区计算机科学

IEEE Transactions on Information Theory Pub Date : 2025-08-06 DOI: 10.1109/TIT.2025.3596304

Ziyuan Wang;Jin Liu;Jun Shao;Heng Lian;Lei Wang

引用次数: 0