{"title":"An Entropy Sumset Inequality and Polynomially Fast Convergence to Shannon Capacity Over All Alphabets","authors":"V. Guruswami, A. Velingker","doi":"10.4230/LIPIcs.CCC.2015.42","DOIUrl":null,"url":null,"abstract":"We prove a lower estimate on the increase in entropy when two copies of a conditional random variable X|Y, with X supported on Zq = {0,1,..., q − 1} for prime q, are summed modulo q. Specifically, given two i.i.d. copies (X1, Y1) and (X2, Y2) of a pair of random variables (X, Y), with X taking values in Zq, we show \n \nH(X1 + X2 | Y1, Y2) - H(X|Y ) ≥ α(q) · H(X|Y)(1 - H(X|Y)) \n \nfor some α(q) > 0, where H (·) is the normalized (by factor log2q) entropy. In particular, if X|Y is not close to being fully random or fully deterministic and H(X|Y) ∈ (γ, 1-γ), then the entropy of the sum increases by Ωq (γ). Our motivation is an effective analysis of the finite-length behavior of polar codes, for which the linear dependence on γ is quantitatively important. The assumption of q being prime is necessary: for X supported uniformly on a proper subgroup of Zq we have H(X + X) = H(X). For X supported on infinite groups without a finite subgroup (the torsion-free case) and no conditioning, a sumset inequality for the absolute increase in (unnormalized) entropy was shown by Tao in [20]. \n \nWe use our sumset inequality to analyze Arikan's construction of polar codes and prove that for any q-ary source X, where q is any fixed prime, and any e > 0, polar codes allow efficient data compression of N i.i.d. copies of X into (H(X) + e)N q-ary symbols, as soon as N is polynomially large in 1/e. We can get capacity-achieving source codes with similar guarantees for composite alphabets, by factoring q into primes and combining different polar codes for each prime in factorization. \n \nA consequence of our result for noisy channel coding is that for all discrete memoryless channels, there are explicit codes enabling reliable communication within e > 0 of the symmetric Shannon capacity for a block length and decoding complexity bounded by a polynomial in 1/e. The result was previously shown for the special case of binary-input channels [7, 9], and this work extends the result to channels over any alphabet.","PeriodicalId":246506,"journal":{"name":"Cybersecurity and Cyberforensics Conference","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"20","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cybersecurity and Cyberforensics Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4230/LIPIcs.CCC.2015.42","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 20
Abstract
We prove a lower estimate on the increase in entropy when two copies of a conditional random variable X|Y, with X supported on Z_q = {0, 1, ..., q − 1} for prime q, are summed modulo q. Specifically, given two i.i.d. copies (X₁, Y₁) and (X₂, Y₂) of a pair of random variables (X, Y), with X taking values in Z_q, we show
H(X₁ + X₂ | Y₁, Y₂) − H(X|Y) ≥ α(q) · H(X|Y) · (1 − H(X|Y))
for some α(q) > 0, where H(·) is the entropy normalized by the factor log₂ q. In particular, if X|Y is neither close to fully random nor close to fully deterministic, i.e., H(X|Y) ∈ (γ, 1 − γ), then the entropy of the sum increases by Ω_q(γ). Our motivation is an effective analysis of the finite-length behavior of polar codes, for which the linear dependence on γ is quantitatively important. The assumption that q is prime is necessary: for X supported uniformly on a nontrivial proper subgroup of Z_q (which exists exactly when q is composite), we have H(X + X) = H(X). For X supported on a torsion-free group (an infinite group with no finite subgroup), and without conditioning, a sumset inequality for the absolute increase in (unnormalized) entropy was shown by Tao [20].
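The inequality can be checked numerically in the unconditional special case (trivial Y). The sketch below is illustrative only and uses our own function names: it draws a random distribution on Z_q, computes the normalized entropy of X and of X₁ + X₂ (mod q), and then reproduces the subgroup counterexample for composite q = 4.

```python
import math

import numpy as np

def normalized_entropy(p, q):
    """Shannon entropy of distribution p in bits, normalized by log2(q)."""
    p = np.asarray(p, dtype=float)
    p = p[p > 0]
    return float(-np.sum(p * np.log2(p)) / math.log2(q))

def sum_distribution(p, q):
    """Distribution of X1 + X2 (mod q) for i.i.d. X1, X2 ~ p (cyclic convolution)."""
    s = np.zeros(q)
    for a in range(q):
        for b in range(q):
            s[(a + b) % q] += p[a] * p[b]
    return s

q = 5  # a prime alphabet size
rng = np.random.default_rng(0)
p = rng.dirichlet(np.ones(q))  # a random distribution on Z_q

h = normalized_entropy(p, q)
h_sum = normalized_entropy(sum_distribution(p, q), q)
print(f"H(X) = {h:.4f}  H(X1+X2) = {h_sum:.4f}  gain = {h_sum - h:.4f}")

# Why primality matters: for composite q = 4 and X uniform on the proper
# subgroup {0, 2}, the sum X1 + X2 mod 4 is again uniform on {0, 2},
# so the entropy does not increase at all.
p4 = np.array([0.5, 0.0, 0.5, 0.0])
print(normalized_entropy(p4, 4), normalized_entropy(sum_distribution(p4, 4), 4))
```

For prime q the gain is strictly positive whenever 0 < H(X) < 1, matching the α(q) · H(X)(1 − H(X)) lower bound above; for the q = 4 subgroup example both printed entropies equal 0.5.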
We use our sumset inequality to analyze Arikan's construction of polar codes and prove that for any q-ary source X, where q is any fixed prime, and any ε > 0, polar codes allow efficient data compression of N i.i.d. copies of X into (H(X) + ε)N q-ary symbols, as soon as N is polynomially large in 1/ε. Capacity-achieving source codes with similar guarantees can be obtained for composite alphabets by factoring q into primes and combining polar codes for each prime in the factorization.
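As a rough illustration of the object being analyzed, here is a minimal sketch of one common recursive form of the Arikan-style polarization transform over Z_q, built from the invertible 2×2 kernel (a, b) ↦ (a + b mod q, b). The function name is ours, the exact index ordering in Arikan's construction differs by a bit-reversal permutation, and the actual successive-cancellation encoder/decoder is omitted.

```python
import numpy as np

def polar_transform(x, q):
    """Recursively apply the 2x2 kernel (a, b) -> (a + b mod q, b) to a block.

    For a block of N = 2^n i.i.d. source symbols, the conditional entropies
    of the transformed symbols polarize toward 0 or 1; the entropy sumset
    inequality controls how fast this polarization happens.
    """
    x = np.asarray(x) % q
    if len(x) == 1:
        return x
    u = (x[0::2] + x[1::2]) % q  # pairwise sums: entropy pushed up
    v = x[1::2]                  # second coordinates: entropy pushed down
    return np.concatenate([polar_transform(u, q), polar_transform(v, q)])

# Example: transform a block of 8 symbols over Z_5. For source coding, one
# would store only the transformed coordinates with non-negligible
# conditional entropy, roughly (H(X) + eps) * N of them.
rng = np.random.default_rng(1)
x = rng.integers(0, 5, size=8)
print(x, "->", polar_transform(x, 5))
```

The kernel step is invertible (a = u − v mod q, b = v), so the transform loses no information; compression comes from discarding the coordinates that polarize to near-zero conditional entropy.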
A consequence of our result for noisy channel coding is that for all discrete memoryless channels, there are explicit codes enabling reliable communication within ε > 0 of the symmetric Shannon capacity, with block length and decoding complexity bounded by a polynomial in 1/ε. This result was previously known for the special case of binary-input channels [7, 9]; this work extends it to channels over any alphabet.