Polynomial-time trace reconstruction in the smoothed complexity model

Xi Chen, Anindya De, Chin Ho Lee, R. Servedio, S. Sinha
{"title":"Polynomial-time trace reconstruction in the smoothed complexity model","authors":"Xi Chen, Anindya De, Chin Ho Lee, R. Servedio, S. Sinha","doi":"10.1145/3560819","DOIUrl":null,"url":null,"abstract":"In the trace reconstruction problem, an unknown source string x ∈ {0, 1}n is sent through a probabilistic deletion channel which independently deletes each bit with probability δ and concatenates the surviving bits, yielding a trace of x. The problem is to reconstruct x given independent traces. This problem has received much attention in recent years both in the worst-case setting where x may be an arbitrary string in {0, 1}n [DOS19, NP17, HHP18, HL20, Cha21a, Cha21b] and in the average-case setting where x is drawn uniformly at random from {0, 1}n [PZ17, HPP18, HL20, Cha21a, Cha21b]. This paper studies trace reconstruction in the smoothed analysis setting, in which a “worst-case” string xworst is chosen arbitrarily from {0, 1}n, and then a perturbed version x of xworst is formed by independently replacing each coordinate by a uniform random bit with probability σ. The problem is to reconstruct x given independent traces from it. Our main result is an algorithm which, for any constant perturbation rate 0 < σ < 1 and any constant deletion rate 0 < δ < 1, uses poly(n) running time and traces and succeeds with high probability in reconstructing the string x. This stands in contrast with the worst-case version of the problem, for which \\(\\text{exp}(\\tilde{O}(n^{1/5})) \\) is the best known time and sample complexity [Cha21b]. Our approach is based on reconstructing x from the multiset of its short subwords and is quite different from previous algorithms for either the worst-case or average-case versions of the problem. The heart of our work is a new poly(n)-time procedure for reconstructing the multiset of all O(log n)-length subwords of any source string x ∈ {0, 1}n given access to traces of x.","PeriodicalId":154047,"journal":{"name":"ACM Transactions on Algorithms (TALG)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Algorithms (TALG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3560819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 22

Abstract

In the trace reconstruction problem, an unknown source string x ∈ {0, 1}n is sent through a probabilistic deletion channel which independently deletes each bit with probability δ and concatenates the surviving bits, yielding a trace of x. The problem is to reconstruct x given independent traces. This problem has received much attention in recent years both in the worst-case setting where x may be an arbitrary string in {0, 1}n [DOS19, NP17, HHP18, HL20, Cha21a, Cha21b] and in the average-case setting where x is drawn uniformly at random from {0, 1}n [PZ17, HPP18, HL20, Cha21a, Cha21b]. This paper studies trace reconstruction in the smoothed analysis setting, in which a “worst-case” string xworst is chosen arbitrarily from {0, 1}n, and then a perturbed version x of xworst is formed by independently replacing each coordinate by a uniform random bit with probability σ. The problem is to reconstruct x given independent traces from it. Our main result is an algorithm which, for any constant perturbation rate 0 < σ < 1 and any constant deletion rate 0 < δ < 1, uses poly(n) running time and traces and succeeds with high probability in reconstructing the string x. This stands in contrast with the worst-case version of the problem, for which \(\text{exp}(\tilde{O}(n^{1/5})) \) is the best known time and sample complexity [Cha21b]. Our approach is based on reconstructing x from the multiset of its short subwords and is quite different from previous algorithms for either the worst-case or average-case versions of the problem. The heart of our work is a new poly(n)-time procedure for reconstructing the multiset of all O(log n)-length subwords of any source string x ∈ {0, 1}n given access to traces of x.
光滑复杂度模型中的多项式时间轨迹重建
在迹重建问题中,未知源字符串x∈{0,1}n通过概率删除通道发送,该通道以概率δ独立删除每个位,并将幸存的位连接起来,得到x的迹。问题是在给定独立迹的情况下重建x。在最坏情况下,x可能是{0,1n中的任意字符串[DOS19, NP17}, HHP18, HL20, Cha21a, Cha21b];在平均情况下,x从{0,1n中均匀随机抽取[PZ17, HPP18}, HL20, Cha21a, Cha21b],这个问题近年来受到了广泛关注。本文研究了在光滑分析条件下的轨迹重建,在{0,1}n范围内任意选择一个“最坏情况”字符串xworst,然后用概率为σ的均匀随机位独立替换每个坐标,形成xworst的摄动版本x。问题是重建给定独立轨迹的x。我们的主要结果是一种算法,对于任何恒定扰动率0 < σ < 1和任何恒定删除率0 < δ < 1,使用poly(n)运行时间和跟踪,并以高概率成功重建字符串x。这与问题的最坏情况版本形成对比,其中\(\text{exp}(\tilde{O}(n^{1/5})) \)是最知名的时间和样本复杂性[Cha21b]。我们的方法是基于从它的短子词的多集重构x,并且与以前的最坏情况或平均情况版本的问题的算法有很大不同。我们工作的核心是一个新的多(n)时间过程,用于重建任何源字符串x∈{0,1n的所有O(log n)长度子字的多集},给定访问x的轨迹。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信