Optimal Codes Correcting a Substring Edit
Authors: Yuting Li; Yuanyuan Tang; Hao Lou; Ryan Gabrys; Farzad Hassanzadeh Farnoud
Journal: IEEE Transactions on Information Theory, vol. 71, no. 7, pp. 5178-5191
Published: 2025-04-21
DOI: 10.1109/TIT.2025.3562730
URL: https://ieeexplore.ieee.org/document/10971390/
Citation count: 0
Abstract
The substring edit error replaces a substring $\boldsymbol{u}$ of $\boldsymbol{x}$ with another string $\boldsymbol{v}$, where the lengths of $\boldsymbol{u}$ and $\boldsymbol{v}$ are bounded by a given constant $k$. It encompasses localized insertions, deletions, and substitutions within a window. Codes correcting one substring edit have redundancy at least $\log n+k$. In this paper, we construct codes correcting one substring edit with redundancy $\log n+O_{k}(\log \log n)$, which is almost optimal. We also study the average-case document-exchange problem under one substring edit and construct a hash with an expected length of approximately $2\log n+O_{k}(\log \log n)$ for any i.i.d. distribution for the documents.
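To make the error model concrete, the following is a minimal Python sketch of a single substring edit as described in the abstract. It illustrates only the channel, not the paper's code construction or decoder. The function name `substring_edit`, its parameters, and the reading of "bounded by $k$" as "at most $k$" are assumptions introduced for illustration.

```python
def substring_edit(x: str, start: int, u_len: int, v: str, k: int) -> str:
    """Replace u = x[start:start + u_len] with v (hypothetical helper).

    Assumes the lengths of u and v are each at most k, following the
    abstract's description of the error model.
    """
    if u_len < 0 or start < 0 or start + u_len > len(x):
        raise ValueError("substring u must lie inside x")
    if u_len > k or len(v) > k:
        raise ValueError("lengths of u and v must be at most k")
    return x[:start] + v + x[start + u_len:]


x = "0110100"
k = 3

# Burst deletion: u = "10" is replaced by the empty string.
deleted = substring_edit(x, 2, 2, "", k)

# Burst insertion: the empty substring at position 2 is replaced by "11".
inserted = substring_edit(x, 2, 0, "11", k)

# Localized substitution: u = "101" is replaced by a string of equal length.
substituted = substring_edit(x, 2, 3, "001", k)

# Mixed edit: u = "110" is replaced by the shorter string "0".
mixed = substring_edit(x, 1, 3, "0", k)
```

Each of the four calls is an instance of the same operation, which is why a code correcting one substring edit also handles localized insertions, deletions, and substitutions within a window of length $k$.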
About the journal:
The IEEE Transactions on Information Theory is a journal that publishes theoretical and experimental papers concerned with the transmission, processing, and utilization of information. The boundaries of acceptable subject matter are intentionally not sharply delimited. Rather, it is hoped that as the focus of research activity changes, a flexible policy will permit this Transactions to follow suit. Current appropriate topics are best reflected by recent Tables of Contents; they are summarized in the titles of editorial areas that appear on the inside front cover.