{"title":"Exact Error Exponents of Concatenated Codes for DNA Storage","authors":"Yan Hao Ling, Jonathan Scarlett","doi":"arxiv-2409.01223","DOIUrl":null,"url":null,"abstract":"In this paper, we consider a concatenated coding based class of DNA storage\ncodes in which the selected molecules are constrained to be taken from an\n``inner'' codebook associated with the sequencing channel. This codebook is\nused in a ``black-box'' manner, and is only assumed to operate at an achievable\nrate in the sense of attaining asymptotically vanishing maximal (inner) error\nprobability. We first derive the exact error exponent in a widely-studied\nregime of constant rate and a linear number of sequencing reads, and show\nstrict improvements over an existing achievable error exponent. Moreover, our\nachievability analysis is based on a coded-index strategy, implying that such\nstrategies attain the highest error exponents within the broader class of codes\nthat we consider. We then extend our results to other scaling regimes,\nincluding a super-linear number of reads, as well as several certain low-rate\nregimes. We find that the latter comes with notable intricacies, such as the\nsuboptimality of codewords with all distinct molecules, and certain\ndependencies of the error exponents on the model for sequencing errors.","PeriodicalId":501082,"journal":{"name":"arXiv - MATH - Information Theory","volume":"37 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - MATH - Information Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.01223","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we consider a concatenated coding based class of DNA storage
codes in which the selected molecules are constrained to be taken from an
``inner'' codebook associated with the sequencing channel. This codebook is
used in a ``black-box'' manner, and is only assumed to operate at an achievable
rate in the sense of attaining asymptotically vanishing maximal (inner) error
probability. We first derive the exact error exponent in a widely-studied
regime of constant rate and a linear number of sequencing reads, and show
strict improvements over an existing achievable error exponent. Moreover, our
achievability analysis is based on a coded-index strategy, implying that such
strategies attain the highest error exponents within the broader class of codes
that we consider. We then extend our results to other scaling regimes,
including a super-linear number of reads, as well as several certain low-rate
regimes. We find that the latter comes with notable intricacies, such as the
suboptimality of codewords with all distinct molecules, and certain
dependencies of the error exponents on the model for sequencing errors.