Typical Depth of a Digital Search Tree built on a general source

Workshop on Analytic Algorithmics and Combinatorics. Pub Date: 2014-01-06. DOI: 10.1137/1.9781611973204.1

Kanal Hun, B. Vallée


Citations: 7

Abstract

The digital search tree (dst) plays a central role in compression algorithms of Lempel-Ziv type. This important structure can be viewed as a mixture of a digital structure (the trie) and a binary search tree. Its probabilistic analysis is thus involved, even when the text is produced by a simple source (a memoryless source or a Markov chain). After the seminal paper of Flajolet and Sedgewick (1986) [11], which deals with the memoryless unbiased case, many papers by Drmota, Jacquet, Louchard, Prodinger, Szpankowski, and Tang, published between 1990 and 2005, dealt with general memoryless sources or Markov chains and analysed the main parameters of dst's, namely internal path length, profile, and typical depth (see for instance [7, 15, 14]). Here, we are interested in a more realistic analysis, in which the words are emitted by a general source, where the emission of symbols may depend on the whole previous history. Previous analyses of text algorithms or digital structures have been performed for general sources, for instance for tries ([3, 2]) or for basic sorting and searching algorithms ([22, 4]). However, the case of digital search trees has not yet been considered, and this is the main subject of the paper. The idea of this study is due to Philippe Flajolet, and the first steps of the work were performed with him at the end of 2010. This paper is dedicated to Philippe's memory.
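
To make the "trie plus search tree" description concrete, here is a minimal Python sketch of digital search tree insertion. It is an illustration only, not code from the paper: the names `Node`, `dst_insert`, and `insertion_depths` are hypothetical. It assumes the words are finite strings of equal length, standing in for the infinite words a source would emit. Each node stores one whole word, as in a search tree, while the branch taken at depth k is chosen by the k-th symbol of the word being inserted, as in a trie.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    """One DST node: stores a whole word and branches on single symbols."""
    key: str
    children: Dict[str, "Node"] = field(default_factory=dict)


def dst_insert(root: Optional[Node], word: str) -> Tuple[Node, int]:
    """Insert `word` and return (root, depth at which the word is stored).

    Assumption: all words have the same length, so a word that reaches
    depth len(word) must equal the key stored there and is caught by the
    duplicate check before any out-of-range indexing.
    """
    if root is None:
        return Node(word), 0
    node, depth = root, 0
    while True:
        if node.key == word:          # word already present
            return root, depth
        symbol = word[depth]          # trie-like: branch on the depth-th symbol
        child = node.children.get(symbol)
        if child is None:             # search-tree-like: occupy the first free slot
            node.children[symbol] = Node(word)
            return root, depth + 1
        node, depth = child, depth + 1


def insertion_depths(words: List[str]) -> List[int]:
    """Build a DST from `words` in order and record each word's depth."""
    root: Optional[Node] = None
    depths: List[int] = []
    for w in words:
        root, d = dst_insert(root, w)
        depths.append(d)
    return depths


if __name__ == "__main__":
    print(insertion_depths(["010", "000", "011", "100", "001"]))
```

In this sketch the first word occupies the root regardless of its symbols, so the shape of the tree depends on insertion order. That is one way a dst differs from a trie built on the same words. The recorded depths are the raw data behind depth-type parameters such as those listed in the abstract.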

