Julien Cassaigne , France Gheeraert , Antonio Restivo , Giuseppe Romana , Marinella Sciortino , Manon Stipulanti
{"title":"基于新字符串吸引子的无限词复杂性","authors":"Julien Cassaigne , France Gheeraert , Antonio Restivo , Giuseppe Romana , Marinella Sciortino , Manon Stipulanti","doi":"10.1016/j.jcta.2024.105936","DOIUrl":null,"url":null,"abstract":"<div><p>A <em>string attractor</em> is a set of positions in a word such that each distinct factor has an occurrence crossing a position from the set. This definition comes from the data compression field, where the size <span><math><msup><mrow><mi>γ</mi></mrow><mrow><mo>⁎</mo></mrow></msup></math></span> of a smallest string attractor represents a lower bound for the output size of a large family of string compressors exploiting repetitions in words, including BWT-based and LZ-based compressors. For finite words, the combinatorial properties of string attractors have been studied in 2021 by Mantaci et al.. Later, Schaeffer and Shallit introduced the <em>string attractor profile function</em>, a complexity function that evaluates for each <span><math><mi>n</mi><mo>></mo><mn>0</mn></math></span> the size <span><math><msup><mrow><mi>γ</mi></mrow><mrow><mo>⁎</mo></mrow></msup></math></span> of the length-<em>n</em> prefix of a one-sided infinite word.</p><p>A natural development of the research on the topic is to link string attractors with other classical notions of repetitiveness in combinatorics on words. Our contribution in this sense is threefold. First, we explore the relation between the string attractor profile function and other well-known combinatorial complexity functions in the context of infinite words, such as the factor complexity and the property of recurrence. Moreover, we study its asymptotic growth in the case of purely morphic words and obtain a complete description in the binary case. Second, we introduce two new string attractor-based complexity functions, in which the structure and the distribution of positions in a string attractor are taken into account, and we study their combinatorial properties. We also show that these measures provide a finer classification of some infinite families of words, namely the Sturmian and quasi-Sturmian words. Third, we explicitly give the three complexities for some specific morphic words called <em>k</em>-bonacci words.</p><p>A preliminary version of some results presented in this paper can be found in [Restivo, Romana, Sciortino, <em>String Attractors and Infinite Words</em>, LATIN 2022].</p></div>","PeriodicalId":50230,"journal":{"name":"Journal of Combinatorial Theory Series A","volume":"208 ","pages":"Article 105936"},"PeriodicalIF":0.9000,"publicationDate":"2024-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S009731652400075X/pdfft?md5=0182f56d4d92ee1616cf2c224e889d2f&pid=1-s2.0-S009731652400075X-main.pdf","citationCount":"0","resultStr":"{\"title\":\"New string attractor-based complexities for infinite words\",\"authors\":\"Julien Cassaigne , France Gheeraert , Antonio Restivo , Giuseppe Romana , Marinella Sciortino , Manon Stipulanti\",\"doi\":\"10.1016/j.jcta.2024.105936\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>A <em>string attractor</em> is a set of positions in a word such that each distinct factor has an occurrence crossing a position from the set. This definition comes from the data compression field, where the size <span><math><msup><mrow><mi>γ</mi></mrow><mrow><mo>⁎</mo></mrow></msup></math></span> of a smallest string attractor represents a lower bound for the output size of a large family of string compressors exploiting repetitions in words, including BWT-based and LZ-based compressors. For finite words, the combinatorial properties of string attractors have been studied in 2021 by Mantaci et al.. Later, Schaeffer and Shallit introduced the <em>string attractor profile function</em>, a complexity function that evaluates for each <span><math><mi>n</mi><mo>></mo><mn>0</mn></math></span> the size <span><math><msup><mrow><mi>γ</mi></mrow><mrow><mo>⁎</mo></mrow></msup></math></span> of the length-<em>n</em> prefix of a one-sided infinite word.</p><p>A natural development of the research on the topic is to link string attractors with other classical notions of repetitiveness in combinatorics on words. Our contribution in this sense is threefold. First, we explore the relation between the string attractor profile function and other well-known combinatorial complexity functions in the context of infinite words, such as the factor complexity and the property of recurrence. Moreover, we study its asymptotic growth in the case of purely morphic words and obtain a complete description in the binary case. Second, we introduce two new string attractor-based complexity functions, in which the structure and the distribution of positions in a string attractor are taken into account, and we study their combinatorial properties. We also show that these measures provide a finer classification of some infinite families of words, namely the Sturmian and quasi-Sturmian words. Third, we explicitly give the three complexities for some specific morphic words called <em>k</em>-bonacci words.</p><p>A preliminary version of some results presented in this paper can be found in [Restivo, Romana, Sciortino, <em>String Attractors and Infinite Words</em>, LATIN 2022].</p></div>\",\"PeriodicalId\":50230,\"journal\":{\"name\":\"Journal of Combinatorial Theory Series A\",\"volume\":\"208 \",\"pages\":\"Article 105936\"},\"PeriodicalIF\":0.9000,\"publicationDate\":\"2024-07-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S009731652400075X/pdfft?md5=0182f56d4d92ee1616cf2c224e889d2f&pid=1-s2.0-S009731652400075X-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Combinatorial Theory Series A\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S009731652400075X\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATHEMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Combinatorial Theory Series A","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S009731652400075X","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICS","Score":null,"Total":0}
New string attractor-based complexities for infinite words
A string attractor is a set of positions in a word such that each distinct factor has an occurrence crossing a position from the set. This definition comes from the data compression field, where the size of a smallest string attractor represents a lower bound for the output size of a large family of string compressors exploiting repetitions in words, including BWT-based and LZ-based compressors. For finite words, the combinatorial properties of string attractors have been studied in 2021 by Mantaci et al.. Later, Schaeffer and Shallit introduced the string attractor profile function, a complexity function that evaluates for each the size of the length-n prefix of a one-sided infinite word.
A natural development of the research on the topic is to link string attractors with other classical notions of repetitiveness in combinatorics on words. Our contribution in this sense is threefold. First, we explore the relation between the string attractor profile function and other well-known combinatorial complexity functions in the context of infinite words, such as the factor complexity and the property of recurrence. Moreover, we study its asymptotic growth in the case of purely morphic words and obtain a complete description in the binary case. Second, we introduce two new string attractor-based complexity functions, in which the structure and the distribution of positions in a string attractor are taken into account, and we study their combinatorial properties. We also show that these measures provide a finer classification of some infinite families of words, namely the Sturmian and quasi-Sturmian words. Third, we explicitly give the three complexities for some specific morphic words called k-bonacci words.
A preliminary version of some results presented in this paper can be found in [Restivo, Romana, Sciortino, String Attractors and Infinite Words, LATIN 2022].
期刊介绍:
The Journal of Combinatorial Theory publishes original mathematical research concerned with theoretical and physical aspects of the study of finite and discrete structures in all branches of science. Series A is concerned primarily with structures, designs, and applications of combinatorics and is a valuable tool for mathematicians and computer scientists.