Joel D. Day , Pamela Fleischmann , Maria Kosche , Tore Koß , Florin Manea , Stefan Siemer
{"title":"对k-子序列通用性的编辑距离","authors":"Joel D. Day , Pamela Fleischmann , Maria Kosche , Tore Koß , Florin Manea , Stefan Siemer","doi":"10.1016/j.jcss.2025.103681","DOIUrl":null,"url":null,"abstract":"<div><div>A word <em>u</em> is a subsequence of another word <em>w</em> if <em>u</em> is obtained from <em>w</em> by deleting some of its letters. In the 1970s, Simon defined the relation <span><math><msub><mrow><mo>∼</mo></mrow><mrow><mi>k</mi></mrow></msub></math></span> (called now Simon-Congruence) as follows: two words having the same set of subsequences of length <em>k</em> are <span><math><msub><mrow><mo>∼</mo></mrow><mrow><mi>k</mi></mrow></msub></math></span>-congruent. It is thus natural to ask, for non <em>k</em>-equivalent words <em>w</em> and <em>u</em>, what is the minimal number of edit operations that we need to perform on <em>w</em> to obtain a word which is <span><math><msub><mrow><mo>∼</mo></mrow><mrow><mi>k</mi></mrow></msub></math></span>-equivalent to <em>u</em>. Here, we consider this problem in a specific setting: when <em>u</em> is a <em>k</em>-subsequence universal word. A word <em>u</em> with <span><math><mi>a</mi><mi>l</mi><mi>p</mi><mi>h</mi><mo>(</mo><mi>u</mi><mo>)</mo><mo>=</mo><mi>Σ</mi></math></span> is called <em>k</em>-subsequence universal if the set of length-<em>k</em> subsequences of <em>u</em> contains all possible words of length <em>k</em> over Σ. As such, our results are a series of efficient algorithms computing the edit distance from <em>w</em> to the language of <em>k</em>-subsequence universal words.</div></div>","PeriodicalId":50224,"journal":{"name":"Journal of Computer and System Sciences","volume":"154 ","pages":"Article 103681"},"PeriodicalIF":1.1000,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The edit distance to k-subsequence universality\",\"authors\":\"Joel D. Day , Pamela Fleischmann , Maria Kosche , Tore Koß , Florin Manea , Stefan Siemer\",\"doi\":\"10.1016/j.jcss.2025.103681\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>A word <em>u</em> is a subsequence of another word <em>w</em> if <em>u</em> is obtained from <em>w</em> by deleting some of its letters. In the 1970s, Simon defined the relation <span><math><msub><mrow><mo>∼</mo></mrow><mrow><mi>k</mi></mrow></msub></math></span> (called now Simon-Congruence) as follows: two words having the same set of subsequences of length <em>k</em> are <span><math><msub><mrow><mo>∼</mo></mrow><mrow><mi>k</mi></mrow></msub></math></span>-congruent. It is thus natural to ask, for non <em>k</em>-equivalent words <em>w</em> and <em>u</em>, what is the minimal number of edit operations that we need to perform on <em>w</em> to obtain a word which is <span><math><msub><mrow><mo>∼</mo></mrow><mrow><mi>k</mi></mrow></msub></math></span>-equivalent to <em>u</em>. Here, we consider this problem in a specific setting: when <em>u</em> is a <em>k</em>-subsequence universal word. A word <em>u</em> with <span><math><mi>a</mi><mi>l</mi><mi>p</mi><mi>h</mi><mo>(</mo><mi>u</mi><mo>)</mo><mo>=</mo><mi>Σ</mi></math></span> is called <em>k</em>-subsequence universal if the set of length-<em>k</em> subsequences of <em>u</em> contains all possible words of length <em>k</em> over Σ. As such, our results are a series of efficient algorithms computing the edit distance from <em>w</em> to the language of <em>k</em>-subsequence universal words.</div></div>\",\"PeriodicalId\":50224,\"journal\":{\"name\":\"Journal of Computer and System Sciences\",\"volume\":\"154 \",\"pages\":\"Article 103681\"},\"PeriodicalIF\":1.1000,\"publicationDate\":\"2025-06-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Computer and System Sciences\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0022000025000637\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"BUSINESS, FINANCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computer and System Sciences","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022000025000637","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BUSINESS, FINANCE","Score":null,"Total":0}
A word u is a subsequence of another word w if u is obtained from w by deleting some of its letters. In the 1970s, Simon defined the relation (called now Simon-Congruence) as follows: two words having the same set of subsequences of length k are -congruent. It is thus natural to ask, for non k-equivalent words w and u, what is the minimal number of edit operations that we need to perform on w to obtain a word which is -equivalent to u. Here, we consider this problem in a specific setting: when u is a k-subsequence universal word. A word u with is called k-subsequence universal if the set of length-k subsequences of u contains all possible words of length k over Σ. As such, our results are a series of efficient algorithms computing the edit distance from w to the language of k-subsequence universal words.
期刊介绍:
The Journal of Computer and System Sciences publishes original research papers in computer science and related subjects in system science, with attention to the relevant mathematical theory. Applications-oriented papers may also be accepted and they are expected to contain deep analytic evaluation of the proposed solutions.
Research areas include traditional subjects such as:
• Theory of algorithms and computability
• Formal languages
• Automata theory
Contemporary subjects such as:
• Complexity theory
• Algorithmic Complexity
• Parallel & distributed computing
• Computer networks
• Neural networks
• Computational learning theory
• Database theory & practice
• Computer modeling of complex systems
• Security and Privacy.