Annual Symposium on Combinatorial Pattern Matching: Latest Publications (Page 5)

Fully-functional bidirectional Burrows-Wheeler indexes

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2019-01-29 DOI: 10.4230/LIPIcs.CPM.2019.10

F. Cunial, D. Belazzougui

Citations: 15

Space-Efficient Computation of the LCP Array from the Burrows-Wheeler Transform

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2019-01-16 DOI: 10.4230/LIPIcs.CPM.2019.7

N. Prezza, Giovanna Rosone

Abstract: We show that the Longest Common Prefix Array of a text collection of total size n on alphabet [1, σ] can be computed from the Burrows-Wheeler transformed collection in O(n log σ) time using o(n log σ) bits of working space on top of the input and output. Our result improves (on small alphabets) and generalizes (to string collections) the previous solution from Beller et al., which required O(n) bits of extra working space. We also show how to merge the BWTs of two collections of total size n within the same time and space bounds. The procedure at the core of our algorithms can be used to enumerate suffix tree intervals in succinct space from the BWT, which is of independent interest. An engineered implementation of our first algorithm on DNA alphabet induces the LCP of a large (16 GiB) collection of short (100 bases) reads at a rate of 2.92 megabases per second using in total 1.5 Bytes per base in RAM. Our second algorithm merges the BWTs of two short-reads collections of 8 GiB each at a rate of 1.7 megabases per second and uses 0.625 Bytes per base in RAM. An extension of this algorithm that computes also the LCP array of the merged collection processes the data at a rate of 1.48 megabases per second and uses 1.625 Bytes per base in RAM.
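
To make the objects named in the abstract above concrete, the following sketch builds the suffix array, the Burrows-Wheeler transform, and the LCP array of one small string. It is a naive illustration only: it sorts suffixes explicitly and uses far more space than the succinct algorithm the abstract describes, and it does not derive the LCP array from the BWT. The function names and the example text "banana$" are assumptions made for this sketch.

```python
def suffix_array(text):
    """Naive suffix array: sort suffix start positions lexicographically."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def bwt_from_sa(text, sa):
    """BWT: for each sorted suffix, take the character preceding it (cyclically)."""
    return "".join(text[i - 1] for i in sa)


def lcp_array(text, sa):
    """lcp[k] is the longest common prefix length of suffixes sa[k-1] and sa[k].

    lcp[0] is set to 0 by convention, since the first suffix has no predecessor.
    """
    def common_prefix(a, b):
        n = 0
        while a + n < len(text) and b + n < len(text) and text[a + n] == text[b + n]:
            n += 1
        return n

    return [0] + [common_prefix(sa[k - 1], sa[k]) for k in range(1, len(sa))]


# Illustrative input; '$' is a terminator smaller than every letter.
text = "banana$"
sa = suffix_array(text)
bwt = bwt_from_sa(text, sa)
lcp = lcp_array(text, sa)
print(sa)   # [6, 5, 3, 1, 0, 4, 2]
print(bwt)  # annb$aa
print(lcp)  # [0, 0, 1, 3, 0, 0, 2]
```

The quadratic sorting step is what space-efficient methods avoid; the sketch is meant only to show what the input (BWT) and output (LCP array) look like.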

Citations: 9

Optimal Rank and Select Queries on Dictionary-Compressed Text

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2018-11-03 DOI: 10.4230/LIPIcs.CPM.2019.4

N. Prezza

Citations: 14

Approximating Approximate Pattern Matching

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2018-10-01 DOI: 10.4230/LIPIcs.CPM.2019.15

J. Studeny, P. Uznański

Citations: 4

Finding a Small Number of Colourful Components

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2018-08-10 DOI: 10.4230/LIPICS.CPM.2019.20

L. Bulteau, Konrad K. Dabrowski, G. Fertin, Matthew Johnson, D. Paulusma, Stéphane Vialette

Abstract: A partition (V_1,...,V_k) of the vertex set of a graph G with a (not necessarily proper) colouring c is colourful if no two vertices in any V_i have the same colour and every set V_i induces a connected graph. The Colourful Partition problem, introduced by Adamaszek and Popa, is to decide whether a coloured graph (G,c) has a colourful partition of size at most k. This problem is related to the Colourful Components problem, introduced by He, Liu and Zhao, which is to decide whether a graph can be modified into a graph whose connected components form a colourful partition by deleting at most p edges.

Despite the similarities in their definitions, we show that Colourful Partition and Colourful Components may have different complexities for restricted instances. We tighten known NP-hardness results for both problems by closing a number of complexity gaps. In addition, we prove new hardness and tractability results for Colourful Partition. In particular, we prove that deciding whether a coloured graph (G,c) has a colourful partition of size 2 is NP-complete for coloured planar bipartite graphs of maximum degree 3 and path-width 3, but polynomial-time solvable for coloured graphs of treewidth 2.

Rather than performing an ad hoc study, we use our classical complexity results to guide us in undertaking a thorough parameterized study of Colourful Partition. We show that this leads to suitable parameters for obtaining FPT results and moreover prove that Colourful Components and Colourful Partition may have different parameterized complexities, depending on the chosen parameter.
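
The definition of a colourful partition in the abstract above can be checked mechanically. The sketch below verifies a given partition; it does not search for one, which is the hard part the abstract studies. The adjacency-dictionary representation, the function names, and the four-vertex path example are assumptions chosen for illustration.

```python
from collections import deque


def is_connected(vertices, adj):
    """Breadth-first search restricted to `vertices`; an empty set counts as not connected."""
    vertices = set(vertices)
    if not vertices:
        return False
    start = next(iter(vertices))
    seen = {start}
    queue = deque([start])
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v in vertices and v not in seen:
                seen.add(v)
                queue.append(v)
    return seen == vertices


def is_colourful_partition(adj, colour, parts):
    """True if `parts` partitions the vertices, and each part is connected with distinct colours."""
    covered = [v for part in parts for v in part]
    if sorted(covered) != sorted(adj):  # every vertex must appear exactly once
        return False
    for part in parts:
        colours = [colour[v] for v in part]
        if len(colours) != len(set(colours)):  # a colour repeats inside this part
            return False
        if not is_connected(part, adj):
            return False
    return True


# Hypothetical instance: the path 0 - 1 - 2 - 3 with alternating colours.
adj = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
colour = {0: "red", 1: "blue", 2: "red", 3: "blue"}

print(is_colourful_partition(adj, colour, [[0, 1], [2, 3]]))  # True
print(is_colourful_partition(adj, colour, [[0, 1, 2], [3]]))  # False: red appears twice
print(is_colourful_partition(adj, colour, [[0, 2], [1, 3]]))  # False: parts are disconnected
```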

Citations: 3

Superstrings with multiplicities

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2018-07-02 DOI: 10.4230/LIPIcs.CPM.2018.21

Bastien Cazaux, Eric Rivals

Abstract: A superstring of a set of words P = {s_1, ..., s_p} is a string that contains each word of P as a substring. Given P, the well-known Shortest Linear Superstring problem (SLS) asks for a shortest superstring of P. In a variant of SLS, called Multi-SLS, each word s_i comes with an integer m(i), its multiplicity, that sets a constraint on its number of occurrences, and the goal is to find a shortest superstring that contains at least m(i) occurrences of s_i. Multi-SLS generalizes SLS and is obviously as hard to solve, but it has been studied only in special cases (with words of length 2 or with a fixed number of words). The approximability of Multi-SLS in the general case remains open. Here, we study the approximability of Multi-SLS and that of the companion problem Multi-SCCS, which asks for a shortest cyclic cover instead of a shortest superstring. First, we investigate the approximation of a greedy algorithm for maximizing the compression offered by a superstring or by a cyclic cover: the approximation ratio is 1/2 for Multi-SLS and 1 for Multi-SCCS. Then, we exhibit a linear time approximation algorithm, Concat-Greedy, and show it achieves a ratio of 4 regarding the superstring length. This demonstrates that for both measures Multi-SLS belongs to the class of APX problems.
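
The Multi-SLS feasibility condition from the abstract above (at least m(i) occurrences of each word s_i) is easy to state in code. The sketch below checks a candidate string and builds a trivial feasible solution by plain concatenation. It is not the paper's Concat-Greedy algorithm or its greedy compression algorithm; the function names and the two-word instance are assumptions made for illustration.

```python
def count_occurrences(text, word):
    """Count occurrences of `word` in `text`, allowing overlaps."""
    count = 0
    start = text.find(word)
    while start != -1:
        count += 1
        start = text.find(word, start + 1)
    return count


def is_multi_superstring(candidate, multiplicity):
    """True if every word w occurs at least multiplicity[w] times in `candidate`."""
    return all(count_occurrences(candidate, w) >= m for w, m in multiplicity.items())


def naive_concatenation(multiplicity):
    """Trivial feasible solution: write each word m times in a row, exploiting no overlaps."""
    return "".join(w * m for w, m in multiplicity.items())


# Hypothetical instance: "ab" must occur twice, "ba" once.
multiplicity = {"ab": 2, "ba": 1}

baseline = naive_concatenation(multiplicity)
print(baseline, is_multi_superstring(baseline, multiplicity))  # ababba True
print(is_multi_superstring("abab", multiplicity))              # True, and shorter
print(is_multi_superstring("aba", multiplicity))               # False: "ab" occurs once
```

The gap between the length-6 baseline and the length-4 string "abab" is the kind of saving from overlaps that the compression measure in the abstract quantifies.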

Citations: 6

On the Maximum Colorful Arborescence Problem and Color Hierarchy Graph Structure

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2018-07-02 DOI: 10.4230/LIPIcs.CPM.2018.17

G. Fertin, J. Fradin, Christian Komusiewicz

Abstract: Let G = (V, A) be a vertex-colored arc-weighted directed acyclic graph (DAG) rooted in some vertex r. The color hierarchy graph H(G) of G is defined as follows: V(H(G)) is the color set C of G, and H(G) has an arc from c to c' if G has an arc from a vertex of color c to a vertex of color c'. We study the Maximum Colorful Arborescence (MCA) problem, which takes as input a DAG G such that H(G) is also a DAG, and aims at finding in G a maximum-weight arborescence rooted in r in which no color appears more than once. The MCA problem models the de novo inference of unknown metabolites by mass spectrometry experiments. Although the problem was introduced ten years ago (under a different name), it was only recently pointed out that a crucial additional property in the problem definition was missing: by essence, H(G) must be a DAG. In this paper, we further investigate MCA under this new light and provide new algorithmic results for this problem, with a specific focus on fixed-parameter tractability (FPT) issues for different structural parameters of H(G). In particular, we show there exists an O*(3^{n_H}) time algorithm for solving MCA, where n_H is the number of vertices of indegree at least two in H(G), thereby improving the O*(3^{|C|}) algorithm from Böcker et al. [Proc. ECCB '08]. We also prove that MCA is W[2]-hard relative to the treewidth H_t of the underlying undirected graph of H(G), and further show that it is FPT relative to H_t + ℓ_C, where ℓ_C := |V| − |C|.

2012 ACM Subject Classification: F.2.2 Nonnumerical Algorithms and Problems, G.2.1 Combinatorics, G.2.2 Graph Theory
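
The color hierarchy graph H(G) defined in the abstract above can be built directly from its definition: one node per color, and an arc from color c to color c' whenever G has an arc from a vertex of color c to a vertex of color c'. The sketch below constructs H(G), tests whether it is a DAG, and counts the colors of indegree at least two, the parameter the abstract calls n_H. It does not solve MCA. The function names, the arc-list representation, and both example graphs are assumptions made for illustration.

```python
def color_hierarchy_arcs(arcs, color):
    """Arcs of H(G): (color[u], color[v]) for every arc (u, v) of G, without duplicates."""
    return {(color[u], color[v]) for u, v in arcs}


def is_dag(nodes, arcs):
    """Kahn's algorithm: a graph is acyclic iff all nodes can be removed in topological order."""
    indegree = {n: 0 for n in nodes}
    successors = {n: [] for n in nodes}
    for u, v in arcs:
        successors[u].append(v)
        indegree[v] += 1
    ready = [n for n in indegree if indegree[n] == 0]
    removed = 0
    while ready:
        n = ready.pop()
        removed += 1
        for m in successors[n]:
            indegree[m] -= 1
            if indegree[m] == 0:
                ready.append(m)
    return removed == len(indegree)


def count_indegree_at_least_two(nodes, arcs):
    """Number of nodes with at least two incoming arcs (n_H when applied to H(G))."""
    indegree = {n: 0 for n in nodes}
    for _, v in arcs:
        indegree[v] += 1
    return sum(1 for d in indegree.values() if d >= 2)


# Hypothetical instance 1: H(G) is a DAG and color 3 has two incoming arcs.
color1 = {"r": 0, "a": 1, "b": 2, "c": 3, "d": 3}
arcs1 = [("r", "a"), ("r", "b"), ("a", "c"), ("b", "d")]
h1 = color_hierarchy_arcs(arcs1, color1)
colors1 = set(color1.values())
print(is_dag(colors1, h1), count_indegree_at_least_two(colors1, h1))  # True 1

# Hypothetical instance 2: G is a DAG (a path x -> y -> z) but H(G) has a cycle.
color2 = {"x": 1, "y": 2, "z": 1}
arcs2 = [("x", "y"), ("y", "z")]
h2 = color_hierarchy_arcs(arcs2, color2)
print(is_dag(set(color2), arcs2), is_dag(set(color2.values()), h2))  # True False
```

The second instance shows why the extra requirement matters: acyclicity of G alone does not imply acyclicity of H(G).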

Citations: 3

Lyndon Factorization of Grammar Compressed Texts Revisited

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2018-05-01 DOI: 10.4230/LIPIcs.CPM.2018.24

Isamu Furuya, Yuto Nakashima, I. Tomohiro, Shunsuke Inenaga, H. Bannai, M. Takeda

Citations: 4

Computing longest common square subsequences

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2018-05-01 DOI: 10.4230/LIPIcs.CPM.2018.15

Takafumi Inoue, Shunsuke Inenaga, Heikki Hyyrö, H. Bannai, M. Takeda

Citations: 10

Longest Lyndon Substring After Edit

Annual Symposium on Combinatorial Pattern Matching Pub Date : 2018-05-01 DOI: 10.4230/LIPIcs.CPM.2018.19

Y. Urabe, Yuto Nakashima, Shunsuke Inenaga, H. Bannai, M. Takeda

Citations: 10