{"title":"Bounds for parametric sequence comparison","authors":"David Fernández-Baca, T. Seppäläinen, G. Slutzki","doi":"10.1109/SPIRE.1999.796578","DOIUrl":null,"url":null,"abstract":"We consider the problem of computing a global alignment between two or more sequences subject to varying mismatch and indel penalties. We prove a tight 3(n/2/spl pi/)/sup 2/3/+O(n/sup 1/3/logn) bound on the worst-case number of distinct optimum alignments for two sequences of length n as the parameters are varied. This refines a O(n/sup 2/3/) upper bound by D. Gusfield et al. (1994). Our lower bound requires an unbounded alphabet. For strings over a binary alphabet, we prove a /spl Omega/(n/sup 1/2/) lower bound. For the parametric global alignment of k/spl ges/2 sequences under sum-of-pairs scoring, we prove a 3((k/2)n/2/spl pi/)/sup 2/3/+O(k/sup 2/3/n/sup 1/3/logn) upper bound on the number of distinct optimality regions and a /spl Omega/(n/sup 2/3/) lower bound. Based on experimental evidence, we conjecture that for two random sequences, the number of optimality regions is approximately /spl radic/n with high probability.","PeriodicalId":131279,"journal":{"name":"6th International Symposium on String Processing and Information Retrieval. 5th International Workshop on Groupware (Cat. No.PR00268)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1999-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"6th International Symposium on String Processing and Information Retrieval. 5th International Workshop on Groupware (Cat. No.PR00268)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPIRE.1999.796578","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
We consider the problem of computing a global alignment between two or more sequences subject to varying mismatch and indel penalties. We prove a tight 3(n/2/spl pi/)/sup 2/3/+O(n/sup 1/3/logn) bound on the worst-case number of distinct optimum alignments for two sequences of length n as the parameters are varied. This refines a O(n/sup 2/3/) upper bound by D. Gusfield et al. (1994). Our lower bound requires an unbounded alphabet. For strings over a binary alphabet, we prove a /spl Omega/(n/sup 1/2/) lower bound. For the parametric global alignment of k/spl ges/2 sequences under sum-of-pairs scoring, we prove a 3((k/2)n/2/spl pi/)/sup 2/3/+O(k/sup 2/3/n/sup 1/3/logn) upper bound on the number of distinct optimality regions and a /spl Omega/(n/sup 2/3/) lower bound. Based on experimental evidence, we conjecture that for two random sequences, the number of optimality regions is approximately /spl radic/n with high probability.