Querying highly similar sequences.

Q4 Pharmacology, Toxicology and Pharmaceutics
Carl Barton, Mathieu Giraud, Costas S Iliopoulos, Thierry Lecroq, Laurent Mouchard, Solon P Pissis
{"title":"Querying highly similar sequences.","authors":"Carl Barton,&nbsp;Mathieu Giraud,&nbsp;Costas S Iliopoulos,&nbsp;Thierry Lecroq,&nbsp;Laurent Mouchard,&nbsp;Solon P Pissis","doi":"10.1504/IJCBDD.2013.052206","DOIUrl":null,"url":null,"abstract":"<p><p>In this paper, we present a solution to the extreme similarity sequencing problem. The extreme similarity sequencing problem consists of finding occurrences of a pattern p in a set S(0), S(1), \u0085, S(k), of sequences of equal length, where S(i), for all 1≤i≤k, differs from S(0) by a constant number of errors - around 10 in practice. We present an asymptotically fast O(n + occ logocc) time algorithm, as well as a practical O(nk/w) time algorithm for solving this problem, where n is the length of a sequence, occ is the number of candidate occurrences reported by our technique, w is the size of the machine word, and the total number of errors is bounded by k - the number of sequences.</p>","PeriodicalId":39227,"journal":{"name":"International Journal of Computational Biology and Drug Design","volume":" ","pages":"119-30"},"PeriodicalIF":0.0000,"publicationDate":"2013-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJCBDD.2013.052206","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational Biology and Drug Design","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJCBDD.2013.052206","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2013/2/21 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}
引用次数: 3

Abstract

In this paper, we present a solution to the extreme similarity sequencing problem. The extreme similarity sequencing problem consists of finding occurrences of a pattern p in a set S(0), S(1), …, S(k), of sequences of equal length, where S(i), for all 1≤i≤k, differs from S(0) by a constant number of errors - around 10 in practice. We present an asymptotically fast O(n + occ logocc) time algorithm, as well as a practical O(nk/w) time algorithm for solving this problem, where n is the length of a sequence, occ is the number of candidate occurrences reported by our technique, w is the size of the machine word, and the total number of errors is bounded by k - the number of sequences.

查询高度相似的序列。
在本文中,我们提出了一个解决极端相似排序问题的方法。极端相似排序问题包括在相同长度序列的集合S(0), S(1),…,S(k)中寻找模式p的出现,其中S(i),对于所有1≤i≤k,与S(0)相差恒定数量的误差-在实践中约为10。我们提出了一个渐进快速的O(n + occ logocc)时间算法,以及一个实用的O(nk/w)时间算法来解决这个问题,其中n是序列的长度,occ是我们技术报告的候选出现次数,w是机器字的大小,总错误数以k -序列数为界。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Computational Biology and Drug Design
International Journal of Computational Biology and Drug Design Pharmacology, Toxicology and Pharmaceutics-Drug Discovery
CiteScore
1.00
自引率
0.00%
发文量
8
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信