PMS6: A Fast Algorithm for Motif Discovery.

Shibdas Bandyopadhyay, Sartaj Sahni, Sanguthevar Rajasekaran
{"title":"PMS6: A Fast Algorithm for Motif Discovery.","authors":"Shibdas Bandyopadhyay, Sartaj Sahni, Sanguthevar Rajasekaran","doi":"10.1109/ICCABS.2012.6182627","DOIUrl":null,"url":null,"abstract":"<p><p>We propose a new algorithm, PMS6, for the (<i>l</i>, <i>d</i>)-motif discovery problem in which we are to find all strings of length <i>l</i> that appear in every string of a given set of strings with at most <i>d</i> mismatches. The run time ratio PMS5/PMS6, where PMS5 is the fastest previously known algorithm for motif discovery in large instances, ranges from a high of 2.20 for the (21,8) challenge instances to a low of 1.69 for the (17,6) challenge instances. Both PMS5 and PMS6 require some amount of preprocessing. The preprocessing time for PMS6 is 34 times faster than that for PMS5 for (23,9) instances. When preprocessing time is factored in, the run time ratio PMS5/PMS6 is as high as 2.75 for (13,4) instances and as low as 1.95 for (17,6) instances.</p>","PeriodicalId":89933,"journal":{"name":"IEEE ... International Conference on Computational Advances in Bio and Medical Sciences : [proceedings]. IEEE International Conference on Computational Advances in Bio and Medical Sciences","volume":" ","pages":"1-6"},"PeriodicalIF":0.0000,"publicationDate":"2012-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3744182/pdf/nihms499893.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE ... International Conference on Computational Advances in Bio and Medical Sciences : [proceedings]. IEEE International Conference on Computational Advances in Bio and Medical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCABS.2012.6182627","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We propose a new algorithm, PMS6, for the (l, d)-motif discovery problem in which we are to find all strings of length l that appear in every string of a given set of strings with at most d mismatches. The run time ratio PMS5/PMS6, where PMS5 is the fastest previously known algorithm for motif discovery in large instances, ranges from a high of 2.20 for the (21,8) challenge instances to a low of 1.69 for the (17,6) challenge instances. Both PMS5 and PMS6 require some amount of preprocessing. The preprocessing time for PMS6 is 34 times faster than that for PMS5 for (23,9) instances. When preprocessing time is factored in, the run time ratio PMS5/PMS6 is as high as 2.75 for (13,4) instances and as low as 1.95 for (17,6) instances.

Abstract Image

PMS6:一种快速的动机发现算法。
我们针对(l, d)-motif发现问题提出了一种新算法PMS6,在这个问题中,我们要找到所有长度为l的字符串,这些字符串出现在一组给定字符串中的每一个字符串中,且最多有d个错配。PMS5/PMS6 的运行时间比从 (21,8) 挑战实例的最高 2.20 到 (17,6) 挑战实例的最低 1.69 不等,其中 PMS5 是之前已知的在大型实例中发现主题的最快算法。PMS5 和 PMS6 都需要一定的预处理。在(23,9)实例中,PMS6 的预处理时间比 PMS5 快 34 倍。如果将预处理时间考虑在内,PMS5/PMS6 的运行时间比在(13,4)实例中高达 2.75,而在(17,6)实例中则低至 1.95。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信