An optimized Parallel Failure-less Aho-Corasick algorithm for DNA sequence matching

Vajira Lasantha Thambawita, R. Ragel, D. Elkaduwe
{"title":"An optimized Parallel Failure-less Aho-Corasick algorithm for DNA sequence matching","authors":"Vajira Lasantha Thambawita, R. Ragel, D. Elkaduwe","doi":"10.1109/ICIAFS.2016.7946533","DOIUrl":null,"url":null,"abstract":"The Aho-Corasick algorithm is a multiple patterns searching algorithm running sequentially in various applications like network intrusion detection and bioinformatics for finding several input strings within a given large input string. The parallel version of the Aho-Corasick algorithm is called as Parallel Failure-less Aho-Corasick algorithm because it doesnt need failure links like in the original Aho-Corasick algorithm. In this research, we implemented an application specific parallel failureless Aho-Corasick algorithm on the general purpose graphic processing unit by applying several cache optimization techniques for matching DNA sequences. Our parallel Aho-Corasick algorithm shows better performance than the available parallel Aho-Corasick algorithm library due to its simplicity and optimized cache memory usage of graphic processing units for matching DNA sequences.","PeriodicalId":237290,"journal":{"name":"2016 IEEE International Conference on Information and Automation for Sustainability (ICIAfS)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Information and Automation for Sustainability (ICIAfS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIAFS.2016.7946533","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The Aho-Corasick algorithm is a multiple patterns searching algorithm running sequentially in various applications like network intrusion detection and bioinformatics for finding several input strings within a given large input string. The parallel version of the Aho-Corasick algorithm is called as Parallel Failure-less Aho-Corasick algorithm because it doesnt need failure links like in the original Aho-Corasick algorithm. In this research, we implemented an application specific parallel failureless Aho-Corasick algorithm on the general purpose graphic processing unit by applying several cache optimization techniques for matching DNA sequences. Our parallel Aho-Corasick algorithm shows better performance than the available parallel Aho-Corasick algorithm library due to its simplicity and optimized cache memory usage of graphic processing units for matching DNA sequences.
DNA序列匹配的优化并行无故障Aho-Corasick算法
ho- corasick算法是一种顺序运行的多模式搜索算法,用于在给定的大输入字符串中查找多个输入字符串,如网络入侵检测和生物信息学。Aho-Corasick算法的并行版本被称为并行无故障Aho-Corasick算法,因为它不像原来的Aho-Corasick算法那样需要故障链接。在本研究中,我们通过应用几种匹配DNA序列的缓存优化技术,在通用图形处理单元上实现了一种特定应用的并行无故障Aho-Corasick算法。我们的并行Aho-Corasick算法由于其简单性和优化的DNA序列匹配图形处理单元的缓存使用,比现有的并行Aho-Corasick算法库表现出更好的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信