Short-Read Mapping by a Systolic Custom FPGA Computation

2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines Pub Date : 2012-04-29 DOI:10.1109/FCCM.2012.37

Thomas B. Preußer, Oliver Knodel, R. Spallek

{"title":"Short-Read Mapping by a Systolic Custom FPGA Computation","authors":"Thomas B. Preußer, Oliver Knodel, R. Spallek","doi":"10.1109/FCCM.2012.37","DOIUrl":null,"url":null,"abstract":"The mapping of reads, i.e. short DNA base pair strings, to large genome databases has become a critical operation for genetic analysis and diagnosis. Although this mapping operation is a simple string search tolerant of some character mismatches, it is yet extremely challenging due to the tremendous size of the searched genome databases. It is the heavy use of search heuristics such as BLAST, Maq and Bowtie, which makes the economic deployment of read mappers possible. While these heuristics achieve feasible computation times, they also sacrifice the accuracy of the mapping results, which is itself a high value for reliable diagnostics. The traditional software implementations are unable to exploit the tremendous parallelism, which is available in the mapping of thousands and millions of reads. Merely a handful of concurrent control flows, and thus searches, can be performed efficiently on contemporary multicores. Even GPU assistance only enables a few dozens of parallel searches. This paper proposes a systolic custom computation on FPGA, which implements the read mapping on a massively parallel architecture. It implements a true search and guarantees to find all read mappings under a configurable threshold of base pair mismatches. The highly regular design from compact string matchers enables the implementation of thousands of parallel search engines on a single FPGA device. The presented map per platform combines highest computational performance with an excellent result accuracy. Its performance is more than twice as high as that of a recently published comparable FPGA map per. Already when implemented on a contemporary mid-size FPGA, it meets the search speed of software heuristics, which only detect little more than half of the valid read mappings. The map per easily scales to large FPGA devices, which can, thus, implement accurate high-performance volume mappers. Accurate mapping is made available in application domains that could only afford fuzzy heuristics by now.","PeriodicalId":226197,"journal":{"name":"2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines","volume":"221 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FCCM.2012.37","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

Abstract

The mapping of reads, i.e. short DNA base pair strings, to large genome databases has become a critical operation for genetic analysis and diagnosis. Although this mapping operation is a simple string search tolerant of some character mismatches, it is yet extremely challenging due to the tremendous size of the searched genome databases. It is the heavy use of search heuristics such as BLAST, Maq and Bowtie, which makes the economic deployment of read mappers possible. While these heuristics achieve feasible computation times, they also sacrifice the accuracy of the mapping results, which is itself a high value for reliable diagnostics. The traditional software implementations are unable to exploit the tremendous parallelism, which is available in the mapping of thousands and millions of reads. Merely a handful of concurrent control flows, and thus searches, can be performed efficiently on contemporary multicores. Even GPU assistance only enables a few dozens of parallel searches. This paper proposes a systolic custom computation on FPGA, which implements the read mapping on a massively parallel architecture. It implements a true search and guarantees to find all read mappings under a configurable threshold of base pair mismatches. The highly regular design from compact string matchers enables the implementation of thousands of parallel search engines on a single FPGA device. The presented map per platform combines highest computational performance with an excellent result accuracy. Its performance is more than twice as high as that of a recently published comparable FPGA map per. Already when implemented on a contemporary mid-size FPGA, it meets the search speed of software heuristics, which only detect little more than half of the valid read mappings. The map per easily scales to large FPGA devices, which can, thus, implement accurate high-performance volume mappers. Accurate mapping is made available in application domains that could only afford fuzzy heuristics by now.

查看原文本刊更多论文

短读映射的收缩自定义FPGA计算

reads(即DNA短碱基对串)到大型基因组数据库的映射已成为遗传分析和诊断的关键操作。尽管这种映射操作是一种简单的字符串搜索，可以容忍一些字符不匹配，但由于所搜索的基因组数据库的巨大规模，它仍然极具挑战性。大量使用搜索启发式算法，如BLAST、Maq和Bowtie，使得读取映射器的经济部署成为可能。虽然这些启发式方法实现了可行的计算时间，但它们也牺牲了映射结果的准确性，这本身就是可靠诊断的高值。传统的软件实现无法利用巨大的并行性，这在成千上万的读取映射中是可用的。只有少数并发控制流和搜索可以在当代多核上有效地执行。即使GPU辅助也只能支持几十个并行搜索。提出了一种基于FPGA的压缩自定义计算方法，在大规模并行架构下实现读映射。它实现了真正的搜索，并保证在碱基对不匹配的可配置阈值下找到所有读映射。紧凑字符串匹配器的高度规则设计可以在单个FPGA设备上实现数千个并行搜索引擎。每个平台呈现的地图结合了最高的计算性能和出色的结果准确性。它的性能是最近发布的同类FPGA映射的两倍多。当在现代中型FPGA上实现时，它已经满足了软件启发式的搜索速度，它只检测到一半多一点的有效读映射。映射器很容易扩展到大型FPGA设备，因此可以实现精确的高性能卷映射器。在目前只能提供模糊启发式的应用领域中，可以获得精确的映射。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines

自引率

0.00%

发文量