BioSEAL: In-Memory Biological Sequence Alignment Accelerator for Large-Scale Genomic Data

Proceedings of the 13th ACM International Systems and Storage Conference Pub Date : 2019-01-17 DOI:10.1145/3383669.3398279

R. Kaplan, L. Yavits, R. Ginosar

{"title":"BioSEAL: In-Memory Biological Sequence Alignment Accelerator for Large-Scale Genomic Data","authors":"R. Kaplan, L. Yavits, R. Ginosar","doi":"10.1145/3383669.3398279","DOIUrl":null,"url":null,"abstract":"Genome sequences contain hundreds of millions of DNA base pairs. Finding the degree of similarity between two genomes requires executing a compute-intensive dynamic programming algorithm, such as Smith-Waterman. Traditional von Neumann architectures have limited parallelism and cannot provide an efficient solution for large-scale genomic data. Approximate heuristic methods (e.g. BLAST) are commonly used. However, they are suboptimal and still compute-intensive. In this work, we present BioSEAL, a biological sequence alignment accelerator. BioSEAL is a massively parallel non-von Neumann processing-in-memory architecture for large-scale DNA and protein sequence alignment. BioSEAL is based on resistive content addressable memory, capable of energy-efficient and highperformance associative processing. We present an associative processing algorithm for entire database sequence alignment on BioSEAL and compare its performance and power consumption with state-of-art solutions. We show that BioSEAL can achieve up to 57× speedup and 156× better energy efficiency, compared with existing solutions for genome sequence alignment and protein sequence database search.","PeriodicalId":225327,"journal":{"name":"Proceedings of the 13th ACM International Systems and Storage Conference","volume":"63 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"31","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 13th ACM International Systems and Storage Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3383669.3398279","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 31

Abstract

Genome sequences contain hundreds of millions of DNA base pairs. Finding the degree of similarity between two genomes requires executing a compute-intensive dynamic programming algorithm, such as Smith-Waterman. Traditional von Neumann architectures have limited parallelism and cannot provide an efficient solution for large-scale genomic data. Approximate heuristic methods (e.g. BLAST) are commonly used. However, they are suboptimal and still compute-intensive. In this work, we present BioSEAL, a biological sequence alignment accelerator. BioSEAL is a massively parallel non-von Neumann processing-in-memory architecture for large-scale DNA and protein sequence alignment. BioSEAL is based on resistive content addressable memory, capable of energy-efficient and highperformance associative processing. We present an associative processing algorithm for entire database sequence alignment on BioSEAL and compare its performance and power consumption with state-of-art solutions. We show that BioSEAL can achieve up to 57× speedup and 156× better energy efficiency, compared with existing solutions for genome sequence alignment and protein sequence database search.

查看原文本刊更多论文

大规模基因组数据的内存生物序列比对加速器

基因组序列包含数亿个DNA碱基对。发现两个基因组之间的相似程度需要执行计算密集型动态规划算法，比如史密斯-沃特曼算法。传统的冯诺依曼架构并行性有限，无法为大规模基因组数据提供有效的解决方案。近似启发式方法(例如BLAST)是常用的方法。然而，它们不是最优的，并且仍然是计算密集型的。在这项工作中，我们提出了BioSEAL，一个生物序列比对加速器。BioSEAL是一种大规模并行非冯·诺伊曼处理内存架构，用于大规模DNA和蛋白质序列比对。BioSEAL是基于电阻内容可寻址存储器，能够高效节能和高性能的联想处理。我们提出了一种用于BioSEAL上整个数据库序列对齐的关联处理算法，并将其性能和功耗与最先进的解决方案进行了比较。研究表明，与现有的基因组序列比对和蛋白质序列数据库搜索解决方案相比，BioSEAL可以实现高达57倍的加速和156倍的能效提升。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 13th ACM International Systems and Storage Conference

自引率

0.00%

发文量