FQSqueezer基因组压缩新方法的加速实现

Monica Amich, P. D. Luca, S. Fiscale
{"title":"FQSqueezer基因组压缩新方法的加速实现","authors":"Monica Amich, P. D. Luca, S. Fiscale","doi":"10.1109/ISPDC51135.2020.00030","DOIUrl":null,"url":null,"abstract":"Biological data contain very important information for genoma analysis. In last decades, the size of these data is constantly growing. So the Next Generation Sequence (NGS) data has been introduced. These kind of data are represented by different data formats, such as FASTQ, FASTA, SAM, etc. In order to allow a good analysis and storing of them, due to large dimension of these data, several compressors have been performed. FQSqueezer is a novel genomic compressor for FASTQ data files. But several issues are present due to multithread version that runs on multi-core hardware. It is wellknown that the number of cores in a CPU is limited and very minor with respect to GPUs’ cores number. In order to increase the performance related to this compressor method, in this work we present a GPU-parallel implementation of cited compressor by exploiting CUDA framework. More precisely, a suitable domain decomposition is able to give an appreciable gain of performance in terms of time and reliability. Several execution tests confirm the gain of efficiency achieved by our parallel implementation.","PeriodicalId":426824,"journal":{"name":"2020 19th International Symposium on Parallel and Distributed Computing (ISPDC)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Accelerated implementation of FQSqueezer novel genomic compression method\",\"authors\":\"Monica Amich, P. D. Luca, S. Fiscale\",\"doi\":\"10.1109/ISPDC51135.2020.00030\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Biological data contain very important information for genoma analysis. In last decades, the size of these data is constantly growing. So the Next Generation Sequence (NGS) data has been introduced. These kind of data are represented by different data formats, such as FASTQ, FASTA, SAM, etc. In order to allow a good analysis and storing of them, due to large dimension of these data, several compressors have been performed. FQSqueezer is a novel genomic compressor for FASTQ data files. But several issues are present due to multithread version that runs on multi-core hardware. It is wellknown that the number of cores in a CPU is limited and very minor with respect to GPUs’ cores number. In order to increase the performance related to this compressor method, in this work we present a GPU-parallel implementation of cited compressor by exploiting CUDA framework. More precisely, a suitable domain decomposition is able to give an appreciable gain of performance in terms of time and reliability. Several execution tests confirm the gain of efficiency achieved by our parallel implementation.\",\"PeriodicalId\":426824,\"journal\":{\"name\":\"2020 19th International Symposium on Parallel and Distributed Computing (ISPDC)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 19th International Symposium on Parallel and Distributed Computing (ISPDC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISPDC51135.2020.00030\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 19th International Symposium on Parallel and Distributed Computing (ISPDC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISPDC51135.2020.00030","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

生物学数据包含了基因组分析的重要信息。在过去的几十年里,这些数据的规模不断增长。因此,引入了下一代序列(NGS)数据。这类数据由不同的数据格式表示,如FASTQ、FASTA、SAM等。为了对这些数据进行良好的分析和存储,由于这些数据的维度很大,我们进行了多次压缩。FQSqueezer是一种用于FASTQ数据文件的新型基因组压缩器。但是由于在多核硬件上运行的多线程版本存在一些问题。众所周知,CPU的核心数量是有限的,相对于gpu的核心数量来说非常小。为了提高与此压缩器方法相关的性能,在这项工作中,我们提出了一个利用CUDA框架的gpu并行实现引用压缩器。更准确地说,适当的域分解能够在时间和可靠性方面获得可观的性能增益。几个执行测试证实了我们的并行实现所获得的效率增益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Accelerated implementation of FQSqueezer novel genomic compression method
Biological data contain very important information for genoma analysis. In last decades, the size of these data is constantly growing. So the Next Generation Sequence (NGS) data has been introduced. These kind of data are represented by different data formats, such as FASTQ, FASTA, SAM, etc. In order to allow a good analysis and storing of them, due to large dimension of these data, several compressors have been performed. FQSqueezer is a novel genomic compressor for FASTQ data files. But several issues are present due to multithread version that runs on multi-core hardware. It is wellknown that the number of cores in a CPU is limited and very minor with respect to GPUs’ cores number. In order to increase the performance related to this compressor method, in this work we present a GPU-parallel implementation of cited compressor by exploiting CUDA framework. More precisely, a suitable domain decomposition is able to give an appreciable gain of performance in terms of time and reliability. Several execution tests confirm the gain of efficiency achieved by our parallel implementation.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信